How exactly do I make a function that sends a prompt to an already created Assistant and gives back just the answer, say as a string, with no metadata or anything? And if I ask questions in a loop, will they all be in the same thread? If not, how do I make it so? And can I ask it to create a new thread?
Sorry if this is a dumb question or already has been asked. Thank you!
The OpenAI API only returns JSON response objects from language models over REST.
Assistants are multi-step: you create a thread and note its thread ID, then place a message in that thread. Next you submit a run with the assistant ID and the thread ID, and get back a run ID. You have to keep polling the run ID until its status shows it has completed. Finally, you retrieve the latest message from the thread and parse the response text out of the return object.
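The steps above can be sketched as one helper function. This is a minimal sketch assuming the official `openai` Python SDK's beta Assistants endpoints; the helper name `ask_assistant` and the polling interval are our own choices, and error handling is kept to the bare minimum.

```python
import time

def ask_assistant(client, assistant_id, prompt, thread_id=None):
    """Send `prompt` to an existing assistant; return (answer_text, thread_id).

    Reusing the returned `thread_id` on later calls keeps the conversation
    in the same thread; passing None starts a fresh one.
    """
    if thread_id is None:
        thread_id = client.beta.threads.create().id        # new conversation
    client.beta.threads.messages.create(
        thread_id=thread_id, role="user", content=prompt)  # add user message
    run = client.beta.threads.runs.create(
        thread_id=thread_id, assistant_id=assistant_id)    # start a run
    # Keep polling the run until it reaches a terminal status.
    while run.status not in ("completed", "failed", "cancelled", "expired"):
        time.sleep(0.5)
        run = client.beta.threads.runs.retrieve(
            thread_id=thread_id, run_id=run.id)
    if run.status != "completed":
        raise RuntimeError(f"run ended with status {run.status}")
    messages = client.beta.threads.messages.list(thread_id=thread_id)
    # The newest message comes first; pull just the text out of its content.
    return messages.data[0].content[0].text.value, thread_id
```

Calling it in a loop and passing the returned `thread_id` back in keeps every question in the same thread, e.g. `answer, tid = ask_assistant(client, "asst_x", question, thread_id=tid)` (where `asst_x` stands in for your real assistant ID).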
A thread is like a ChatGPT conversation: it keeps accumulating user messages and AI responses, at growing expense.
The documentation link on the sidebar has an example multi-step flow just for making one request and ending.
If you just want to send input and receive a response, the chat completions endpoint is the place for that, with only one object from which to extract the response for the user.
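For comparison, the single-request chat completions flow is this short. A minimal sketch, again assuming the `openai` Python SDK; `ask_once` is our own name:

```python
def ask_once(client, model, prompt):
    """One-shot question via chat completions: one call, one object to parse."""
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    # The answer text lives directly on the first choice's message.
    return resp.choices[0].message.content
```

Note there is no thread, run, or polling here; statelessness is the trade-off for the simpler flow, so you must resend any prior turns yourself if you want conversation memory.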
Yeah okay, I think I figured it out more or less, with the help of some other sample code. But I have another question: I've been testing it out and the costs are kind of crazy for just a few questions. For example, ~20 questions against gpt-3.5-turbo-1106 via the API, around ~15,000 tokens, is already about 13 cents. Am I doing something wrong, or is it supposed to be like this?
Is it because of thread stacking? By the way, not all 20 questions were in one go. Is there any way to fix this?
If you do not want to pay input tokens for everything you sent before and everything the AI said before, you can either use the API's truncation_strategy run parameter to limit the number of past chat turns taken from a thread, or you can abandon the thread.