Assistants API Token Inconsistencies

eth.barr · January 25, 2024, 5:58pm

I’m having trouble with the assistant api. This is how I’m initializing it:

updated_assistant = client.beta.assistants.update(
                    assistant_id=assistant_id,
                    model=model,
                    file_ids=[file_id]
                )

I’m updating the model, assistant id, and file id based on user information. The only relevant thing that changes is the model, I switch between gpt-3.5-turbo-1106 and gpt-4-1106-preview. When using GPT 3.5 Turbo, everything works as expected. I get correct answers from the uploaded file, and the total token cost is only ~1000. When I switch to GPT 4 Turbo, however, I get strange results. While it still answers questions correctly, it consistently uses over 4000 tokens to do so, making it ridiculously more expensive to use than GPT 3.5 Turbo. The code is the same for each model, the only thing that changes is the model used.

Topic		Replies	Views
Pricing of Assistant API misleading API	1	1917	December 11, 2023
Assistant API + gpt4o + filesearch uses more tokens then gpt3.5 API assistants-api	1	92	July 5, 2024
GPT-4o / GPT-4 API pricing differences when using API/Playground API gpt-4	4	6854	May 23, 2024
Assistants API token usage and pricing breakdown clarification API gpt-4 , api , assistants	10	9028	February 6, 2024
Using Assistant API GPT-4o with File Search enabled automatically ups the tokens used by 3.5k Bugs api , assistants-api , gpt-4o	2	338	June 27, 2024

Assistants API Token Inconsistencies

Related Topics