Since yesterday, my costs have inexplicably exploded. I am categorizing job titles, and each call costs approximately 1.1k context tokens (I am tracking the tokens myself). I am using GPT-3.5 Turbo.
Today, however, I categorized around 1,500 jobs, which cost a total of 5.8 million context tokens. It should be roughly 1.5 million, so I was billed almost four times what I expected.
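For reference, this is roughly how I track tokens on my side: a minimal sketch with an illustrative prompt and function name, comparing my own tiktoken estimate against the `usage` the API reports per call.

```python
# Sketch: cross-check my own token estimate against what the API bills.
# The prompt and categorize() are illustrative, not my exact code.
import tiktoken
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

def categorize(job_title: str) -> None:
    prompt = f"Categorize this job title: {job_title}"
    estimated = len(enc.encode(prompt))  # my own prompt-token estimate

    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    billed = response.usage.prompt_tokens  # what OpenAI says it billed

    # Flag calls where the billed count diverges wildly from the estimate
    if billed > estimated * 2:
        print(f"Suspicious call: estimated {estimated}, billed {billed}")
```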
Is this all purely internal work, as in you have no other employees or anyone else with access to your API keys? And the keys are not used in any public application?
Do you use Assistants or any of the new features, or is this all based on traditional API calls?
I had the same experience yesterday, with roughly 40 million tokens burned in just a few minutes. Multiple threads were generating messages in a loop. You might want to check your runs for the messages and see whether they were re-created multiple times.
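If it helps, this is roughly how I checked mine: a sketch that lists the messages in one Assistants thread and counts duplicated assistant outputs. The thread ID is a placeholder for one of your own threads.

```python
# Sketch: look for assistant messages that were re-created in a loop.
# "thread_abc123" is a placeholder thread ID.
from collections import Counter
from openai import OpenAI

client = OpenAI()

messages = client.beta.threads.messages.list(thread_id="thread_abc123")
contents = [
    block.text.value
    for m in messages.data
    if m.role == "assistant"
    for block in m.content
    if block.type == "text"
]

# If the same content shows up many times, the run generated it repeatedly
for text, count in Counter(contents).most_common(5):
    if count > 1:
        print(f"{count}x: {text[:80]}...")
```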
I switched to Azure a few hours ago and it works as expected. Doing exactly the same request, but on Azure, the token count looks normal again.
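For comparison, this is essentially the same call routed through Azure instead; endpoint, key, API version, and deployment name below are placeholders you would swap for your own.

```python
# Sketch: the identical request via Azure OpenAI, where the billed token
# counts matched my expectations. All credentials here are placeholders.
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://YOUR-RESOURCE.openai.azure.com",
    api_key="YOUR-AZURE-KEY",
    api_version="2023-12-01-preview",
)

response = client.chat.completions.create(
    model="my-gpt-deployment",  # your Azure deployment name
    messages=[{"role": "user", "content": "Categorize this job title: Data Engineer"}],
)
print(response.usage.prompt_tokens, response.usage.completion_tokens)
```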
So there seems to be an issue with the way OpenAI counts the tokens.
I wonder how this could even fly under the radar while OpenAI simply ignores support messages. To me, it looks like they are not taking their business seriously. It feels like a scam, or at least a rip-off.
I am having a similar issue and am very upset about it, although luckily I have usage caps set, so it is not costing me that much money. Still, it is very concerning. I am using a WordPress plugin on my website to make requests to GPT-4 Turbo, and the tokens used do not at all match the actual usage reported by the plugin (it is like 7 or 8 times what I expect). I have been discussing this with the plugin creator, but now I'm not sure it is their fault; it seems this may be an OpenAI issue?
Yeah, it really does seem to land consistently in the range of 7 to 8 times the normal token usage. Some messages work normally and some are generated 20 times… and the average then comes out around 8x. That's what I have observed. A team around @nikunj will be investigating the issue, so it's good to see that it's acknowledged as an issue.
Glad to see they are looking into it. Just to add more clarity: the model version I am seeing this issue with is listed as "GPT-4-1106-preview" on the https://platform.openai.com/usage page.
Hello @nikunj, is there any way I can be added to any updates relating to this issue? I have had to pause all my API usage for the moment because of it, so I would like to know as soon as it's resolved. Thanks.
We've seen this happen before when someone expects the API to return JSON and has retry code for when it doesn't: the model started replying with the JSON wrapped in ```json fences, resulting in the retry logic being hit far more often than before.
So check whether this is the case; you can try using JSON mode to avoid this issue.
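Something like this, as a rough sketch (model name and prompt illustrative): request JSON mode where the model supports it, and strip a stray fence before parsing so the retry path only fires on genuine failures.

```python
# Sketch: JSON mode plus a defensive fence-strip, so a ```json-wrapped
# reply is parsed instead of triggering a retry loop.
import json
from openai import OpenAI

client = OpenAI()

def get_json(prompt: str) -> dict:
    response = client.chat.completions.create(
        model="gpt-4-1106-preview",
        response_format={"type": "json_object"},  # JSON mode
        messages=[
            {"role": "system", "content": "Reply with a JSON object."},
            {"role": "user", "content": prompt},
        ],
    )
    text = response.choices[0].message.content.strip()

    # Fallback: tolerate a fenced reply rather than retrying the call
    if text.startswith("```"):
        text = text.strip("`").removeprefix("json").strip()
    return json.loads(text)
```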
They "only" started looking into it this week. So far I have not heard back from them, but they made me aware that it could take some weeks to understand the underlying issue.
We do, in fact, output JSON, and we were having these issues with content being re-generated in a loop. Sadly, JSON mode is only available in the Chat Completions API and not in the Assistants API.