How Do We Get Charged: Exceeded Maximum Token Length

bickster · July 24, 2023, 10:21pm

Hi,

When the API responds that you’ve exceeded the maximum token length for a specific model, do we get charged for the number of tokens in the request prompt plus the response tokens that were never returned? Or do we get charged just for the request prompt? Or do we not get charged at all when the error occurs?

Thanks,
Chris

_j · July 24, 2023, 10:53pm

The request never makes it to an AI model; it is stopped by the tokenizer and endpoint. So it makes no sense to be billed for it any more than for any other input error you created. Otherwise I’d be on the hook for 12345678 max_tokens.
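To illustrate the kind of check described above, here is a minimal sketch of a client-side pre-flight test that compares prompt tokens plus the requested `max_tokens` against a context window. Everything named here is an assumption for illustration: `rough_token_count` is a crude stand-in for a real tokenizer, `fits_context` is a hypothetical helper, and the 16,384 window is assumed for a 16k model. It also ignores any per-message overhead the chat format may add.

```python
from typing import Callable

# Assumed context window for a 16k model; check the model's documentation.
CONTEXT_WINDOW = 16_384


def rough_token_count(text: str) -> int:
    """Crude stand-in for a real tokenizer: roughly 4 characters per token."""
    return max(1, len(text) // 4)


def fits_context(
    prompt: str,
    max_tokens: int,
    count_tokens: Callable[[str], int] = rough_token_count,
    context_window: int = CONTEXT_WINDOW,
) -> bool:
    """Return True if prompt tokens plus requested completion tokens fit."""
    return count_tokens(prompt) + max_tokens <= context_window


if __name__ == "__main__":
    print(fits_context("Say hello.", 100))       # small request
    print(fits_context("Say hello.", 12345678))  # oversized max_tokens
```

Passing a real tokenizer's counting function as `count_tokens` would give a tighter estimate than the 4-characters-per-token guess.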

ps: I’ve got a new billing game to play: minimum input, maximum output. Only costs $0.07 if you win.

gpt-3.5-turbo-16k-0613, 1 request
11 prompt + 4,813 completion = 4,824 tokens
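For the arithmetic behind those figures, a short sketch follows. The per-1K prices are an assumption (what I believe the mid-2023 list prices for gpt-3.5-turbo-16k were: $0.003 input, $0.004 output), and `request_cost` is a hypothetical helper, so check the current pricing page before relying on the numbers.

```python
# Assumed mid-2023 list prices for gpt-3.5-turbo-16k, USD per 1,000 tokens.
INPUT_PRICE_PER_1K = 0.003
OUTPUT_PRICE_PER_1K = 0.004


def request_cost(
    prompt_tokens: int,
    completion_tokens: int,
    input_price: float = INPUT_PRICE_PER_1K,
    output_price: float = OUTPUT_PRICE_PER_1K,
) -> float:
    """Cost in USD of one request, billed separately for input and output."""
    return (
        prompt_tokens / 1000 * input_price
        + completion_tokens / 1000 * output_price
    )


if __name__ == "__main__":
    # The request shown above: 11 prompt + 4,813 completion tokens.
    print(f"${request_cost(11, 4813):.4f}")
    # A "winning" run that fills an assumed 16,384-token window.
    print(f"${request_cost(11, 16_384 - 11):.4f}")
```

Under those assumed prices, the request shown comes to about two cents, and filling the whole window with output lands near the $0.07 mentioned.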

Aimjock · July 25, 2023, 12:54am

Wow, I can’t even begin to imagine what prompt could give that result!

6xmrkp4vqp · August 29, 2023, 12:31pm

Thanks for your response, @_j. How do you know, though? Are there any official resources that give more information on this?

_j · August 29, 2023, 8:00pm

One can immediately make a bad call and see that it is not added to the every-five-minute display in your daily usage.
