I can hit the API with a request and have an error like this barfed back at me: openai.error.InvalidRequestError: This model’s maximum context length is 4097 tokens, however you requested 6227 tokens (2137 in your prompt; 4090 for the completion)
And I can put that exact same request into the ChatGPT web interface and get a successful output.
I’d like to use the API as opposed to building some Selenium jank - any solutions?
ChatGPT automatically truncates prompt text that is too long; the API instead tells you when the prompt is too long. As a programmer you should handle errors like this yourself by applying your own truncation or summarisation method. You can use tiktoken (https://github.com/openai/tiktoken), OpenAI's fast BPE tokeniser, to calculate your text length in tokens, so you don't go over the limit and can handle that occurrence gracefully.
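For example, here is a minimal sketch of a pre-flight check with tiktoken. The 4097-token context limit comes from the error message above; the model name, completion budget, and helper name are illustrative assumptions:

```python
import tiktoken

MODEL = "gpt-3.5-turbo"   # assumed model; the 4097 limit matches the error above
CONTEXT_LIMIT = 4097      # max context length reported by the API
MAX_COMPLETION = 1024     # tokens to reserve for the completion (illustrative)

enc = tiktoken.encoding_for_model(MODEL)

def truncate_prompt(prompt: str) -> str:
    """Trim the prompt so prompt tokens + completion budget fit the context."""
    budget = CONTEXT_LIMIT - MAX_COMPLETION
    tokens = enc.encode(prompt)
    if len(tokens) <= budget:
        return prompt
    # Keep the first `budget` tokens and decode back to text.
    return enc.decode(tokens[:budget])
```

Counting tokens up front means you can decide per request whether to truncate, summarise, or split the input, instead of waiting for the API to reject it.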
You should also do that when you fire off multiple requests in parallel. After some debugging I just found out why I sometimes get timeouts on 50 requests while at other times 100 go through without any problems: the limit is on tokens, not on the number of requests.
I have a TPM (tokens per minute) limit of 120k on Azure, and I am sending code to the model for evaluation against a large catalog of weighted criteria.
And yeah, the code snippets vary in size, so sometimes the token limit is reached after just a few requests.
So I have to take up to my manually set [process limit per minute] of code snippets, generate the prompts for them, calculate the overall reserved tokens each one needs (tiktoken-counted request tokens + max_tokens), and sum those up iteratively to see which snippets should be added to the evaluation stack.
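A sketch of that batching logic, assuming the 120k TPM budget from above; build_prompt, the max_tokens value, and the model name are hypothetical stand-ins:

```python
import tiktoken

TPM_LIMIT = 120_000   # Azure tokens-per-minute quota mentioned above
MAX_TOKENS = 1_000    # completion budget reserved per request (illustrative)

enc = tiktoken.encoding_for_model("gpt-3.5-turbo")  # assumed model

def build_prompt(snippet: str) -> str:
    # Hypothetical: wraps a code snippet in the evaluation-criteria prompt.
    return f"Evaluate the following code:\n{snippet}"

def next_batch(snippets: list[str]) -> tuple[list[str], list[str]]:
    """Greedily fill one minute's evaluation stack under the TPM budget.

    Returns (batch to send this minute, remaining snippets for later).
    """
    batch, used = [], 0
    for i, snippet in enumerate(snippets):
        # Reserve prompt tokens plus the full completion budget per request.
        reserved = len(enc.encode(build_prompt(snippet))) + MAX_TOKENS
        if used + reserved > TPM_LIMIT:
            return batch, snippets[i:]
        batch.append(snippet)
        used += reserved
    return batch, []
```

Reserving max_tokens for every request is deliberately pessimistic: the completion may come back shorter, but budgeting for the worst case keeps you under the quota.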