So I'm wondering what the best way is to handle large amounts of data with gpt-3.5-turbo. If I, for example, store responses in an array, would the best solution be to delete early responses to free up token space for new ones?
And what happens when the token limit is exceeded? If I send a message with an array that exceeds the token limit, will I get charged for the array I sent without getting a response, or will the fetch just be cancelled?
I have the same questions.
As far as I know, if you want to keep context with the GPT API, you need to append your prompts and the responses to an array and send that array with each ChatCompletion call. Each response is a dictionary like:
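Something along these lines, in the chat completions message format (a sketch; the example content is made up):

```python
# Shape of one assistant message as appended to the running
# conversation (chat completions "role"/"content" format):
assistant_message = {
    "role": "assistant",
    "content": "Paris is the capital of France.",
}
```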
While the user input is:
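Roughly like this, with the whole conversation kept in one list that is resent on every request so the model retains context (a sketch; the client call is shown commented out, and the contents are placeholders):

```python
# The running conversation: one dict per message, oldest first.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of France?"},
]

# A real call would look roughly like this, then the reply is
# appended so the next turn still has the full context:
# response = client.chat.completions.create(
#     model="gpt-3.5-turbo", messages=messages
# )
# messages.append({"role": "assistant",
#                  "content": response.choices[0].message.content})
```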
As far as I know, if the request has more tokens than supported, it will be dropped and you should not be charged, but someone should verify this.
If you hit this limit, you can technically remove the first element of the array to save tokens and then try again, but you lose some of the context. Alternatively, I've heard you may be able to use another library or a separate GPT API call to summarize the context and shorten the array that way.
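The drop-the-oldest approach could be sketched like this. The `max_messages` cap and the helper name are my own choices, and a real implementation would count tokens (e.g. with a tokenizer) rather than messages:

```python
def trim_context(messages, max_messages=20):
    """Drop the oldest non-system messages when the list grows too long.

    Rough sketch: keeps any system messages, then keeps only the most
    recent turns so the total stays under max_messages (an arbitrary
    cap, not an official limit).
    """
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    keep = max_messages - len(system)
    return system + rest[-keep:]
```

You would call `trim_context(messages)` before each request (or after catching a context-length error) and resend the trimmed list.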
If someone has a better idea I’d be glad to hear it.