Conversation context and quadratic billing

linus · March 29, 2023, 10:28am

as @paul.armstrong mentioned this is the case.

There are some strategies you could deploy to help you on this, for example: OpenAI API: chat completion pruning methods this is a great way to reduce tokens. Or to limit the resubmitted messages to the last 5 ones in your request.

Topic		Replies	Views
Retain past responses in memory without sending them again at every API request API gpt-4 , gpt-35-turbo , chatgpt	11	11201	January 25, 2024
A conversation using the API API	6	2870	December 16, 2023
Efficient stateful completion chatbot API	10	5259	July 9, 2024
Pricing, Billing and Tokens? Math is not adding up API api	9	2436	February 16, 2024
Impact of conversations on the number of tokens API gpt-4 , token , api-billing	5	3922	October 5, 2023

Conversation context and quadratic billing

Related topics