Context tokens in the Assistants API

So, based on your explanations, the context token usage in a Thread grows at each interaction, because the AI model is memoryless and each interaction adds context to the Thread?

ContextUsage(t+1) = MIN(ContextUsage(t), MAX_CONTEXT_LENGTH) + (userInput(t+1) + functionCall(t+1) + ...)

Meaning that once the context tokens of a Thread cost $1 to send, every subsequent conversation call will cost at least $1?
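To make the growth concrete, here is a back-of-the-envelope simulation in plain Python (no API calls). The per-turn token counts and the $0.01 per 1K input tokens price are my own assumptions, based on the published gpt-4-turbo-preview pricing:

```python
# Back-of-the-envelope simulation of how context tokens accumulate
# across Runs in a Thread (plain Python, no API calls).

PRICE_PER_1K_INPUT = 0.01      # assumed gpt-4-turbo-preview input price, USD
MAX_CONTEXT_LENGTH = 128_000   # model context window, tokens

context = 2_000                # instructions + first user message (assumed)
for run in range(1, 15):
    # Every prior message in the Thread is re-sent on each Run,
    # so the entire accumulated context is billed again as input.
    cost = context / 1_000 * PRICE_PER_1K_INPUT
    print(f"run {run:2d}: context = {context:7,d} tokens -> input cost ${cost:.2f}")
    # The model's reply, tool outputs, and the next user message are
    # appended to the Thread; usage is capped at the context window.
    context = min(context + 10_000, MAX_CONTEXT_LENGTH)
```

With these assumed numbers, each call costs more than $1 in input tokens alone by about the eleventh run, and usage eventually plateaus at the 128K context window.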

EDIT

The answer to both questions is yes.

I read this thread, which answers all my questions. I hit a case in a debugging session, with no context token limitation on a Thread using the gpt-4-turbo-preview model, that burned all my credit ($20) because subsequent Run calls were each charged more than $1 …

This charging mechanism should be documented in detail somewhere in the OpenAI docs. My whole app workflow needs to be reworked to account for this growing Thread context mechanism.

I need to think about solutions to reduce a Thread's context, like summarizing the Thread's content when it reaches a certain limit and creating a new Thread with this summarized information as pre-prompt history context.
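A minimal sketch of that rollover idea, assuming the openai Python SDK v1 and the beta Assistants API. The MAX_CONTEXT_TOKENS budget, the summarization model, and the prompt wording are illustrative choices of mine, not an official pattern:

```python
from openai import OpenAI

client = OpenAI()
MAX_CONTEXT_TOKENS = 30_000  # arbitrary budget; tune to your cost tolerance

def rollover_thread(thread_id: str, last_usage: int) -> str:
    """If the Thread's context grew past the budget, summarize it and
    start a fresh Thread seeded with the summary. Returns the Thread id
    to use for the next Run (the old one if no rollover was needed)."""
    if last_usage < MAX_CONTEXT_TOKENS:
        return thread_id

    # Pull the conversation out of the old Thread
    # (paginate for very long Threads; this grabs one page).
    messages = client.beta.threads.messages.list(thread_id=thread_id, order="asc")
    transcript = "\n".join(
        f"{m.role}: {m.content[0].text.value}"
        for m in messages.data
        if m.content and m.content[0].type == "text"
    )

    # Summarize with a cheaper model so the summary itself stays inexpensive.
    summary = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system",
             "content": "Summarize this conversation, keeping every fact, "
                        "decision and open question."},
            {"role": "user", "content": transcript},
        ],
    ).choices[0].message.content

    # Seed a new Thread with the summary as pre-prompt history context.
    new_thread = client.beta.threads.create(
        messages=[{
            "role": "user",
            "content": f"Summary of the conversation so far:\n{summary}",
        }]
    )
    return new_thread.id
```

The last_usage value can be read from run.usage.prompt_tokens once the previous Run has completed, if your API version exposes Run usage. Depending on the SDK version, Runs may also accept a truncation_strategy parameter (e.g. last_messages) that limits how much of the Thread is sent per Run without summarizing.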
