Assistant API token Usage - Token usage more than the whole attached file Plus prompts

_j · January 22, 2024, 2:28am

The documentation answers that the Assistants agent framework pays no mind to your budget…

Retrieval currently optimizes for quality by adding all relevant content to the context of model calls. We plan to introduce other retrieval strategies to enable developers to choose a different tradeoff between retrieval quality and model usage cost.

“All relevant content” = all that will fit in the model’s context length.

The assistant and its internal functions for retrieval and other tools has its own language that also consumes tokens.

Topic		Replies	Views
Assistant API - What are Context Tokens in the Billing calculation? API assistants	24	12437	May 6, 2024
Assistant API - way too much "input" tokens used API assistants-api , assistants-pricing	7	4689	September 6, 2024
Seeking Advice on Reducing Costs for RAG Chatbot Using File Search Assistant API api	4	1011	July 6, 2024
Assistants API token usage and pricing breakdown clarification API gpt-4 , api , assistants	10	10503	February 6, 2024
Why are my context tokens used so quickly? API api	3	2814	January 5, 2024

Assistant API token Usage - Token usage more than the whole attached file Plus prompts

Related topics