Why are my context tokens used so quickly?

The cost associated with the autonomous nature of agents has been an ongoing concern.

This was identified on the day of release:

And more the same week:

In the two months since, no controls have been implemented, other than your choice of model and its maximum context length.

The solution for budget control is to continue using the chat completions endpoint. It has the inconvenience that you must unpack “files” into text the AI can understand, but the advantage that you can make injected knowledge (search “RAG”) highly relevant to the current user input, instead of relying on repeated external calls to internal tools. You also control exactly how much chat history is actually sent.
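As a rough illustration of that approach, here is a minimal sketch of budget-controlled prompt assembly: retrieved knowledge is injected into the system message, and older chat turns are dropped once a history token budget is exhausted. All names are hypothetical, and the token estimate is a crude character heuristic; swap in a real tokenizer for production use.

```python
# Hypothetical sketch: assemble a chat completions request yourself,
# so you decide how many tokens of knowledge and history are sent.

def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token. Replace with a real
    # tokenizer (e.g. tiktoken) for accurate budgeting.
    return max(1, len(text) // 4)

def build_messages(system: str, rag_chunks: list[str], history: list[dict],
                   user_input: str, history_budget: int = 1000) -> list[dict]:
    # Inject only the retrieved chunks relevant to the current input,
    # rather than relying on repeated internal tool calls.
    knowledge = "\n\n".join(rag_chunks)
    messages = [{"role": "system",
                 "content": system + "\n\nKnowledge:\n" + knowledge}]

    # Walk history from newest to oldest, keeping turns until the
    # token budget runs out; older turns are silently dropped.
    kept, used = [], 0
    for turn in reversed(history):
        cost = estimate_tokens(turn["content"])
        if used + cost > history_budget:
            break
        kept.append(turn)
        used += cost
    messages.extend(reversed(kept))  # restore chronological order

    messages.append({"role": "user", "content": user_input})
    return messages
```

The resulting list is what you would pass as `messages` to a chat completions call; nothing outside this function can silently grow the prompt.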
