Each chat completions message is wrapped in a container of special tokens along with the role name, and the AI is then prompted with additional tokens marking where it should write its answer.
That gives a token overhead of 7 tokens for the first message, and 4 for each additional one.
To picture this, imagine that the % symbol represents the special tokens (which you can't send yourself) that are injected around your text. An API call that tells the AI what it is and what the user wants would be received by the AI model internally as:
%system%You are TokenBot%%user%Say Hello%%assistant%
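Based on the figures above, the wrapper overhead for a whole call can be sketched as a small helper. This is a rough estimate using the 7-plus-4 numbers from this post, not an exact tokenizer count, and the function name is just for illustration:

```python
def chat_overhead_tokens(num_messages: int) -> int:
    """Estimate the wrapper-token overhead of a chat completions call.

    Assumes ~7 overhead tokens for the first message (its container
    plus the tokens priming the assistant's reply) and 4 for each
    additional message, per the figures above.
    """
    if num_messages <= 0:
        return 0
    return 7 + 4 * (num_messages - 1)

# The example above has a system message and a user message:
print(chat_overhead_tokens(2))  # 11 tokens of overhead
```

The actual content tokens of each message (e.g. "You are TokenBot") are counted on top of this overhead.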