Estimate token count using structured output

thomas246 · October 1, 2024, 1:51pm

Hi, is it possible to estimate the number of input tokens used by a pydantic model? It would be useful to estimate the cost in advance before sending a request to the API. Thanks!

platypus · October 1, 2024, 2:55pm

Hi @thomas246 !

You could estimate the “minimum cost” by passing your JSON schema through tiktoken to get its token count. The cost will never be less than this, and it gives you a starting point for reasoning about the total cost, i.e. JSON_SCHEMA + X. Here X depends on what your schema expects (e.g. arrays, open-ended strings, etc.). You could come up with some “sample” values, pass those through tiktoken as well, and use the sum as a rough baseline to compare against. You can then draw an error band around this number, monitor the number of tokens actually returned, and flag any outliers.
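A minimal sketch of that idea in Python, under a few assumptions that are not from this thread: the function names (`count_tokens`, `estimate_tokens`), the `o200k_base` encoding, and the 25% margin are all illustrative choices. The schema is passed in as a plain dict; with Pydantic v2 you could obtain it via `YourModel.model_json_schema()`. How the API serializes the schema internally is not covered here, so treat the result as an approximation rather than an exact count.

```python
import json
import math
from typing import Callable, Optional


def _default_encoder(encoding_name: str = "o200k_base") -> Callable[[str], list]:
    # Lazy import: tiktoken is only needed when no custom encoder is supplied.
    # The encoding name is an assumption; pick the one matching your model.
    import tiktoken
    return tiktoken.get_encoding(encoding_name).encode


def count_tokens(text: str, encode: Optional[Callable[[str], list]] = None) -> int:
    # Number of tokens the encoder produces for the given text.
    encode = encode or _default_encoder()
    return len(encode(text))


def estimate_tokens(schema: dict, sample: dict,
                    encode: Optional[Callable[[str], list]] = None,
                    margin: float = 0.25) -> dict:
    # JSON_SCHEMA part: the floor described above.
    schema_tokens = count_tokens(json.dumps(schema, separators=(",", ":")), encode)
    # X part: a hand-written sample value that fits the schema.
    sample_tokens = count_tokens(json.dumps(sample, separators=(",", ":")), encode)
    baseline = schema_tokens + sample_tokens
    return {
        "schema_tokens": schema_tokens,
        "sample_tokens": sample_tokens,
        "baseline": baseline,
        # Hypothetical error band: flag requests whose usage exceeds this.
        "upper_band": math.ceil(baseline * (1 + margin)),
    }
```

You would call `estimate_tokens(schema, sample)` once per schema, then compare the token usage reported on real responses against `upper_band` to spot outliers.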
