My task is to process a very big text (kind of a book). What is the best way to do that with completions? My basic plan is to split it into chunks and process them one by one. But how do I estimate the maximum size of chunk I can provide? Or is it easier to handle the error that the text is too long and split it only in that case?
You can measure the encoded length of a text passage with the tiktoken library. Then accumulate the sections you propose to send (split by sentence or paragraph) until adding the next one would push you past your token threshold.
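A minimal sketch of that idea, assuming tiktoken and the cl100k_base encoding; splitting on blank lines and the 2,000-token budget are illustrative choices, not requirements:

```python
import tiktoken

def chunk_by_paragraph(text: str, max_chunk_tokens: int = 2000) -> list[str]:
    """Greedily pack paragraphs into chunks that stay under a token budget."""
    enc = tiktoken.get_encoding("cl100k_base")
    chunks, current = [], ""
    for paragraph in text.split("\n\n"):
        candidate = (current + "\n\n" + paragraph) if current else paragraph
        if len(enc.encode(candidate)) > max_chunk_tokens and current:
            # The next paragraph would overflow the budget: close this chunk.
            chunks.append(current)
            current = paragraph
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

You could split on sentences instead of paragraphs if your paragraphs are long; the packing logic stays the same.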
Token usage of English writing can be estimated at roughly 4 characters per token (about three-quarters of a word), with that compression getting worse the further you stray from Germanic and Romance languages. So a 400,000-character book is on the order of 100,000 tokens.
Each model has a context length, and the response must also fit within it.
Newer OpenAI models also cap the response itself at 4k tokens. So if your task is “improve this text”, you’ll probably want to limit each chunk to around 2k tokens; even 1k will help keep the quality consistent across the length generated.
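For example, with the openai Python SDK you could cap the response per chunk like this (a sketch only; the model name, system prompt, and the 2,000-token cap are illustrative assumptions):

```python
from openai import OpenAI

client = OpenAI()

def improve_chunk(chunk: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4-turbo",
        messages=[
            {"role": "system", "content": "Improve the writing of the user's text."},
            {"role": "user", "content": chunk},
        ],
        max_tokens=2000,  # keep the response well under the model's output cap
    )
    return response.choices[0].message.content
```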
tiktoken shows me around 1 token per English word, or am I missing something? Also, about the 4k limit: what about GPT-4 Turbo with its 128k tokens?
All current OpenAI language models use the same token encoder.
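You can check this yourself with tiktoken (a small sketch; the model names listed are just examples of current chat models):

```python
import tiktoken

for model in ("gpt-3.5-turbo", "gpt-4", "gpt-4-turbo"):
    # encoding_for_model maps a model name to its tokenizer
    print(model, tiktoken.encoding_for_model(model).name)
# Each line should print "cl100k_base", i.e. the same encoder.
```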