How can I adjust the length of the prompt so that it does not exceed the max tokens?

salemmo409 · July 25, 2023, 3:22pm

When some prompts exceed the max of tokens I receive an error, this happen with “chat” and “completion” endpoints, so how to be sure that the prompt will not exceed the max tokens?

Foxalabs · July 25, 2023, 4:13pm

Hi,

Typically you make use of the TikToken tokenizer to check how many tokens your prompt string is, then you add on a fixed amount for the internal workings, say 50 tokens worth and you should have an accurate measure of your prompt size.

from tiktoken import get_encoding

tokenizer = get_encoding("cl100k_base")

def tokenize():
    text = request.json['text']
    tokenized_text = tokenizer.encode(text)
    tokenized_text = [{'token': token, 'text': tokenizer.decode([token])} for token in tokenized_text]
    return jsonify(tokenized_text=tokenized_text)

curt.kennedy · July 25, 2023, 4:41pm

Another “hack” is to embed the data using ada-002, which uses the cl100k_base tokenizer. It will return the tokens used (and the embedding vector). Useful if you already plan on embedding.

If not using embeddings, its still a cheap and lightweight way to go (can be done with the API, using only requests to the API endpoint, without additional OpenAI libraries)

_j · July 25, 2023, 8:37pm

A typical scenario is that the chat user input reduces the number of past conversation turns you can pass.

Track each turn’s token use, and you know when adding the input and most recent turns in reverse order will exceed the input budget.

“Your input exceeded the maximum of 6400 tokens”
“Your huge prompt made us discard all but the last two questions”

Auto-adapting the max_tokens a bit is more sketchy, because you don’t want question 20 to be less fulfilling than question 1 by being accidentally chopped off.

Topic		Replies	Views
API token limitation differs from website UI token limitation API	4	634	December 18, 2023
Getting around "max_tokens" API	8	31684	December 12, 2023
Struggling with max_tokens and getting responses within a given limit, please help! API chatgpt	5	21360	October 28, 2023
Token limits on prompting Prompting plugin-development	4	2497	June 16, 2023
Encountered maximum token exceed exception via API call API	4	3860	December 18, 2023

How can I adjust the length of the prompt so that it does not exceed the max tokens?

Related topics