Clarification for max_tokens

sps · March 9, 2023, 5:24pm

Hi @nashid.noor, @overbeck.christopher and @kathyh

Every model has a context length. It cannot be exceeded.

As I shared above max_tokens only specifies the max number of tokens to generate in the completion, it is not necessarily the amount that will get generated.

However, if the sum of tokens in prompt + max_tokens exceeds the context length of the model, the request will be considered invalid and you’ll get a 400.

e.g

This model's maximum context length is 4096 tokens. However, you requested 4157 tokens (62 in the messages, 4095 in the completion). Please reduce the length of the messages or completion.

Topic		Replies	Views
Question regarding max_tokens Prompting	11	38126	December 13, 2023
Doubt on prompt tokens and completion tokens API api	2	1345	April 18, 2024
I need help using openai API API chatgpt , gpt-4o-mini	2	246	October 29, 2024
Max_tokens seems to do nothing for me 3.5 Turbo API	14	3351	December 18, 2023
Not allowed to have all 8192 tokens API gpt-4	16	11590	December 18, 2023

Clarification for max_tokens

Related topics