Error Encountered When Using max_tokens Parameter with GPT-4 API

Welcome to the OpenAI community @defendershow

The reason you’re seeing `length` as the `finish_reason` is that your input is large enough to consume most of the model’s context length, so the generated response gets truncated.

The `max_tokens` value is checked before sampling, and in your case:
input tokens + max_tokens > context length
Hence the request fails with a 400 error.
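As a rough illustration, you can estimate your prompt’s token count and cap `max_tokens` so the sum stays inside the context window (8,192 tokens for the base gpt-4 model). The 4-characters-per-token ratio below is only a crude heuristic, and `safe_max_tokens` is a hypothetical helper; for exact counts, use the `tiktoken` library (`tiktoken.encoding_for_model("gpt-4")`).

```python
# Sketch: keep (input tokens + max_tokens) within the context window.
# The 4-chars-per-token ratio is a rough estimate, not an exact count;
# use tiktoken for real token counts before relying on this.

CONTEXT_WINDOW = 8192  # base gpt-4 context length

def safe_max_tokens(prompt: str, reserve: int = 50) -> int:
    """Largest max_tokens that should avoid the 400 error,
    with a small reserve as a safety margin."""
    est_input_tokens = len(prompt) // 4 + 1  # crude estimate
    remaining = CONTEXT_WINDOW - est_input_tokens - reserve
    return max(remaining, 0)  # 0 means the prompt itself is too large

prompt = "Summarize the following report: ..." * 200
print(safe_max_tokens(prompt))
```

If this returns 0, the prompt alone already fills the context window and must be shortened before any completion can fit.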

Also, in the case of chat completions, if you omit `max_tokens`, it automatically defaults to whatever context length remains after your input.
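In practice, that means you can simply leave `max_tokens` out of the request and let the API use the remaining context. A minimal sketch of the two request payloads (the messages content here is just a placeholder):

```python
# Two chat-completion payloads: one with an explicit max_tokens,
# one omitting it so the API defaults to the remaining context length.

messages = [{"role": "user", "content": "Summarize this document ..."}]

explicit = {
    "model": "gpt-4",
    "messages": messages,
    "max_tokens": 500,  # may trigger a 400 if input + 500 > context length
}

defaulted = {
    "model": "gpt-4",
    "messages": messages,
    # no max_tokens key: the model can use all remaining context
}

print("max_tokens" in explicit, "max_tokens" in defaulted)
```

Omitting the key avoids the 400 error entirely, at the cost of giving up an explicit cap on the response length.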