Gpt-3.5-turbo-1106 real max tokens

Is the documentation wrong, or how should I understand this?
For gpt-3.5-turbo-1106 it says 16,385 max tokens.

In reality, we get an error response saying that gpt-3.5-turbo-1106 supports only 4,096 tokens.

16k is the context size limit, including both input + output tokens.

4k is the limit on output tokens. Make sure you set max_tokens <= 4096 and that input_tokens + max_tokens <= 16385 when you call the API.
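As a sketch of that budgeting rule, you can clamp max_tokens before making the call. The constants below come from this thread; the helper name and logic are illustrative, not part of the OpenAI SDK (you would measure input_tokens yourself, e.g. with tiktoken):

```python
CONTEXT_LIMIT = 16385  # total context window for gpt-3.5-turbo-1106 (input + output)
OUTPUT_LIMIT = 4096    # maximum completion (output) tokens

def clamp_max_tokens(input_tokens: int, requested: int) -> int:
    """Return the largest max_tokens value the API should accept.

    Illustrative helper, not part of any SDK: caps the request at both
    the 4,096 output limit and the context space left after the prompt.
    """
    remaining = CONTEXT_LIMIT - input_tokens
    if remaining <= 0:
        raise ValueError("prompt alone exceeds the 16,385-token context window")
    return min(requested, OUTPUT_LIMIT, remaining)

print(clamp_max_tokens(1000, 8000))   # capped by the output limit -> 4096
print(clamp_max_tokens(14000, 4096))  # capped by remaining context -> 2385
```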


Ah yes, I see, you are right. I found the error in our request.