Problem with context token for gpt-3.5-turbo-0125

I fine-tuned a model based on gpt-3.5-turbo-0125, which is documented to have a 16,385-token context window. However, I got the error below when I set max_tokens to 15000. Is something wrong?

openai.BadRequestError: Error code: 400 - {'error': {'message': 'max_tokens is too large: 15000. This model supports at most 4096 completion tokens, whereas you provided 15000.', 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': None}}

The good news is that nothing is wrong :slight_smile:

The max_tokens parameter controls the maximum number of output tokens the model can produce. For gpt-3.5-turbo, along with most other models, the maximum output is limited to 4,096 tokens.

The 16,385 tokens refers to the context window, which is the sum of the input and output tokens, so a long prompt further reduces the room left for the completion.
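The interplay between the two limits can be sketched as follows. The numbers come from the error message and the documented context window; the helper function itself is hypothetical, just illustrating how to pick the largest valid max_tokens for a given prompt length:

```python
# Limits for gpt-3.5-turbo-0125 (and fine-tunes based on it)
CONTEXT_WINDOW = 16_385        # input + output tokens combined
MAX_COMPLETION_TOKENS = 4_096  # hard cap on output tokens alone

def allowed_max_tokens(prompt_tokens: int) -> int:
    """Largest valid max_tokens for a prompt of the given length."""
    room_in_context = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_COMPLETION_TOKENS, room_in_context))

print(allowed_max_tokens(1_000))   # 4096 -- the output cap binds
print(allowed_max_tokens(14_000))  # 2385 -- the context window binds
```

In other words, requesting max_tokens=15000 can never succeed on this model: even with an empty prompt, the completion is capped at 4,096 tokens.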