You understood wrong. `max_tokens` is the limit on the length of the response you will get back. `max_tokens` also reserves that space in the context window exclusively for forming the response.
The context length of a model is first loaded with the input, and then the tokens that the AI generates are added after that, in the remaining space.
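Here is a minimal sketch of that budgeting math, using the `tiktoken` tokenizer to count prompt tokens. The 4096 context length and the model name are assumptions for illustration, and chat-format requests add a few tokens of per-message overhead that this ignores:

```python
import tiktoken

CONTEXT_LENGTH = 4096  # assumed context window for the model used here

def check_budget(prompt: str, max_tokens: int, model: str = "gpt-3.5-turbo") -> int:
    """Return how many context tokens remain after the prompt and the
    reserved response space, raising if the request cannot fit."""
    enc = tiktoken.encoding_for_model(model)
    prompt_tokens = len(enc.encode(prompt))
    # max_tokens is reserved for the response, so the prompt must fit
    # in whatever the reservation leaves over.
    if prompt_tokens + max_tokens > CONTEXT_LENGTH:
        raise ValueError(
            f"prompt uses {prompt_tokens} tokens; with max_tokens={max_tokens} "
            f"reserved, the total exceeds the {CONTEXT_LENGTH}-token context."
        )
    return CONTEXT_LENGTH - prompt_tokens - max_tokens

print(check_budget("Write a haiku about autumn.", max_tokens=256))
```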
A transformer language model forms language by generating, one token at a time, the next token that should logically follow, conditioned on the input plus everything it has generated so far.
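As a conceptual sketch of that loop (not any real library's API; `ToyModel` is a stand-in that just picks tokens at random where a real transformer would score its vocabulary):

```python
import random

class ToyModel:
    """Stand-in for a real transformer: picks the next token at random,
    where a real model would score its whole vocabulary conditioned on
    the sequence so far."""
    eos_token = 0

    def predict_next(self, sequence: list[int]) -> int:
        return random.choice(range(50))  # drawing 0 ends generation early

def generate(model: ToyModel, prompt_tokens: list[int], max_tokens: int) -> list[int]:
    """Autoregressive decoding: each step appends one token, and the
    grown sequence is the input to the next step. max_tokens caps how
    many steps can run."""
    sequence = list(prompt_tokens)
    generated: list[int] = []
    for _ in range(max_tokens):
        next_token = model.predict_next(sequence)
        if next_token == model.eos_token:  # the model chose to stop
            break
        sequence.append(next_token)
        generated.append(next_token)
    return generated

print(generate(ToyModel(), [11, 22, 33], max_tokens=10))
```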
(In an ideal world, there would be two parameters: a `response_limit`, which would ensure that you don't spend too much money, and a `minimum_response_area_required`, which would throw an error if you provided too much input to leave room for the expected response. However, millions of developers and lines of deployed code rely on the existing system.)
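You can approximate those two hypothetical parameters client-side today. The names below come straight from the wish above and are not real API parameters:

```python
def safe_max_tokens(prompt_tokens: int, response_limit: int,
                    minimum_response_area_required: int,
                    context_length: int = 4096) -> int:
    """Client-side approximation of the two wished-for parameters:
    error out if the input leaves too little room for the response,
    otherwise cap spending at response_limit tokens."""
    remaining = context_length - prompt_tokens
    if remaining < minimum_response_area_required:
        raise ValueError(
            f"only {remaining} tokens left for the response; "
            f"need at least {minimum_response_area_required}"
        )
    # never reserve more than the context actually has available
    return min(response_limit, remaining)
```

The returned value is what you would then pass as `max_tokens`.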