Hello, I have a question about the token limit. Since more and more models can now fit very large contexts, I wonder why the number of output tokens remains limited.
Where does the output token limit come from? I always thought the tokens were simply counted together, input + output.
But that apparently isn't the case. Why not, and what is the limiting factor?
If it were just computing power, you could always generate 8k tokens and then carry on with a "Continue" button. But that button doesn't seem to exist anymore?
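To make my mental model concrete: as I understand it, input and output do share one context window, but on top of that there seems to be a separate per-response output cap. A rough sketch with made-up numbers (not tied to any specific model):

```python
# Illustrative sketch of a context window vs. a separate output cap.
# All numbers are invented for the example, not from any real model.

CONTEXT_WINDOW = 128_000   # total tokens the model attends to (input + output)
MAX_OUTPUT = 8_192         # separate hard cap on tokens generated per response

def remaining_output_budget(input_tokens: int) -> int:
    """Output is limited by whichever is smaller: the per-response cap,
    or the room left in the shared context window."""
    room_in_context = CONTEXT_WINDOW - input_tokens
    return min(MAX_OUTPUT, room_in_context)

# Even a short prompt cannot get more than MAX_OUTPUT tokens back:
print(remaining_output_budget(1_000))    # 8192

# Only when the input nearly fills the window does the shared
# "input + output" accounting become the binding limit:
print(remaining_output_budget(125_000))  # 3000
```

So my question is essentially: why does `MAX_OUTPUT` exist as its own limit at all, instead of the model just being allowed to generate until the context window is full?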