Max output token explanation

What is the point of the max output tokens limit?
If the context window is 128K, why is the max output so small, e.g. 4K?
How can the massive input token limit be useful if the output limit is this small?
What strategies can we use to get far more output from a large input? (Imagine a large corpus of data where the analysis is based on that large context and needs far more output than the limit allows.)

thank you

Most demands on LLMs are asymmetric in nature.

Long input context is incredibly important.

  • Context: the longer the input context, the more potentially relevant tokens arrive at the other end.

  • Tools: it is especially useful when using functions (tools), both for feeding the LLM with function definitions and, crucially, for providing answers, e.g. from RAG.

  • Summarisation: it is critical in applications where you wish to summarise a large body of text into a short result.

The list goes on …
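To make the asymmetry concrete, here is a minimal sketch (the `retrieved` passages and the word counts are illustrative stand-ins, not real data): a prompt assembled from many retrieved passages can run to thousands of input tokens while the expected completion is a single sentence.

```python
# Sketch of the input/output asymmetry in a typical RAG-style prompt.
# `retrieved` stands in for passages returned by a retrieval step.
retrieved = [f"Passage {i}: ... lots of corpus text here ..." for i in range(100)]

# Large input: all the passages, plus a question that wants a tiny answer.
prompt = "\n".join(retrieved) + "\nQuestion: What is the key finding? Answer in one sentence."

# The prompt dwarfs the expected completion: hundreds of input words
# versus a one-sentence answer well inside any output limit.
print(len(prompt.split()), "input words for a ~20-word answer")
```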

If you need longer output, just call the LLM again …
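That continuation pattern can be sketched as a loop: generate until the model stops because it hit the token limit, then resend the conversation with the partial output appended and ask it to continue. The `call_llm` function below is a stand-in for a real chat-completion call (real APIs report this via a finish/stop reason such as "length"); here it fakes a model that emits a fixed answer a few tokens at a time.

```python
# Sketch: obtaining output longer than the per-call limit by calling again.
# `call_llm` is a hypothetical stand-in for a real chat-completion request;
# it returns the next chunk of text and whether the limit truncated it.

def call_llm(prompt, generated_so_far, max_tokens=4):
    # Fake model: "generates" the words of a fixed answer, max_tokens at a time,
    # continuing from wherever the previous call left off.
    answer = "alpha beta gamma delta epsilon zeta".split()
    start = len(generated_so_far.split())
    chunk = answer[start:start + max_tokens]
    truncated = start + max_tokens < len(answer)
    return " ".join(chunk), truncated

def generate_long(prompt):
    out = ""
    truncated = True
    while truncated:
        # With a real API you would resend the conversation with the partial
        # output as an assistant message and instruct the model to continue.
        chunk, truncated = call_llm(prompt, out)
        out = (out + " " + chunk).strip()
    return out

print(generate_long("Summarise the corpus"))  # → alpha beta gamma delta epsilon zeta
```

The loop stitches the chunks back together, so the per-call output limit bounds only each round trip, not the total answer length.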