4096 response limit vs 128 000 context window

Hi Alwyn and welcome!

It’s very common for output to remain significantly below the 4096 output-token limit. Around 800–900 words tends to be the upper end of what the model returns in a single API call, and none of your proposed actions will change that.
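To put that in perspective, here's some rough arithmetic using the common rule of thumb that one token is about 0.75 English words (an assumption; the real ratio varies by text and tokenizer):

```python
# Rule of thumb: 1 token ~ 0.75 English words (assumption, varies by text)
WORDS_PER_TOKEN = 0.75

def estimated_tokens(word_count: int) -> int:
    """Rough token estimate for a given English word count."""
    return round(word_count / WORDS_PER_TOKEN)

print(estimated_tokens(900))   # ~1200 tokens, well under the 4096 output cap
```

So even a 900-word reply only uses roughly 1200 of the 4096 allowed output tokens; the model simply stops on its own long before the cap.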

@_j recently made a few good posts about this issue, but I’m struggling to find them right now.

Edit: Here is one of the posts that speaks to that:
