OpenAI Response API - Limit number of outputs

There is no specific control that covers what you likely experience.

However, the question itself is hard to interpret: "number of outputs" does not refer to anything clear-cut in API model behavior.

The only thing I can think you might be referring to is a previously seen issue where a structured-output JSON response is not immediately followed by a token ending the response; instead, the model sometimes continues and writes a second JSON object.
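If that is what you are seeing, a client-side workaround is to keep only the first complete JSON object and discard anything the model appended after it. A minimal sketch using the standard library (the function name is just an illustration):

```python
import json

def first_json_object(text: str):
    """Parse and return the first complete JSON object in `text`,
    ignoring any trailing content (e.g. a second JSON object)."""
    decoder = json.JSONDecoder()
    obj, _end = decoder.raw_decode(text.lstrip())
    return obj

# A doubled response like '{"name": "a"}{"name": "b"}' yields only
# the first object.
```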

Otherwise, it could simply be a matter of prompting technique versus a model that isn't following instructions.

You can start by reducing top_p from its default of 1.00 to 0.10 and see whether restricting sampling to the most likely tokens gets you more of the expected response style.
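As a sketch, that is just one extra parameter on the request; the model name and input here are placeholders, not a claim about your setup:

```python
# Request parameters for a Responses API call with constrained sampling.
# "gpt-4o-mini" and the input text are placeholders -- use your own.
params = {
    "model": "gpt-4o-mini",
    "input": "Return exactly one JSON object describing the item.",
    "top_p": 0.10,  # reduced from the default of 1.00
}

# With the official SDK this would be sent as:
#   from openai import OpenAI
#   client = OpenAI()
#   response = client.responses.create(**params)
```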