Agent using WebSearchTool with structured outputs fails validation: JSON unexpectedly ends with EOF at around 6,000 characters

I’ve tested both gpt-4o and gpt-4o-mini: when the model is given WebSearchTool together with structured outputs via output_type, the JSON prematurely ends in EOF at around 6,000 characters, resulting in a validation error.

ValidationError raised from TypeAdapter(GraphConstruction): 1 validation error for GraphConstruction
  Invalid JSON: EOF while parsing an object at line 1 column 6752 [type=json_invalid, input_value='{"entities":[{"entityId"...}],"links":[{"from_":1 ', input_type=str]
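The failure mode is plain truncation: the stream simply stops mid-object, so any strict JSON parser errors out at end-of-input. pydantic-core phrases this as "EOF while parsing"; the standard library reports the same condition as an expectation failure at the last character, which a short stdlib-only snippet can illustrate:

```python
import json

# Output cut off mid-generation, like the input_value in the error above.
truncated = '{"entities":[{"entityId"'

try:
    json.loads(truncated)
except json.JSONDecodeError as e:
    err = e

# err.pos lands at the very end of the string: the input ran out;
# the JSON was not structurally wrong earlier on.
print(err.msg, "at char", err.pos, "of", len(truncated))
```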

In fact, almost the same issue happens in consumer-facing ChatGPT when search is turned on and the model is instructed to find information and return it in a JSON schema. At around 6,000 characters of output in ChatGPT, instead of EOF, I consistently see the JSON interrupted by:
::contentReference[oaicite:0]{index=0}

Temporary solution: limiting max tokens in ModelSettings to 128,000 mysteriously gets rid of the JSON-integrity problems and removes the validation error caused by the JSON terminating mid-generation. I consistently got a validation error without max tokens set, and once I set max tokens to any arbitrary number, the error never occurred again. Asking the model not to go beyond 5,500 characters also works.
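Until this is fixed server-side, a small client-side guard can at least distinguish truncated output from genuinely malformed JSON, so the caller knows whether a retry with an explicit max-tokens cap is worth attempting. This is a hypothetical stdlib-only helper (`parse_model_json` is not part of any SDK), complementary to the max-tokens workaround above:

```python
import json


def parse_model_json(raw: str):
    """Parse model output; classify failures as 'truncated' or 'malformed'.

    Returns (parsed, None) on success, (None, reason) on failure.
    """
    try:
        return json.loads(raw), None
    except json.JSONDecodeError as e:
        # If the parser died at (or past) the last non-whitespace character,
        # the output was almost certainly cut off mid-generation rather than
        # being structurally wrong somewhere in the middle.
        reason = "truncated" if e.pos >= len(raw.rstrip()) else "malformed"
        return None, reason
```

A caller can then retry only the "truncated" case after applying an explicit max-tokens setting, instead of treating every validation failure the same way.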

Hope this gets patched soon.


Welcome to the dev forum, @alish.sult!

I’ve reproduced the issue during my testing. Thanks for reporting it.


To confirm: I am intermittently seeing a similar error while using the Responses API, WebSearchTool, and JSON output; the model is gpt-4o-mini:

1 validation error for SolutionAnswerFormat Invalid JSON: EOF while parsing a string at line 1 column 7233 [type=json_invalid, input_value='{"solutions":[{"Solution…ution Title":"Angel Co ', input_type=str] For further information visit …

Happy to see this post. This is affecting all of our production prompts, making web search a non-starter.

Edit:
Wow, after setting max tokens to 128,000 I finally get an error: Context Length Exceeded with 128,867 tokens! My input tokens are 867 and my output tokens are about 1,500, so the remaining ~126k is probably from the web search. Previously I had max tokens set to 16k; I also tried removing it (which I thought defaults to the max). Both would fail JSON validation while still returning a successful API response.

This provides a huge amount of insight into what's failing under the hood: OpenAI is filling up the context window with web-search results and breaking its own request in the process, and since it's an internal process, the error isn't even surfaced.