Invalid JSON response when using Structured Output

CycleMost · February 15, 2025, 7:47pm

I’m using the assistant API to run a thread and get a response back in JSON format, using a specified schema (json_schema format). In some cases (maybe 1% of the time, and not reproducible except randomly), the response contains invalid JSON. For example, here is a simplified version of the response that contains invalid json:

{
  "summary": "blah blah",
  "details": [
  ]
},
"notes": "blah blah"
}

There is an extra } which is causing the issue. The response should have been:

{
  "summary": "blah blah",
  "details": [
  ],
  "notes": "blah blah"
}

… and I am using strict: true in my schema definitions.

Has anyone else experienced something similar? I assumed that when using the json_schema output type, the output would always be valid JSON - not just 99% of the time.

dal1e · February 15, 2025, 8:39pm

Yes 8 from the 10871 calls made, using the https://api.openai.com/v1/chat/completions API endpoint. To be a bit more specific, the invalid JSON was the stringified JSON data within the choices[0].message.content property

The workaround for me is to check if the stringified JSON data returned is actually parsable, if not do a retry.

"choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "{ \"date\" : [ <RANDOM CUTOFF> }"

CycleMost · February 16, 2025, 12:29am

Thank you. I was going to implement a similar retry strategy. Also, looking at the documentation for structured outputs, it says this:

Structured Outputs can still contain mistakes. If you see mistakes, try adjusting your instructions, [etc…]

sps · February 16, 2025, 6:40am

@CycleMost @dal1e

Are you prompting the model in the system message to respond with a valid JSON object?

dal1e · February 16, 2025, 6:23pm

@sps Currently not, but it is a trivial to add. But as I mentioned, only 8 of the 10871 calls, that is statistically reasonable, one could argue that it aligns with the expected rare event probability. Currently my system prompt - among other instructions only - instructs to stick to the schema provided.

[...]
- Strictly adhere to the described <XYZ> schema in the response.
[...]

Topic		Replies	Views
Assistant Response is sometimes an invalid JSON Array API assistants-api	6	550	May 8, 2024
Response has valid json but it's nested in broken json Bugs	16	3530	September 9, 2024
Invalid json in structured mode Bugs	6	89	February 14, 2025
Response_format=json_object returns invalid json with finish_reason=stop Bugs json-mode	7	174	January 7, 2025
{ "type": "json_object" } not always working Prompting gpt-4	5	305	January 2, 2025

Invalid JSON response when using Structured Output

Related topics