How to get 100% valid JSON answers?

Hello everyone,

I’m currently working on a project where I need to ensure that I receive 100% valid JSON responses, especially since we’re utilizing the GPT-4 API. It’s critical for our application that the JSON output is consistently reliable and error-free.

Could anyone here advise on best practices or methods to guarantee completely valid JSON responses, particularly from the GPT-4 API? Are there specific tools or techniques I should be using to validate these responses effectively?

Thank you in advance for your help and insights!

Well, you can run a bunch of API calls in parallel and select one output that passes your validator.
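The selection step can be sketched like this. It is a minimal example, assuming the parallel calls have already produced a list of candidate strings; the validator here is simply `json.loads`, but any schema check would slot in the same way.

```python
import json

def first_valid_json(candidates):
    """Return the first candidate that parses as JSON, else None.

    `candidates` stands in for the outputs of several parallel API
    calls (hypothetical here -- any list of strings works).
    """
    for text in candidates:
        try:
            return json.loads(text)
        except json.JSONDecodeError:
            continue
    return None

# Example: two malformed outputs and one valid one.
outputs = ['{"a": 1,}', "{'a': 1}", '{"a": 1}']
print(first_valid_json(outputs))  # → {'a': 1}
```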

1 Like
  1. Instead of aiming for perfection, aim for 99% and then use proper error handling with feedback loops.

  2. Give few-shot examples of good JSON objects. Be careful with these examples though, because they will affect the outcomes.

  3. Keep it shallow

  4. There is a JSON mode but it can still fail and I wouldn’t recommend using it.

There’s a lot more advice but it depends on your use-case.
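Point 1 (error handling with a feedback loop) can be sketched as below. `call_model` is a placeholder for your real GPT-4 API call; the demo swaps in a fake model so the loop itself is visible.

```python
import json

def generate_with_retry(call_model, prompt, max_attempts=3):
    """Parse the model's reply; on failure, feed the error back and retry.

    `call_model(prompt)` is a stand-in for a real GPT-4 API call.
    """
    for _ in range(max_attempts):
        reply = call_model(prompt)
        try:
            return json.loads(reply)
        except json.JSONDecodeError as err:
            # Feedback loop: tell the model what was wrong with its output.
            prompt = (f"Your previous reply was not valid JSON ({err}). "
                      f"Reply again with only valid JSON:\n{reply}")
    raise ValueError("no valid JSON after retries")

# Demo with a fake model that fails once (trailing comma), then succeeds.
replies = iter(['{"x": 1,}', '{"x": 1}'])
result = generate_with_retry(lambda p: next(replies), "Return x=1 as JSON")
print(result)  # → {'x': 1}
```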

3 Likes

You might get 99.99999% performance with traditional deterministic code, but there are likely edge cases that will reduce that number down to 99.99% or lower.

With a statistically influenced model such as an LLM, there is no way to ensure that. No system offers single-attempt perfection; you can only approximate it with repetition, proper error checking, and result testing.

2 Likes

Why has nobody mentioned "response_format": { "type": "json_object" }? :slight_smile:

4 Likes

It’s still only mostly correct :smiley:


1 Like

But it’s much more stable than without it (especially for 3.5), and with generation of, say, 5–10 variations you can make it almost bulletproof :slight_smile:

1 Like

I am “100%” with you on that :smile: but in a production environment you need to wrap even the most certain of calls in a try/except and hopefully also an output data check.
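A minimal sketch of that wrapper, assuming an example schema with required keys "name" and "score" (your real data check would be stricter):

```python
import json

def parse_and_check(raw):
    """Wrap parsing in try/except, then check the shape of the result.

    The required keys ("name", "score") are just an example schema.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return None  # or log and retry
    if not isinstance(data, dict) or not {"name", "score"} <= data.keys():
        return None  # parsed fine, but not the structure we expected
    return data

print(parse_and_check('{"name": "a", "score": 3}'))  # → {'name': 'a', 'score': 3}
print(parse_and_check('{"name": "a"}'))              # → None
```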

1 Like

With 3.5 and before we had response format in JSON, I would include in the instructions to always return JSON, give it an example return format and, most importantly, tell it: “do not include any explanation”.

That would give me bare JSON every time. 100%, but I wasn’t running 1000s of requests, so I suppose it wasn’t actually guaranteed.

2 Likes

This is it exactly. I still have systems that I have not updated to use JSON mode, and they work extremely reliably, but once in 5–20k calls one will fail with some random missing } or a misplaced ,. Most of it can be caught with checks, but omitting error catching is a bad habit to get into.

1 Like

“I didn’t see any cars”

  • Says every person that gets hit by a car
3 Likes

I know a few methods after recent trials:

  1. llama.cpp:
    In recent months it has added support for constraining the streamed output to a given format (e.g. grammar-based restrictions on what tokens can be emitted).
  2. Post-processing:
    Fix the output with hard-coded rules, or by re-calling the LLM API; such methods appear in the LangChain framework.
  3. Careful prompting:
    LLMs are sensitive to prompts, so a prompt may work in some scenarios but not in others.
1 Like

I have the same situation, where I am forcing it to return JSON objects of varying lengths. What seems to have worked for me is doing a bit of everything: the JSON response format, mentioning it in the system message as well as the prompt, and giving it an example of the format. Important to note that giving an example of the output would tweak my results, so I’m only passing the format like this:

---BEGIN FORMAT TEMPLATE---
{'${BEHAVIOR}': '${REASONING}',
'${BEHAVIOR}': '${REASONING}',
'${BEHAVIOR}': '${REASONING}'
}
---END FORMAT TEMPLATE---

Wanted to recommend Promptotype for testing and validating this use case.
It’s a platform for developing, testing, and monitoring JSON-output prompts: you can define collections of queries with an expected output JSON schema (and values if needed) and verify that your prompt and model configuration perform as expected.
*I’m the creator, so feel free to reach out with questions or feedback.

1 Like

I have never had function calling return an invalid response. I use it for all json responses even if they’re not functions.
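For context, function calling works by pinning the model’s output to a JSON Schema you supply. The tool definition below is an illustrative example (the function name and fields are made up), showing the shape of what gets passed and what comes back:

```python
import json

# Illustrative tool definition: function calling makes the model emit
# arguments matching this JSON Schema, which is why it is so reliable.
tools = [{
    "type": "function",
    "function": {
        "name": "record_answer",  # hypothetical function name
        "parameters": {
            "type": "object",
            "properties": {
                "answer": {"type": "string"},
                "confidence": {"type": "number"},
            },
            "required": ["answer"],
        },
    },
}]
# Passed as `tools=tools` to chat.completions.create; the reply's
# tool-call arguments arrive as a JSON string ready for json.loads().
sample_arguments = '{"answer": "42", "confidence": 0.9}'
print(json.loads(sample_arguments)["answer"])  # → 42
```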

2 Likes

Over time, I have faced some common issues with the JSON responses from OpenAI. Though not very frequent, they can still occur: responses malformed by missing brackets, extra words like “json”, or single quotes instead of double quotes.

I have written this small utility which tries to address these discrepancies with post-processing.

You can find the project on GitHub:

:link: GPT JSON Sanitizer

Feel free to check it out, and let me know of more cases, as I know there will be more, and we can together extend it further.
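To give a flavor of the idea (this is a minimal sketch, not the linked project’s actual code), two of the cases above can be handled like this:

```python
import json
import re

def sanitize(raw):
    """Minimal sketch of JSON clean-up, not the GPT JSON Sanitizer itself.

    Handles two common issues: markdown code fences around the object
    and a stray leading "json" label.
    """
    text = raw.strip()
    # Strip ```json ... ``` fences if present.
    text = re.sub(r"^```(?:json)?\s*|\s*```$", "", text)
    # Drop a bare leading "json" word.
    text = re.sub(r"^json\s*", "", text)
    return json.loads(text)

print(sanitize('```json\n{"ok": true}\n```'))  # → {'ok': True}
```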

I’m trying to get a response from the chat completion similar to this (a list of JSON objects):

messages=[
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ]

I have tried with this approach:

However, when using the response format in JSON I just get the first object of the list. In this case

{"role": "system", "content": "You are a helpful assistant."}

I obtain the best results when not using the JSON response format parameter, but then it works just 2/5 times. I have tried re-calling the API with the wrong response as input, but it doesn’t work every time. I have also tried working with two models, one to generate the data and a second to convert it to JSON; this is also not working 100%.

What is the best approach to get the JSON response right from the first step? (In my case, a list of JSON objects.)
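One likely cause, going by the docs: JSON mode produces a single top-level object, so asking for a bare list tends to get truncated to one element. A common workaround (a sketch, with "messages" as an assumed key name) is to prompt for the list wrapped in an object key and unwrap it yourself:

```python
import json

# JSON mode emits one top-level object, so prompt the model for
# something like {"messages": [...]} and extract the list afterwards.
raw = ('{"messages": ['
       '{"role": "system", "content": "You are a helpful assistant."}, '
       '{"role": "user", "content": "Hello!"}]}')
messages = json.loads(raw)["messages"]
print(len(messages))  # → 2
```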