Structured Outputs Deep-dive

The article’s reference to CFG (context-free grammar), along with the flowchart illustration that opens it, is highly specific yet highly speculative. OpenAI doesn’t disclose how its JSON mode actually works.

In fact, the JSON response appears to be backwards-looking, i.e. token-run based. Generate and then extend a json-mode response one token at a time by increasing the max_tokens value, and you’ll see earlier logits replaced with new ones. This may be a switch-transformer or mixture-of-experts architecture at work, though that is only a suspicion.
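The probe described above boils down to comparing two token runs from the same prompt at different max_tokens values. A minimal sketch of that comparison (the token lists here are fabricated stand-ins for the logprob output of two real API calls):

```python
def first_divergence(run_a, run_b):
    """Index of the first position where two token runs disagree,
    or None if the shorter run is an exact prefix of the longer one."""
    for i, (a, b) in enumerate(zip(run_a, run_b)):
        if a != b:
            return i
    return None

# Fabricated example: if raising max_tokens merely appended tokens,
# the shorter run would be a strict prefix of the longer one.
# A non-None divergence index means earlier tokens were replaced.
short_run = ['{"', 'name', '":', ' "', 'Al']
long_run  = ['{"', 'name', '":', ' "', 'Alice', '"}']

print(first_divergence(short_run, long_run))  # → 4
```

Run this against real logprob output from two calls and a None result would argue for pure left-to-right extension; a divergence argues for regeneration.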

If the algorithm were clever, it wouldn’t consider 500 tab characters or newlines a valid production of this mode. That is why, originally, “JSON” had to be mentioned in the prompt or the request would be rejected, and why the output is now described to the AI by a schema — which was good practice even before this feature existed.
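The whitespace point is easy to verify with the standard json module: a run of 500 tabs followed by an object is still perfectly valid JSON, so a sampler constrained only by grammar-level validity has no reason to reject it:

```python
import json

# 500 tab characters, then a minimal object: this parses fine,
# so a purely grammar-constrained decoder would accept the tabs
# as a legal production at every step.
payload = "\t" * 500 + '{"ok": true}'
print(json.loads(payload))  # → {'ok': True}
```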

Also, Pydantic is merely one possible client-side implementation; it has nothing to do with what goes over the wire.
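To make that concrete, here is a sketch of what actually travels in the request body: a plain JSON Schema object, which you could just as well write by hand (the field names below are made up; Pydantic’s role is only to generate such a dict for you):

```python
import json

# Hand-written JSON Schema -- no Pydantic objects ever leave the
# client process; only this serialized dict crosses the wire.
schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name", "age"],
}

wire_payload = json.dumps({
    "response_format": {
        "type": "json_schema",
        "json_schema": {"name": "person", "schema": schema},
    }
})
print(wire_payload)
```

Round-tripping `wire_payload` through `json.loads` recovers the same dict, underlining that the server only ever sees serialized JSON.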
