How do structured outputs via functions or response_format actually get processed in the prompt?

I've seen variations of this question asked, but I don't believe I've seen a direct answer. When an output schema is passed in via either the functions parameter or the response_format parameter, does anyone know how it actually gets fed to the model during inference? I.e., does the model see the system + user prompt followed by a string serialization of the output schema, along with some under-the-hood instructions on adhering to it? Or is there some deeper internal serialization of the schema?
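For context on what I'm imagining: one commonly reported (but, as far as I know, not officially documented) hypothesis is that the function schemas get serialized into a TypeScript-like declaration block that is injected into the system message. Purely as an illustration of that idea, here's a rough sketch of what such a serialization *might* look like; the exact format, names, and wrapper text are my guesses, not the actual implementation:

```python
def render_tools_prompt(functions):
    """Sketch: serialize OpenAI-style function definitions into a
    hypothetical TypeScript-like prompt block. Illustrative only --
    the real internal format is not publicly documented."""
    # Minimal JSON Schema type -> TypeScript type mapping (assumed).
    ts_types = {"string": "string", "number": "number",
                "integer": "number", "boolean": "boolean"}
    lines = ["# Tools", "", "## functions", "", "namespace functions {", ""]
    for fn in functions:
        if fn.get("description"):
            lines.append(f"// {fn['description']}")
        params = fn.get("parameters", {})
        required = set(params.get("required", []))
        fields = []
        for name, spec in params.get("properties", {}).items():
            optional = "" if name in required else "?"
            fields.append(f"{name}{optional}: "
                          f"{ts_types.get(spec.get('type'), 'any')}")
        lines.append(f"type {fn['name']} = "
                     f"(_: {{ {', '.join(fields)} }}) => any;")
        lines.append("")
    lines.append("} // namespace functions")
    return "\n".join(lines)


# Example function definition in the shape the API accepts:
weather_fn = {
    "name": "get_weather",
    "description": "Get the current weather for a location",
    "parameters": {
        "type": "object",
        "properties": {
            "location": {"type": "string"},
            "unit": {"type": "string"},
        },
        "required": ["location"],
    },
}

print(render_tools_prompt([weather_fn]))
```

If something like this is what happens, the model would see this rendered text appended to the system prompt, plus some standing instructions to respond with a call matching one of the declared types. But again, that's the speculation I'm hoping someone can confirm or correct.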

Any help would be appreciated :slight_smile: