Streaming structured output tool calls with additional message content

sps · December 25, 2024, 10:14am

The primary objective of structured outputs is to adhere to the JSON schema specification. This is accomplished by constraining the sample space of tokens, based on the specific part of the schema that the tokens are being generated for.

In scenarios where you desire the model to also generate a plaintext message to the user in addition to the structured output, it would be beneficial to include an additional parameter, such as message, thinking, or reasoning, of type string. Then have the model generate for this parameter aligning with your expected output within the content attribute.

Here’s an example from the same blog for showing reasoning step for solving a mathematical problem.

Topic		Replies	Views
Function calling response format API api	12	1163	October 3, 2024
Is it possible to have tool_call and content in single completion message API gpt-4 , api	6	3332	May 15, 2024
Partially structured output? Free text output, but force correct tool call JSON API structured-output	9	684	October 8, 2024
How can I use function calling with response format (structured output feature) for final response? Feedback gpt-4 , assistants-api	10	2371	January 31, 2025
Mixing streaming chat completions with tool_calls? API	1	2044	December 11, 2023

Streaming structured output tool calls with additional message content

Related topics