I’d like to stream structured output responses, but I noticed that client.beta.chat.completions.parse doesn’t seem to support streaming.
Has anyone successfully implemented streaming structured outputs, or does this require a workaround that drops the Pydantic class for response_format? I'd love to hear about best practices or alternative approaches.
It does. You can up your API game with the SDK's streaming helpers, which also come with collectors that assemble the final response object for you.
Here's an example, switching over to my “python examples” directory:
from pydantic import BaseModel
from openai import OpenAI

client = OpenAI()

class SimpleResponse(BaseModel):
    answer: str  # swap in your own schema fields

request_content = "Why is the sky blue? Answer briefly."

with client.beta.chat.completions.stream(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": "You are a helpful AI assistant"},
        {"role": "user", "content": request_content},
    ],
    stream_options={"include_usage": True},
    max_completion_tokens=2000,  # openai.LengthFinishReasonError if the JSON is cut off unparsable
    # logprobs=True,
    # top_logprobs=1,
    response_format=SimpleResponse,
    # tools=NOT_GIVEN,  # if passed at all, tools cannot be an empty list
) as stream:
    for event in stream:
        process_event(event)
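The “collectors” I mentioned: while the with block is still open, the stream can hand back the fully assembled, parsed response once the events are exhausted. A small sketch, continuing inside the same block (the SimpleResponse fields are my placeholders):

    # still inside the with block, after the event loop
    completion = stream.get_final_completion()
    print(completion.choices[0].message.parsed)  # a SimpleResponse instance
    print(completion.usage)  # populated because include_usage was requested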
You get to write your own process_event handler; a hint, though:
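Here's a minimal sketch of one; the event types and attributes below (content.delta, content.done, chunk) are the ones emitted by the chat streaming helper:

def process_event(event) -> None:
    if event.type == "content.delta":
        # incremental text of the JSON being generated
        print(event.delta, end="", flush=True)
    elif event.type == "content.done":
        # event.parsed is the validated SimpleResponse instance
        print()
        print(event.parsed)
    elif event.type == "chunk":
        # raw ChatCompletionChunk, plus an accumulated snapshot
        pass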
Or get really dirty in the SDK internals and wait for things to break:
# private SDK import paths: these can break between openai-python releases
from openai._types import NOT_GIVEN, IncEx, NotGiven, Union, Any
from openai import BaseModel
from openai._streaming import json
from openai.lib import pydantic_function_tool
from openai.lib.streaming.chat import ChatCompletionStreamManager
from openai import ContentFilterFinishReasonError, APIResponseValidationError
from openai import Client

client = Client()
with ChatCompletionStreamManager(
    # the manager expects a callable returning a raw chunk stream,
    # so the underlying create() call must use stream=True
    api_request=lambda: client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": request_content}],
        stream=True,
    ),
    # drives client-side parsing only; the schema is not sent to the server here
    response_format=SimpleResponse,
    input_tools=NOT_GIVEN,
) as stream:
    for event in stream:  # SimpleResponse and process_event as defined above
        process_event(event)
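Those exception imports earn their keep when a stream ends badly. A sketch of catching the parse-time failures, assuming the same names as above (the length error is the one my max_completion_tokens comment warned about):

import openai

try:
    with client.beta.chat.completions.stream(
        model="gpt-4o",
        messages=[{"role": "user", "content": request_content}],
        max_completion_tokens=2000,
        response_format=SimpleResponse,
    ) as stream:
        for event in stream:
            process_event(event)
except openai.LengthFinishReasonError:
    print("max_completion_tokens cut the JSON off before it could be parsed")
except openai.ContentFilterFinishReasonError:
    print("generation was stopped by the content filter")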