Firstly, thank you for this. It really does need to be added to the documentation. The docs are not clear at all.
It should be noted that all tools must be of type function when response_format is of type json_schema.
This is/was in the documentation in some form, but it is not made clear. The documentation at https://platform.openai.com/docs/guides/structured-outputs/introduction makes it sound like you can use it either when using function calling or when using a `json_schema` response format, but that does not apply to Assistants.
I tried implementing `json_schema` with `tools=[{"type": "file_search"}]` and it is still not a thing. I’ve wanted this feature for a while so the Assistants API behaves more like Chat Completions, but right now I’m having to convince the Assistants API to respond with JSON through some kludgery instead, and then deal with broken JSON output through exception handling.
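For reference, this is roughly the combination that gets rejected. It is a minimal sketch assuming an existing `client`, `thread`, `assistant`, and a JSON-schema dict `my_schema`; the names are placeholders, not from the post above:

run = client.beta.threads.runs.create(
    thread_id=thread.id,
    assistant_id=assistant.id,
    tools=[{"type": "file_search"}],  # a non-function tool...
    response_format={                 # ...combined with a json_schema response format
        "type": "json_schema",
        "json_schema": {"name": "my_schema", "schema": my_schema},
    },
)
# The API rejects this with the 400 "all tools must be of type `function`"
# error quoted later in this thread.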
I’m dealing with the same problem - not being able to use the tools. One workaround is to do two consecutive runs for a given prompt: the first to solve the problem using whatever tools you need, and the second with the structured output and the prompt “please output the results as JSON”. The results from the first run will be in the immediate message history, so it should work OK.
I am still having issues with setting the assistant with the structured outputs.
First, it’s unclear where we should set it (nothing is mentioned in the documentation). In the creation of the assistant, the messages, or the threads?
Secondly, I tried out the suggestion above and got the following:
openai.BadRequestError: Error code: 400 - {'error': {'message': 'Invalid tools: all tools must be of type `function` when `response_format` is of type `json_schema`.', 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}}
The answer is in the error message. You can’t use any tools other than your own functions, so no code interpreter and no file search. You can set `tools=[]` in the call that creates the run; this overrides whatever tools you have set up on the assistant, so you can at least check that the structured-output part is working. To use tools, split your workflow into two parts: create a run where you try to solve the task given by the prompt with all the tools you need, then in a second run disable all the tools, set the structured-output schema, and in the prompt ask the assistant to “please output the results as the given JSON” or something of the like (a rough sketch is below).
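A sketch of that two-run flow, assuming an existing `client`, `assistant`, `thread`, and a JSON-schema dict `my_schema`; the prompt text and schema name are illustrative, and `create_and_poll` is the convenience helper from the openai Python library:

# Run 1: solve the task with whatever tools the assistant has (file_search etc.).
client.beta.threads.messages.create(
    thread_id=thread.id, role="user",
    content="Answer the question using the attached files.",
)
run1 = client.beta.threads.runs.create_and_poll(
    thread_id=thread.id, assistant_id=assistant.id
)

# Run 2: disable all tools and ask only for the structured output.
client.beta.threads.messages.create(
    thread_id=thread.id, role="user",
    content="Please output the results above as the given JSON.",
)
run2 = client.beta.threads.runs.create_and_poll(
    thread_id=thread.id,
    assistant_id=assistant.id,
    tools=[],  # overrides the assistant's tools, so json_schema is accepted
    response_format={
        "type": "json_schema",
        "json_schema": {"name": "results", "schema": my_schema},
    },
)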
I’m still struggling to find a solution when using the Assistants API.
Similar to the issue raised by RvB1701, I’m unsure where we’re supposed to specify the response type. I’m a bit confused, and I’m starting to wonder if I need to specify the response type when retrieving the messages after the run has completed.
@vismantas I don’t think we’re supposed to pass a Pydantic model directly to response_format. While you could supply the JSON schema, I’ve read that if you specify the Pydantic model, it’s supposed to return it as a Pydantic model.
AFAIK the Assistants API and the Completions API behave slightly differently. Specifically, the Completions API does accept a Pydantic model directly, while the Assistants API, at least at the time I originally answered this question, did not. I’ve not checked the API for changes since, so this might be outdated.
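For comparison, here is the Chat Completions side of that difference as I understand it: the `parse` helper in the openai Python library takes a Pydantic model directly. The model name and fields below are just an example:

from pydantic import BaseModel
from openai import OpenAI

class Answer(BaseModel):
    summary: str

client = OpenAI()
completion = client.beta.chat.completions.parse(
    model="gpt-4o-2024-08-06",
    messages=[{"role": "user", "content": "Summarise structured outputs in one line."}],
    response_format=Answer,  # a Pydantic model is accepted here
)
print(completion.choices[0].message.parsed)  # an Answer instance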
I was able to get my code to run using `.model_json_schema()` to define the schema. Like @vismantas mentioned, it doesn’t look like the Assistants API fully supports Pydantic models yet. You can still define the model with Pydantic, but then use `.model_json_schema()` to convert the Pydantic model into a JSON schema.
The only difference I can think of might have to do with which tools you are using with your assistant. I am not using any tools right now as I was just trying to get the Structured Output to work.
I used a Jupyter notebook, and set up the client, thread, and message using the documentation. I then updated my run as follows:
from pydantic import BaseModel
from typing import List

class Message(BaseModel):
    message: str

class Messages(BaseModel):
    messages: List[Message]

run = client.beta.threads.runs.create(
    thread_id=thread.id,
    assistant_id=assistant.id,
    response_format={
        "type": "json_schema",
        "json_schema": {
            "name": "test_schema",
            "schema": Messages.model_json_schema()
        }
    }
)
Quite sloppy that they don’t mention anywhere in their documentation that a Pydantic model is not directly accepted as a response format in the Assistants API…
After searching in the openai GitHub repo, I found out that the Pydantic models are not allowed to have default values, and there are other problems with Pydantic model schema generation… my guess is that you use Optional[FieldRule] in your model.
Here is the source.
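To illustrate the restriction (the field names below are made up): a field with a default or Optional type is dropped from the schema’s required list, which is the kind of thing structured outputs complain about.

from typing import Optional
from pydantic import BaseModel

class Bad(BaseModel):
    rule: Optional[str] = None   # default value -> not listed as required

class Good(BaseModel):
    rule: str                    # no default -> listed as required

print(Bad.model_json_schema().get("required"))   # None
print(Good.model_json_schema().get("required"))  # ['rule']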
Like @TheZeke mentioned earlier, it’s still impossible to combine file_search with a json_schema response format. As a workaround until an official update is released, I created a function called ‘extract_my_data’, which lets me access my data in JSON-schema format when it is triggered.
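Roughly, that workaround looks like the sketch below. ‘extract_my_data’ is the function name from the post above; the parameter schema, the `strict` flag, and the update call are my own illustrative assumptions:

assistant = client.beta.assistants.update(
    assistant.id,
    tools=[
        {"type": "file_search"},
        {
            "type": "function",
            "function": {
                "name": "extract_my_data",
                "description": "Return the extracted data as structured JSON.",
                "strict": True,
                "parameters": {
                    "type": "object",
                    "properties": {
                        "items": {"type": "array", "items": {"type": "string"}}
                    },
                    "required": ["items"],
                    "additionalProperties": False,
                },
            },
        },
    ],
)
# When the run pauses with status "requires_action", the structured arguments are in
# run.required_action.submit_tool_outputs.tool_calls[0].function.arguments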
What model deployment are you using and what version? I’m getting
{'error': {'message': "Invalid parameter: 'response_format' of type 'json_schema' is not supported with model version `gpt-4o-08-06`.", 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}}
"tool_resources": {
    "code_interpreter": null,
    "file_search": {
        "vector_store_ids": [
            "vs_QkrKgLCYpASSDF21333Z2apeo"
        ]
    }
},
As far as I understand, they offer two ways to specify the response format: a json_object, or a json_schema that belongs to the base model, I believe.
If I remember correctly, this can also be combined with file_search, but by adding it from the official page; all three features worked correctly for me: function_calling, response_format, and file_search.