I’m using the following code to kick off a run of an eval I’ve already created, against some input data I’ve saved to a file:
run = client.evals.runs.create(
    eval_obj.id,
    name="r1",
    data_source={
        "type": "completions",
        "model": "gpt-4.1-nano",
        "input_messages": {
            "type": "template",
            "template": [
                {"role": "developer", "content": MAIN_PROMPT},
                {"role": "user", "content": "{{ item.filtered_pdf }}"},
            ],
        },
        "response_format": {
            "type": "json_schema",
            "json_schema": REPORT_SCHEMA,
        },
        "source": {"type": "file_id", "id": file.id},
    },
)
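After the run completes, I check the sampled output for each row. This is roughly how I list the run's output items (a sketch: I believe the SDK exposes an output_items accessor along these lines, but treat the exact names as an assumption on my part):

for item in client.evals.runs.output_items.list(run_id=run.id, eval_id=eval_obj.id):
    # each output item carries the model's sampled completion for one source row
    print(item.sample.output[0].content)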
The eval runs fine; however, the sampled output is not formatted as specified in the json_schema. To test that my schema was working as intended, I ran the following chat completion request with the same response format:
response = client.chat.completions.create(
    model="gpt-4.1-nano",
    messages=[
        {"role": "developer", "content": MAIN_PROMPT},
        {"role": "user", "content": ...},
    ],
    response_format={
        "type": "json_schema",
        "json_schema": REPORT_SCHEMA,
    },
    temperature=0.0,
)
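The message content here parses cleanly as JSON with the object-shaped root the schema defines; a minimal sanity check along these lines (illustrative) passes:

import json

parsed = json.loads(response.choices[0].message.content)  # parses without error
assert isinstance(parsed, dict)  # top-level object, as json_schema response formats require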
So the same model and response_format produce correctly structured output through chat completions, leading me to believe this is a bug specific to evals.
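For completeness, REPORT_SCHEMA is an ordinary json_schema payload. A heavily simplified sketch of its shape (the real schema is larger, and these property names are purely illustrative):

REPORT_SCHEMA = {
    "name": "report",
    "strict": True,
    "schema": {
        "type": "object",
        "properties": {
            "summary": {"type": "string"},
            "findings": {"type": "array", "items": {"type": "string"}},
        },
        "required": ["summary", "findings"],
        "additionalProperties": False,
    },
}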