GPT-4o inconsistently respects field descriptions with Structured Outputs

It seems that GPT-4o follows instructions defined in a field's description very inconsistently (or, more accurately, rarely). In contrast, if I put the same instruction directly in the system prompt, it is respected much more reliably. (I use the OpenAI Python client with Pydantic models and the response_format option.)

For example, this works much worse:

system_prompt = "Extract information from the below medical report in JSON format:"

from pydantic import BaseModel, Field

class Summary(BaseModel):
    ...
    score: int = Field(description="The NIH Stroke Scale/Score (NIHSS). Add 5 to the reported score.")
    ...

And this works much better:

system_prompt = """
Extract information from the below medical report in JSON format using these properties:
...
- score: The NIH Stroke Scale/Score (NIHSS). Add 5 to the reported score.
...
"""

from pydantic import BaseModel

class Summary(BaseModel):
    ...
    score: int
    ...

(The "Add 5" instruction is there only to test whether the model follows the field description; it has no medical meaning.)
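
For reference, both variants are invoked the same way; here is a minimal sketch of the call, assuming the SDK's beta parse helper, with report_text standing in for the actual report:

from openai import OpenAI

client = OpenAI()
completion = client.beta.chat.completions.parse(
    model="gpt-4o",  # placeholder; use whichever snapshot you actually target
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": report_text},  # report_text is a placeholder
    ],
    response_format=Summary,  # the Pydantic model defines the JSON schema
)
summary = completion.choices[0].message.parsed  # a validated Summary instance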

Has anyone else had this experience?


Hi @medihack and welcome to the community!

Yes, this has been my experience as well. There have been other similar discussions here, and the consensus is that leaving the field descriptions empty and placing all definitions and instructions in the system prompt yields the best and most consistent results. The Pydantic model is then used purely to define the output schema.
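
One way to operationalize that (an illustrative sketch of my own, not something from this thread): keep the per-field instructions in a plain dict and assemble the system prompt from it, so the Pydantic model stays a bare schema while each instruction still lives in exactly one place:

FIELD_INSTRUCTIONS = {
    "score": "The NIH Stroke Scale/Score (NIHSS). Add 5 to the reported score.",
}

def build_system_prompt(instructions: dict[str, str]) -> str:
    # Build the property list that replaces the per-field Field descriptions.
    lines = ["Extract information from the below medical report in JSON format using these properties:"]
    lines += [f"- {name}: {text}" for name, text in instructions.items()]
    return "\n".join(lines)

system_prompt = build_system_prompt(FIELD_INSTRUCTIONS)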
