I am using the run_grader API function to test my graders. Let’s look at what we can do according to the documentation:
curl -X POST https://api.openai.com/v1/fine_tuning/alpha/graders/run \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "grader": {
      "type": "score_model",
      "name": "Example score model grader",
      "input": [
        {
          "role": "user",
          "content": "Score how close the reference answer is to the model answer. Score 1.0 if they are the same and 0.0 if they are different. Return just a floating point score\n\nReference answer: {{item.reference_answer}}\n\nModel answer: {{sample.output_text}}"
        }
      ],
      "model": "gpt-4o-2024-08-06",
      "sampling_params": {
        "temperature": 1,
        "top_p": 1,
        "seed": 42
      }
    },
    "item": {
      "reference_answer": "fuzzy wuzzy was a bear"
    },
    "model_sample": "fuzzy wuzzy was a bear"
  }'
The documentation states:

model_sample
string
Required
The model sample to be evaluated. This value will be used to populate the sample namespace. See the guide for more details. The output_json variable will be populated if the model sample is a valid JSON string.
And indeed, I got that to work. When I supply a plain string, output_text is populated. When I supply valid JSON as a string, the output_json variable is populated in addition to output_text. So far, so good. But what string do I have to supply to model_sample so that output_tools will be populated? I naively tried supplying a stringified array of tool calls conforming to the Chat Completions API schema, but that didn't work.
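To make the behavior I am observing concrete, here is my mental model of how the sample namespace seems to be populated from model_sample. This is purely illustrative Python of my own, not actual OpenAI code, and it captures only the output_text/output_json behavior I have verified; output_tools is exactly the part I cannot reproduce:

```python
import json

# My mental model of how the "sample" namespace appears to be populated
# from model_sample (illustrative only -- not actual OpenAI code).
def build_sample_namespace(model_sample: str) -> dict:
    namespace = {"output_text": model_sample}  # always set to the raw string
    try:
        # If the string parses as JSON, output_json is set as well.
        namespace["output_json"] = json.loads(model_sample)
    except json.JSONDecodeError:
        pass  # plain strings only populate output_text
    # Open question: what input would populate "output_tools" here?
    return namespace

print(build_sample_namespace("fuzzy wuzzy was a bear"))
print(build_sample_namespace('{"answer": 42}'))
```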