I read the docs on evals and graders, and IMHO parts of them look deprecated or inaccurate for the Responses API.
I find the idea of evals very useful for my use case, but I have these problems/questions and couldn't find answers after two days of experimenting and searching:
- Is there a way to pass a vector_store_id to a Responses API eval run and then check (with a grader) whether the file_search tool was called? (See the first sketch after this list for what I'm trying to express.)
- The docs say you can use `{{ sample.output_json }}` or `{{ sample.output_tools }}` to access the JSON response or the tools used by the model, but that seems impossible due to UI limitations when creating Evaluations from the Dashboard: you can only use `{{ sample.output_text }}`.
- Is there a way to inspect the contents of the `{{ sample }}` item? That would be really helpful for debugging. (See the second sketch below for how I imagine doing it.)
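
For context, here's a minimal sketch (Python, openai SDK) of what I'm trying to express for the first question. The eval id, dataset file id, vector store id, and model are all placeholders, and I genuinely don't know whether `sampling_params` accepts a `tools` entry for Responses API runs; that uncertainty is the core of my question:

```python
from openai import OpenAI

client = OpenAI()

# All ids below are placeholders. Whether a Responses API eval run accepts
# a file_search tool in sampling_params is exactly what I'm asking about.
run = client.evals.runs.create(
    "eval_abc123",  # placeholder: id of an existing eval
    name="file-search-check",
    data_source={
        "type": "responses",
        "source": {"type": "file_id", "id": "file-abc123"},  # placeholder dataset
        "input_messages": {
            "type": "template",
            "template": [{"role": "user", "content": "{{ item.question }}"}],
        },
        "model": "gpt-4o-mini",
        "sampling_params": {
            # Unverified assumption: that a tools list with vector_store_ids
            # can be passed here at all.
            "tools": [{"type": "file_search", "vector_store_ids": ["vs_abc123"]}],
        },
    },
)
print(run.id, run.status)
```

If this shape is wrong, I'd love to know what the right one looks like.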
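As for the debugging question, the closest thing I've found is listing a finished run's output items and printing the recorded sample, though I'm not sure this shows everything the `{{ sample }}` template can actually see. Again, the ids are placeholders:

```python
from openai import OpenAI

client = OpenAI()

# Placeholder ids. Each output item carries the per-row sample that the grader
# templates are rendered against, so dumping it reveals the available fields.
items = client.evals.runs.output_items.list(
    "evalrun_abc123",       # placeholder: run id
    eval_id="eval_abc123",  # placeholder: parent eval id
)
for item in items:
    print(item.sample)
```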
Any information on this would be very helpful; I've been testing for two days and I feel blocked.
Thanks!