How to eval and test realtime voice agents?

Tom_Scrace · July 21, 2025, 9:17am

Has anybody been able to build evals for the realtime models? I have an agent built with the Agents SDK and 4o-realtime, but I’m not sure how to go about creating evals for voice-to-voice models. Automated testing is also a challenge!

Thanks!
Tom
