Has anybody been able to build evals for the realtime models? I have an agent built with Agents SDK and 4o-realtime, but I’m not really sure how to go about creating evals for voice-to-voice models. Automated testing is also a challenge!
Thanks!
Tom