New audio models in the API + tools for voice agents

Realtime vision coming soon I hope!
