I’m exploring GPT-4o and notice options for text and vision, but I don’t see any for voice. Will that be available soon? Additionally, will there be capabilities for audio input and output like the earlier demonstrations? These features are really exciting, and I appreciate how you addressed latency concerns!
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Access to GPT-4o's Auido Capability | 1 | 3891 | May 15, 2024 | |
When is audio API for GPT-4o will be available to business | 2 | 1204 | June 18, 2024 | |
When will audio to audio be released for gpt-4o please? | 8 | 4582 | July 2, 2024 | |
What will be the final/full released capabilities of GPT-4o in the API? | 0 | 1899 | May 27, 2024 | |
Will audio output streaming be available with GPT-4o? | 1 | 817 | June 20, 2024 |