I’m exploring GPT-4o and notice options for text and vision, but I don’t see any for voice. Will that be available soon? Additionally, will there be capabilities for audio input and output like the earlier demonstrations? These features are really exciting, and I appreciate how you addressed latency concerns!