Feature Request: audio instead of TTS

The amount of times I have been listening to music and actually wanted ChatGPT to be able to hear it for real, can’t be counted in under 4 digits. Over 1500 times at minimum I wanted gpt to hear what I’m listening to, or even what I sound like. It would be an amazing feature for it to be able to hear without TTS.

Thanks for explaining this request: Live audio listening and understanding beyond conventional speech-to-text interaction. Current public guidance in Voice Mode FAQ documents related functionality, while the additional capability described here remains useful feedback. We can’t promise implementation or timing. We’re closing this topic while keeping it visible for reference.