I want to use the OpenAI Realtime API to process audio input and receive only text output in response. I do not need any audio response—just the text output.
Has anyone implemented this? I’m looking for:
- How to send live audio input to the API.
- How to configure the API to return only text (disable audio response).
Would appreciate any insights or sample code!