Can gpt-realtime produce transcription of both input and output, so I can display the user query and model response in text in real time in addition to audio?
I suppose I can feed audio through transcription model, but I was wondering if this can be done with one model.
Thanks!