Is there any way to use the Realtime audio API with a behavioural prompt and output configured as JSON?

Aravind_Ks · June 20, 2025, 8:13am

Can any of the OpenAI real-time audio APIs take real-time audio and return more customised output, such as JSON, where we define how the JSON should be structured? Also, are we able to set a behavioural prompt for the model?
My idea is: speech → model gets the audio in real time → the model output is processed into JSON → the app converts the JSON into actionable commands.
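
A minimal sketch of the last step of that pipeline (JSON → actionable commands), assuming the model's reply reaches the app as a JSON string. The `action` and `params` fields, the prompt text, and the handler names are all hypothetical and only for illustration; no specific OpenAI API call is shown, since how the prompt is passed and how the reply is received depends on the API used.

```python
import json

# Hypothetical behavioural prompt that asks the model for JSON only.
# How it is handed to the model depends on the API being used.
BEHAVIOUR_PROMPT = (
    "You are a voice command parser. Reply only with a JSON object of the "
    'form {"action": "<name>", "params": {...}}.'
)


# Hypothetical command handlers, stand-ins for real app actions.
def set_volume(level: int) -> str:
    return f"volume set to {level}"


def play_track(title: str) -> str:
    return f"playing {title}"


# Map of action names (as they appear in the JSON) to handlers.
HANDLERS = {"set_volume": set_volume, "play_track": play_track}


def dispatch(model_output: str) -> str:
    """Parse the model's JSON text and run the matching handler."""
    try:
        command = json.loads(model_output)
    except json.JSONDecodeError as exc:
        raise ValueError(f"model did not return valid JSON: {exc}") from exc

    if not isinstance(command, dict):
        raise ValueError("expected a JSON object")

    action = command.get("action")
    params = command.get("params", {})
    if action not in HANDLERS:
        raise ValueError(f"unknown action: {action!r}")
    if not isinstance(params, dict):
        raise ValueError("params must be a JSON object")

    # Unpack the params object as keyword arguments for the handler.
    return HANDLERS[action](**params)
```

For example, `dispatch('{"action": "set_volume", "params": {"level": 7}}')` would call `set_volume(level=7)`. Validating the JSON before acting on it matters here because the model's output is not guaranteed to match the requested shape.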

Any insights, guidelines, or solutions would be helpful.
