Is There an API for ChatGPT’s Video Chat (Advanced Voice Mode)?

abezaidman · February 6, 2025, 3:39pm

Hi everyone,

I’m wondering if OpenAI has released or plans to release an API that supports the video chat capabilities found in ChatGPT’s Advanced Voice Mode. I know the Realtime API handles real-time audio and text, but is there any roadmap or current support for video input/output via API?

Any insights or updates would be appreciated. Thanks!

_j · February 6, 2025, 4:10pm

“Video chat” is actually by providing a low sample rate stream of images.

The realtime endpoint doesn’t support images:

You can implement it with Chat Completions, however:

The audio latency is reduced by not needing a separate transcription of audio input, nor TTS for output.

However, image processing requires more time before output generation begins. I would use detail:low on images from video at maximum 512px.

You also have to build your own voice activity detection if not simply making a record/send button.

Further insights: not a peep.

alexsam986 · February 9, 2026, 12:40pm

Hi everyone,

I’m wondering if OpenAI has released or plans to release an API that supports the video chat capabilities found in ChatGPT’s Advanced Voice Mode. I know the Realtime API handles real-time audio and text, but is there any roadmap or current support for video input/output via API?

Any insights or updates would be appreciated.

Thanks!

Topic		Replies	Views
Video conversation gpt-4 conversion API API gpt-4	1	499	August 1, 2024
Realtime API Video Input like Advanced Voice Mode API realtime	4	1003	August 21, 2025
Latest real time Audio capabilities to Api API audio , gpt-4o	2	576	August 3, 2024
When will API support image/audio as input and output? API gpt-4 , chatgpt , api	1	1753	October 9, 2023
Voice to voice via API possible? API gpt-4 , api	1	595	May 27, 2024

Is There an API for ChatGPT’s Video Chat (Advanced Voice Mode)?

Related topics