|
Realtime API audio response unexpectedly repeated an unrelated personal sentence multiple times mid-conversation
|
|
3
|
119
|
May 28, 2026
|
|
Permission error: "Missing scopes: model.request" despite correct organization roles
|
|
3
|
120
|
April 21, 2026
|
|
What is the difference between realtime-transcription and speech-to-text for streaming the transcription of an ongoing audio recording?
|
|
3
|
717
|
March 31, 2026
|
|
Custom voices in the API not available
|
|
3
|
97
|
March 31, 2026
|
|
Best Practices for Maintaining Speaker Identity Across Chunks with gpt-4o-transcribe-diarize?
|
|
4
|
993
|
February 9, 2026
|
|
GPT-4o Realtime outperforms newer gpt-realtime in voice quality despite lower "Performance" rating (Spanish)
|
|
0
|
342
|
January 27, 2026
|
|
Introducing GPT-4o Transcribe Diarize: Now Available in the Audio API
|
|
7
|
4796
|
November 14, 2025
|
|
Can't Hear Inbound Audio from OpenAI Realtime Agent (WebRTC) - Outbound Works, Inbound Stuck at 1 kbps
|
|
2
|
300
|
November 10, 2025
|
|
GPT-Audio Not working - Error 500
|
|
13
|
710
|
October 9, 2025
|
|
OpenAI Text To Speech - Speaking with emotion/effects / price
|
|
0
|
99
|
September 16, 2025
|
|
New in Evals: Full Audio Support
|
|
0
|
130
|
September 12, 2025
|
|
Want to Listen to ChatGPT Voices Reading Your Text? Now You Can! Introducing GPT Reader: A Free ChatGPT Powered Text to Speech Extension!
|
|
2
|
2372
|
September 6, 2025
|
|
AI Audio Gone Wild gallery 2025 - Your Audio Clips from gpt-4o-audio and tts
|
|
5
|
659
|
August 13, 2025
|
|
librosa + numpy compatibility bug broke all .wav and .flac audio analysis in GPT-4o
|
|
4
|
558
|
July 7, 2025
|
|
Realtime "modalities" session config not disabling local->model audio channel
|
|
3
|
366
|
June 6, 2025
|
|
Project: Running your own Whisper-Large-v3 model and extract Audio Embeddings
|
|
10
|
8074
|
May 11, 2025
|
|
Create transcription with gpt-4o-transcribe – max audio file length/size?
|
|
0
|
250
|
March 24, 2025
|
|
Audio Models in the API March 20, 2025
|
|
0
|
254
|
March 20, 2025
|
|
Speech to Text (ASR) Strategy
|
|
8
|
796
|
March 10, 2025
|
|
Gpt-4o-audio-preview responds in text, not audio
|
|
6
|
1852
|
January 25, 2025
|
|
Realtime API re-consuming its own output audio as input audio
|
|
10
|
1411
|
January 10, 2025
|
|
Is there a way to prevent gpt-4o-audio-preview from returning audio?
|
|
8
|
775
|
December 17, 2024
|
|
Can't get the right audio format for recording in web application with whisper on iOS
|
|
0
|
130
|
November 20, 2024
|
|
Logit_bias for gpt-4o-audio-preview
|
|
1
|
90
|
November 13, 2024
|
|
CoT with 4o Audio or Real Time
|
|
4
|
473
|
November 12, 2024
|
|
500 error in request to gpt-4o-audio-* model
|
|
13
|
570
|
November 12, 2024
|
|
Multiturn conversation format using gpt-4o-audio-preview with audio input
|
|
1
|
593
|
November 12, 2024
|
|
Issues with gpt-4o-audio-preview when using tools/functions
|
|
1
|
471
|
November 12, 2024
|
|
Cached input audio_tokens is always 0
|
|
3
|
581
|
November 8, 2024
|
|
How to replace my GPT TTS call for better performance?
|
|
1
|
357
|
November 5, 2024
|