|
Best Practices for Maintaining Speaker Identity Across Chunks with gpt-4o-transcribe-diarize?
|
|
4
|
721
|
February 9, 2026
|
|
GPT-4o Realtime outperforms newer gpt-realtime in voice quality despite lower "Performance" rating (Spanish)
|
|
0
|
169
|
January 27, 2026
|
|
Introducing GPT-4o Transcribe Diarize: Now Available in the Audio API
|
|
7
|
4203
|
November 14, 2025
|
|
Can't Hear Inbound Audio from OpenAI Realtime Agent (WebRTC) - Oubound Works, Inbound Stuck at 1 kbps
|
|
2
|
215
|
November 10, 2025
|
|
GPT-Audio Not working - Error 500
|
|
14
|
615
|
October 15, 2025
|
|
OpenAI Text To Speech - Speaking with emotion/effets / price
|
|
0
|
75
|
September 16, 2025
|
|
New in Evals: Full Audio Support
|
|
0
|
97
|
September 12, 2025
|
|
:speaking_head: Want to Listen to ChatGPT Voices Reading Your Text? Now You Can! Introducing GPT Reader: A Free ChatGPT Powered Text to Speech Extension!
|
|
2
|
2140
|
September 6, 2025
|
|
AI Audio Gone Wild gallery 2025 - Your Audio Clips from gpt-4o-audio and tts
|
|
5
|
528
|
August 13, 2025
|
|
librosa + numpy compatibility bug broke all .wav and .flac audio analysis in GPT-4o
|
|
4
|
486
|
July 7, 2025
|
|
Realtime "modalities" session config not disabling local->model audio channel
|
|
3
|
313
|
June 6, 2025
|
|
Project: Running your own Whisper-Large-v3 model and extract Audio Embeddings
|
|
10
|
6495
|
May 11, 2025
|
|
What is the difference between realtime-transcription and speech-to-text for Streaming the transcription of an ongoing audio recording?
|
|
2
|
596
|
April 1, 2025
|
|
Create transcription with gpt-4o-transcribe – max audio file length/size?
|
|
0
|
230
|
March 24, 2025
|
|
Audio Models in the API March 20, 2025
|
|
0
|
231
|
March 20, 2025
|
|
Speech to Text (ASR) Strategy
|
|
8
|
695
|
March 10, 2025
|
|
Gpt-4o-audio-preview responds in text, not audio
|
|
6
|
1769
|
January 25, 2025
|
|
Realtime API re-consuming it's own output audio as input audio
|
|
10
|
1332
|
January 10, 2025
|
|
Is there a way to prevent gpt-4o-audio-preview from returning audio?
|
|
8
|
739
|
December 17, 2024
|
|
Can`t get the right audio format for recording in web application with whisper on IOS
|
|
0
|
118
|
November 20, 2024
|
|
Logit_bias for gpt-4o-audio-preview
|
|
1
|
73
|
November 13, 2024
|
|
CoT with 4o Audio or Real Time
|
|
4
|
438
|
November 12, 2024
|
|
500 error in request to gpt-4o-audio-* model
|
|
13
|
510
|
November 12, 2024
|
|
Multiturn conversation format using gpt-4o-audio-preview with audio input
|
|
1
|
555
|
November 12, 2024
|
|
Issues with gpt-4o-audio-preview when using tools/functions
|
|
1
|
449
|
November 12, 2024
|
|
Cached input audio_tokens is always 0
|
|
3
|
538
|
November 8, 2024
|
|
How to replace my GPT TTS call for better performance?
|
|
1
|
334
|
November 5, 2024
|
|
Waiting for gpt-4o-audio-preview
|
|
11
|
4016
|
November 4, 2024
|
|
TranscriptionVerbose.duration is a number, not a string
|
|
0
|
99
|
October 25, 2024
|
|
Translation api returns incorect api key while the same key works for chat
|
|
2
|
133
|
October 11, 2024
|