|
Introducing GPT-4o Transcribe Diarize: Now Available in the Audio API
|
|
7
|
3613
|
November 14, 2025
|
|
Can't Hear Inbound Audio from OpenAI Realtime Agent (WebRTC) - Oubound Works, Inbound Stuck at 1 kbps
|
|
2
|
161
|
November 10, 2025
|
|
Best Practices for Maintaining Speaker Identity Across Chunks with gpt-4o-transcribe-diarize?
|
|
3
|
454
|
November 4, 2025
|
|
GPT-Audio Not working - Error 500
|
|
14
|
567
|
October 15, 2025
|
|
OpenAI Text To Speech - Speaking with emotion/effets / price
|
|
0
|
65
|
September 16, 2025
|
|
New in Evals: Full Audio Support
|
|
0
|
93
|
September 12, 2025
|
|
:speaking_head: Want to Listen to ChatGPT Voices Reading Your Text? Now You Can! Introducing GPT Reader: A Free ChatGPT Powered Text to Speech Extension!
|
|
2
|
2023
|
September 6, 2025
|
|
AI Audio Gone Wild gallery 2025 - Your Audio Clips from gpt-4o-audio and tts
|
|
5
|
468
|
August 13, 2025
|
|
librosa + numpy compatibility bug broke all .wav and .flac audio analysis in GPT-4o
|
|
4
|
439
|
July 7, 2025
|
|
Realtime "modalities" session config not disabling local->model audio channel
|
|
3
|
275
|
June 6, 2025
|
|
Project: Running your own Whisper-Large-v3 model and extract Audio Embeddings
|
|
10
|
5292
|
May 11, 2025
|
|
What is the difference between realtime-transcription and speech-to-text for Streaming the transcription of an ongoing audio recording?
|
|
2
|
562
|
April 1, 2025
|
|
Create transcription with gpt-4o-transcribe – max audio file length/size?
|
|
0
|
215
|
March 24, 2025
|
|
Audio Models in the API March 20, 2025
|
|
0
|
224
|
March 20, 2025
|
|
Speech to Text (ASR) Strategy
|
|
8
|
641
|
March 10, 2025
|
|
Gpt-4o-audio-preview responds in text, not audio
|
|
6
|
1715
|
January 25, 2025
|
|
Realtime API re-consuming it's own output audio as input audio
|
|
10
|
1280
|
January 10, 2025
|
|
Is there a way to prevent gpt-4o-audio-preview from returning audio?
|
|
8
|
711
|
December 17, 2024
|
|
Can`t get the right audio format for recording in web application with whisper on IOS
|
|
0
|
107
|
November 20, 2024
|
|
Logit_bias for gpt-4o-audio-preview
|
|
1
|
58
|
November 13, 2024
|
|
CoT with 4o Audio or Real Time
|
|
4
|
423
|
November 12, 2024
|
|
500 error in request to gpt-4o-audio-* model
|
|
13
|
484
|
November 12, 2024
|
|
Multiturn conversation format using gpt-4o-audio-preview with audio input
|
|
1
|
538
|
November 12, 2024
|
|
Issues with gpt-4o-audio-preview when using tools/functions
|
|
1
|
440
|
November 12, 2024
|
|
Cached input audio_tokens is always 0
|
|
3
|
510
|
November 8, 2024
|
|
How to replace my GPT TTS call for better performance?
|
|
1
|
325
|
November 5, 2024
|
|
Waiting for gpt-4o-audio-preview
|
|
11
|
3977
|
November 4, 2024
|
|
TranscriptionVerbose.duration is a number, not a string
|
|
0
|
87
|
October 25, 2024
|
|
Translation api returns incorect api key while the same key works for chat
|
|
2
|
108
|
October 11, 2024
|
|
'Transcription Outsourcing, LLC' repeated throughout whisper transcript
|
|
18
|
1559
|
October 5, 2024
|