| Introducing GPT-4o Transcribe Diarize: Now Available in the Audio API | 7 | 2472 | November 14, 2025 |
| Can't Hear Inbound Audio from OpenAI Realtime Agent (WebRTC) - Outbound Works, Inbound Stuck at 1 kbps | 2 | 53 | November 10, 2025 |
| Best Practices for Maintaining Speaker Identity Across Chunks with gpt-4o-transcribe-diarize? | 3 | 188 | November 4, 2025 |
| GPT-Audio Not working - Error 500 | 14 | 483 | October 15, 2025 |
| OpenAI Text To Speech - Speaking with emotion/effects / price | 0 | 45 | September 16, 2025 |
| New in Evals: Full Audio Support | 0 | 77 | September 12, 2025 |
| :speaking_head: Want to Listen to ChatGPT Voices Reading Your Text? Now You Can! Introducing GPT Reader: A Free ChatGPT Powered Text to Speech Extension! | 2 | 1791 | September 6, 2025 |
| AI Audio Gone Wild gallery 2025 - Your Audio Clips from gpt-4o-audio and tts | 5 | 358 | August 13, 2025 |
| librosa + numpy compatibility bug broke all .wav and .flac audio analysis in GPT-4o | 4 | 346 | July 7, 2025 |
| Realtime "modalities" session config not disabling local->model audio channel | 3 | 218 | June 6, 2025 |
| Project: Running your own Whisper-Large-v3 model and extract Audio Embeddings | 10 | 3698 | May 11, 2025 |
| What is the difference between realtime-transcription and speech-to-text for Streaming the transcription of an ongoing audio recording? | 2 | 524 | April 1, 2025 |
| Create transcription with gpt-4o-transcribe – max audio file length/size? | 0 | 195 | March 24, 2025 |
| Audio Models in the API March 20, 2025 | 0 | 205 | March 20, 2025 |
| Speech to Text (ASR) Strategy | 8 | 557 | March 10, 2025 |
| Gpt-4o-audio-preview responds in text, not audio | 6 | 1642 | January 25, 2025 |
| Realtime API re-consuming its own output audio as input audio | 10 | 1195 | January 10, 2025 |
| Is there a way to prevent gpt-4o-audio-preview from returning audio? | 8 | 668 | December 17, 2024 |
| Can't get the right audio format for recording in web application with whisper on iOS | 0 | 99 | November 20, 2024 |
| Logit_bias for gpt-4o-audio-preview | 1 | 50 | November 13, 2024 |
| CoT with 4o Audio or Real Time | 4 | 402 | November 12, 2024 |
| 500 error in request to gpt-4o-audio-* model | 13 | 453 | November 12, 2024 |
| Multiturn conversation format using gpt-4o-audio-preview with audio input | 1 | 513 | November 12, 2024 |
| Issues with gpt-4o-audio-preview when using tools/functions | 1 | 411 | November 12, 2024 |
| Cached input audio_tokens is always 0 | 3 | 478 | November 8, 2024 |
| How to replace my GPT TTS call for better performance? | 1 | 312 | November 5, 2024 |
| Waiting for gpt-4o-audio-preview | 11 | 3904 | November 4, 2024 |
| TranscriptionVerbose.duration is a number, not a string | 0 | 80 | October 25, 2024 |
| Translation API returns incorrect API key while the same key works for chat | 2 | 84 | October 11, 2024 |
| 'Transcription Outsourcing, LLC' repeated throughout whisper transcript | 18 | 1404 | October 5, 2024 |