|
Simple hack to make LLMs listen to an audio rather than only reading
|
|
1
|
71
|
November 12, 2025
|
|
All my attempts to improve accuracy and reduce hallucinations have the opposite effect!
|
|
7
|
2710
|
November 10, 2025
|
|
Pricing When Using the Whisper Model with RealtimeAPI
|
|
1
|
90
|
October 29, 2025
|
|
Gpt-4o-transcribe-diarize returns “chunking_strategy is required” — any working example or schema?
|
|
4
|
302
|
October 28, 2025
|
|
Building a WhatsApp Travel Concierge with GPT-4o + Whisper + Flask
|
|
0
|
38
|
October 24, 2025
|
|
Whisper Streaming Strategy
|
|
10
|
18012
|
October 22, 2025
|
|
Best solution for Whisper diarization/speaker labeling?
|
|
20
|
42729
|
October 16, 2025
|
|
Hosting Whisper model on Vultr Machine
|
|
1
|
45
|
September 29, 2025
|
|
Issue with Arabic Transcription in Whisper-Large-V3-Turbo ("نعم" → "Naah"/"Naahe")
|
|
5
|
411
|
September 14, 2025
|
|
Unrecognized file format error whisper BytesIO, can't write to disk
|
|
8
|
2166
|
August 29, 2025
|
|
Self hosting Open Ai Whisper
|
|
3
|
1004
|
August 21, 2025
|
|
GPT-4o-Transcribe: Why Does the Final Output Sometimes Exactly Replicate the Configured Prompt?
|
|
5
|
747
|
August 19, 2025
|
|
Whisper Cost Optimisation for Transcriptions
|
|
2
|
252
|
August 5, 2025
|
|
Web Speech API with whisper
|
|
1
|
360
|
July 24, 2025
|
|
Unable to get word level timestamp from AzureOpenAI client, whisper-1
|
|
0
|
57
|
June 20, 2025
|
|
Whisper is translating my audios for some reason
|
|
24
|
12819
|
June 18, 2025
|
|
Timings offset for long audio in whisperX
|
|
1
|
131
|
June 9, 2025
|
|
Realtime transcription model changes
|
|
2
|
452
|
May 27, 2025
|
|
How to avoid Hallucinations in Whisper transcriptions?
|
|
33
|
24189
|
May 20, 2025
|
|
Whisper ASR Model Skipping Chunks in Audio Transcription
|
|
1
|
605
|
May 20, 2025
|
|
Scrybe Quill TTRPG Recaps - Recapping Tool Powered by OpenAI's Whisper, LLMs, and ElevenLabs
|
|
15
|
504
|
May 20, 2025
|
|
Project: Running your own Whisper-Large-v3 model and extract Audio Embeddings
|
|
10
|
3698
|
May 11, 2025
|
|
How can I integrate Whisper Large v3 into Qwen or InternVL3?
|
|
3
|
288
|
April 23, 2025
|
|
ERR_NETWORK when calling /v1/audio/transcriptions API
|
|
3
|
268
|
April 22, 2025
|
|
Inconsistencies in the Temperature parameter in Transcriptions endpoint
|
|
0
|
164
|
March 26, 2025
|
|
Whisper sometimes randomly skip sentence
|
|
6
|
2248
|
April 18, 2025
|
|
Whisper (API) significant bug with a specific audio
|
|
1
|
151
|
April 17, 2025
|
|
Whisper stalls consistently after specifically 1000 files during epoch
|
|
1
|
76
|
April 8, 2025
|
|
Get 400 Bad Request error for transcript api
|
|
1
|
1051
|
April 4, 2025
|
|
What is the difference between realtime-transcription and speech-to-text for Streaming the transcription of an ongoing audio recording?
|
|
2
|
524
|
April 1, 2025
|