OpenAI Whisper - Send Bytes (Python) instead of filename

jayseaeff · June 27, 2023, 2:40am

Hi All,

I had the same problem and found a solution. I was using pydub to load and edit audio segments, and I wanted to send a pydub audio segment directly to Whisper without creating a temporary file. The following approach worked: create a BytesIO buffer, give it a name with a supported extension, export the audio into it in that format, and then pass the buffer to Whisper:

import io

import openai
from pydub import AudioSegment

fname = "file.mp3"
audio = AudioSegment.from_file(fname, format="mp3")
# only use the first 5 seconds (pydub slices are in milliseconds)
audio = audio[:5000]

buffer = io.BytesIO()
# the buffer needs a name with a file extension so the format can be recognized
buffer.name = fname
# encode the trimmed audio into the in-memory buffer instead of a file on disk
audio.export(buffer, format="mp3")

transcript = openai.Audio.transcribe("whisper-1", buffer)
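
If you want to try the named-buffer idea without pydub or an API key, here is a minimal sketch using only the standard library. It builds a short silent WAV in memory and wraps it in a BytesIO with a name attached. The helper names make_silent_wav and as_named_buffer and the file name "clip.wav" are made up for illustration, and the silent clip is only a stand-in for real audio.

```python
import io
import wave


def make_silent_wav(seconds: float = 1.0, sample_rate: int = 16000) -> bytes:
    """Build a mono 16-bit silent WAV entirely in memory (placeholder audio)."""
    raw = io.BytesIO()
    with wave.open(raw, "wb") as wav:
        wav.setnchannels(1)           # mono
        wav.setsampwidth(2)           # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * int(seconds * sample_rate))
    return raw.getvalue()


def as_named_buffer(data: bytes, filename: str) -> io.BytesIO:
    """Wrap raw bytes in a BytesIO and attach a name carrying the extension."""
    buffer = io.BytesIO(data)
    buffer.name = filename  # hypothetical file name; nothing is written to disk
    return buffer


buffer = as_named_buffer(make_silent_wav(0.5), "clip.wav")
print(buffer.name, len(buffer.getvalue()), "bytes")

# With the same SDK version as in the post above, the buffer would then be
# passed along like this (not executed here, it needs an API key):
# transcript = openai.Audio.transcribe("whisper-1", buffer)
```

The point is just that the buffer carries a file-like name ending in a supported extension and is positioned at the start before it is handed over.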
