I also had this problem and managed to find a solution. I was using pydub to load and edit audio segments and wanted to send a pydub audio segment directly to Whisper without creating a temporary file. The following approach worked: create a BytesIO buffer, encode the audio into it in a supported format, and then pass it to Whisper:
import io

import openai
from pydub import AudioSegment

fname = "file.mp3"
audio = AudioSegment.from_file(fname, format="mp3")
# only use the first 5 seconds
audio = audio[:5000]

buffer = io.BytesIO()
# you need to set the name with the extension
buffer.name = fname
audio.export(buffer, format="mp3")

transcript = openai.Audio.transcribe("whisper-1", buffer)
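(Note that the snippet above uses the pre-1.0 openai package. The same buffer trick should carry over to the newer client interface; a minimal sketch of that, assuming the 1.x SDK:)

import io

from openai import OpenAI
from pydub import AudioSegment

client = OpenAI()  # reads OPENAI_API_KEY from the environment

audio = AudioSegment.from_file("file.mp3", format="mp3")[:5000]

buffer = io.BytesIO()
buffer.name = "file.mp3"  # the extension is still what conveys the format
audio.export(buffer, format="mp3")

transcript = client.audio.transcriptions.create(model="whisper-1", file=buffer)
print(transcript.text)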
The solution from @jayseaeff worked for me, even without using pydub. The important part is to set the .name attribute on the buffer object: the Whisper API appears to infer the file type from the extension on that attribute rather than inspecting the raw bytes themselves. Here's a snippet that works in my setup (GraphQL with multipart file uploads):
import io

import strawberry
from openai import AsyncOpenAI
from strawberry.file_uploads import Upload

openai_client = AsyncOpenAI()


@strawberry.type
class Mutation:
    @strawberry.mutation
    async def transcribe(self, audio_file: Upload) -> str:
        audio_data = await audio_file.read()
        buffer = io.BytesIO(audio_data)
        buffer.name = "file.mp3"  # this is the important line
        transcription = await openai_client.audio.transcriptions.create(
            model="whisper-1",
            file=buffer,
        )
        return transcription.text
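One caveat with hardcoding buffer.name = "file.mp3": if a client uploads audio that isn't MP3, the extension and the bytes will disagree. In the ASGI integrations, strawberry's Upload is typically a starlette UploadFile, which carries the original filename, so you can reuse it instead (a small sketch, assuming that integration):

audio_data = await audio_file.read()
buffer = io.BytesIO(audio_data)
# Reuse the uploaded file's own name so the extension matches the bytes,
# falling back to a default if the client sent no filename.
buffer.name = getattr(audio_file, "filename", None) or "file.mp3"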
Looking at the types in the Python SDK, it looks as though you can pass a bytes object to the file argument, but I haven't gotten this to work.
Yeah, in order to send the bytes of a file you need to pass more than just the raw bytes to the file parameter. You can see which types the file parameter accepts by jumping to its definition in VS Code or IntelliJ. Here's the pattern that is working for me:
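A minimal sketch of that pattern, assuming the 1.x SDK, whose type hints for file allow a (filename, contents) tuple so the extension travels alongside the raw bytes:

from pathlib import Path

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Raw bytes from disk here, but they could just as well come from an
# upload, a message queue, etc.
audio_bytes = Path("file.mp3").read_bytes()

# Passing a (filename, bytes) tuple lets the API infer the format from
# the extension, just like the buffer.name trick in the answers above.
transcript = client.audio.transcriptions.create(
    model="whisper-1",
    file=("file.mp3", audio_bytes),
)
print(transcript.text)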