Whisper API only transcribing first few seconds

awissink · October 28, 2023, 6:11pm

Hello! I am working on building a website where a user can record themselves and obtain a transcription of the recording using the Whisper API. The recordings seem to be working fine, as the files are intelligible after they are processed, but when I feed them into the API, only the first few seconds of transcription are returned. I’m not sure why this is happening and it seems like other discussions about this issue never reached a solution. Here’s my Flask code:

@app.route('/transcribe_audio', methods=['POST'])
def transcribe_audio():
    try:
        file = request.files['audio']
        audio_data = file.read()

        audio = open("backend/audios/audio.mp4", "wb")
        audio.write(audio_data)
        audio.close()

        # Make a request to the Whisper API for transcription using the OpenAI Python library
        audio_file = open("backend/audios/audio.mp4", "rb")
        response = openai.Audio.transcribe("whisper-1", audio_file)
        print(response)

        # Extract the transcribed text from the response
        transcribed_text = response['text']

        return jsonify({'transcribed_text': transcribed_text}), 200
    except Exception as e:
        print(f"Error processing audio: {str(e)}")
        return jsonify({'error_message': str(e)}), 500

Has anyone run into this before and/or knows how to fix this? I’m wondering if there’s something funky with the file encoding or something.

Foxalabs · October 28, 2023, 6:19pm

Hi,

This seems identical to an issue another forum member posted a couple of hours ago, can I ask if you are using an Apple product to do the recording or any audio manipulation?

awissink · October 28, 2023, 6:27pm

Hi, yes I am! I’m using Safari on a MacBook Pro with an M1 chip.

awissink · October 28, 2023, 6:33pm

Update: this is definitely an API-related bug, as I just tried using the Github version of Whisper in my web app and it worked perfectly. Hopefully this gets resolved soon as I’d much rather use the official API!

Foxalabs · October 28, 2023, 7:20pm

Indeed, seem to have a few members post similar things all around the same time, I’m sure it will be investigated soon.

_j · October 28, 2023, 7:22pm

Every other platform works fine except the Apple stack. Are you sure you don’t want to blame Apple and their audio encoding?

Here’s that link to a link:

Foxalabs · October 28, 2023, 8:20pm

I wonder if it’s anything along these lines

as in, there has been an update to a system file from apple that’s caused this.

Has there been a recent apple update? Not a mac user so I don’t know if they do stealthy updates or how that works.

Topic		Replies	Views
Whisper API not transcribing audio files coming from an iphone API ios , whisper , javascript	10	2538	December 18, 2024
Whisper api completely wrong for mp4 API whisper	14	5353	December 15, 2023
Whisper API is not able to transcribe audios created on iOS API api	2	2538	December 15, 2023
Issues with audio files from IOS and the x-m4a format API whisper	14	2083	July 21, 2024
MediaRecorder API w/ Whisper not working on mobile browsers API whisper , as-wiki	7	2113	December 20, 2024

Whisper API only transcribing first few seconds

Related topics