Whisper help ឵឵឵឵឵឵឵឵឵឵឵឵ ឵឵឵឵឵឵ ឵឵឵

DonDemon · March 13, 2024, 2:14pm

Can someone please share python code with whisper api ឵឵឵឵឵឵឵឵឵ ឵឵឵឵឵឵឵឵឵឵឵឵ ឵឵឵឵឵឵ ឵឵឵឵឵឵឵឵឵឵឵឵ ឵឵឵ ឵឵឵ ឵឵឵

Innovatix · March 13, 2024, 2:46pm

Basic Whisper API Example:

import requests

openai_api_key = 'ADD YOUR KEY HERE'
file_path = '/path/to/file/audio.mp3'  # Add the Path of your audio file 
headers = {
    'Authorization': f'Bearer {openai_api_key}',
}

files = {
    'file': open(file_path, 'rb'),
    'model': (None, 'whisper-1'),
}

response = requests.post('https://api.openai.com/v1/audio/transcriptions', headers=headers, files=files)

print(response.text)

for more information, checkout the OpenAI documentation

_j · March 13, 2024, 3:02pm

Let’s crank up the level of “basic example”…

import os
import requests

# Gets the API key from environment variable
api_key = os.getenv("OPENAI_API_KEY")
headers = {"Authorization": f"Bearer {api_key}"}
url = "https://api.openai.com/v1/audio/transcriptions"

audio_file_name = "audio.mp3"
base_file_name = os.path.splitext(audio_file_name)[0]  # Get the base file name

with open(audio_file_name, "rb") as audio_file:
    parameters = {
        "file": (audio_file_name, audio_file),
        "language": (None, "en"),
        "model": (None, "whisper-1"),
        "prompt": (None, "Here is the radio show."),
        "response_format": (None, "verbose_json"),
        "temperature": (None, "0.1"),
        "timestamp_granularities[]" : (None, "word"),
    }
    response = requests.post(url, headers=headers, files=parameters)

if response.status_code != 200:
    print(f"HTTP error {response.status_code}: {response.text}")
else:
    # Get the transcribed text and timed words from the response
    transcribed_text = response.json()['text']
    words = response.json()['words']
    formatted_words = [
        {k: f"{v:.2f}" if isinstance(v, float) else v for k, v in word.items()}
        for word in words
    ]
    # Save text or words to a file
    try:
        with open(f"{base_file_name}_transcription.txt", "w") as file:
            file.write(transcribed_text)
        print(f"Transcribed text successfully saved to '{base_file_name}_transcription.txt'.")
        
        with open(f"{base_file_name}_timestamped.txt", "w") as file:
            file.write(str(formatted_words))
        print(f"Timestamped words successfully saved to '{base_file_name}_timestamped.txt'.")
    except Exception as e:
        print(f"output file error: {e}")

    print(formatted_words[:20])

This will use the same API environment variable as the OpenAI library.
It also demonstrates word time stamps, saving two files, one with the plain text, and one with the word list, based on the original file name. Then you get a bit of output shown also.

The requests library is one you’ll have to “pip install”

DonDemon · March 13, 2024, 3:20pm

tried both codes but it returns “no module named requiests”. (openAI library installed)

_j · March 13, 2024, 3:25pm

The two examples use the requests library, not the Python library. As mentioned in the last line I wrote above, you’ll have to install it just like you did openai, with a pip install requests in your OS shell or environment.

This also means that if you want the latest features like word timestamps, you don’t have to wait for the openai library to be updated to let those unknown parameters through.

Topic		Replies	Views
Help Putting Whisper Code Into Python Script API	2	2209	January 29, 2024
How to write a Python script for the new version of OpenAI Whisper API? API api	0	1664	March 21, 2024
Whisper and AI Speech API API whisper	0	405	May 12, 2024
Whisper api, not transcrip all audio API whisper	3	1990	October 28, 2023
🤷🏻‍♂️ Speech to text - whisper? API whisper	2	1304	January 12, 2023

Whisper help ឵឵឵឵឵឵឵឵឵឵឵឵ ឵឵឵឵឵឵ ឵឵឵

Related topics