When translating to arabic it repeats the last lines of the text

Rrrila · May 10, 2023, 3:38pm

When translating to Arabic, whisper repeats the last lines of text in the actual audio file, and it even gives it a timestamp on the actual video, which means it is actually cutting off or leaving out of the transcription the last part of the video.

I tested it with many audios from English to Arabic, and it does the same

I’m using large-v2, i will try with a different one just in case.

sps · May 10, 2023, 6:55pm

Hi @Rrrila

whisper is a speech to text model. It’s meant to transcribe not translate.

Currently the only translations supported are from non-English languages to English, not the other way.

Rrrila · May 10, 2023, 7:00pm

Is it plan to work they other way around (english to non-english or non-english to non-english)?

sps · May 10, 2023, 7:04pm

I’m not OpenAI staff, hence I’m not able to comment on that.

However if you want to translate to non-English languages, you can try using the /edits or completion to translate post transcription.

Rrrila · May 10, 2023, 7:06pm

As stated, im not developer my self so no tsure what do you mean by /edits or completion to tranlate post transcription, if you dont mind pointing me to the right direction so i can investigate myself

sps · May 10, 2023, 7:10pm

Sure,

Here’s the docs for text edit which is part of completions endpoint.
Here’s Docs for chat completion

Rrrila · May 10, 2023, 7:56pm

Thanks for the info, but unfortunately none of that would help me, as one is for editing the output on specific words or cases, and the other one uses ChatGPT which I’m trying to avoid. I guess they will add support for rest of the languages at some point…

Rrrila · May 10, 2023, 7:57pm

Although I would like to point out this thing does not happen when translating to lets say Spanish, it does it just fine, is on RTL Arabic Language when it goes crazy, so there might be a bug just as I’m saying.

sps · May 11, 2023, 7:00pm

Your assumption is incorrect.

/Edits can edit entire transcriptions to translate text.

Chat completion can do the same for lower cost per token.

Examples:

Rrrila · May 11, 2023, 7:45pm

I might still misunderstood your approach, but… isn’t that being translated by ChatGPT API? Meaning, there is no way to do so without paying per token, correct?

sps · May 12, 2023, 1:58pm

There is no ChatGPT API.

This is working with the /edits endpoint - Editing is free while in beta.

Here’s pricing for rest of models.

Topic		Replies	Views
Whisper-1 joint translation and transcription API	6	3564	October 21, 2024
Whisper API stutter and erring like LLMs API whisper	1	1200	December 25, 2023
Whisper Translation failure API whisper	5	2047	December 16, 2023
Whisper not processing palestine related requests API whisper	5	674	January 21, 2024
Whisper API Limits - Transcriptions API whisper	11	15421	December 18, 2023

When translating to arabic it repeats the last lines of the text

Examples:

Related topics