Any plans for releasing an API for TTS?

Innovatix · November 7, 2023, 11:53am

It supports multiple languages.

English translation of TTS:

Hello, world! This is a test of Russian TTS using OpenAI!

Audio file in Russian:
https://uploadnow.io/files/n6SC5Rp

Russian audio appears fine to me; any Russian speakers on the forum can confirm.

parthmendapara81 · November 7, 2023, 12:43pm

from pathlib import Path
import openai
import os

openai.api_key = 'My API Key'

# Save the audio next to this script
speech_file_path = Path(__file__).parent / "speech.mp3"

response = openai.audio.speech.create(
    model="tts-1",  # also try tts-1-hd
    voice="alloy",  # other voices are echo, fable, onyx, nova, shimmer
    input="Today is a wonderful day to build something people love!"
)

response.stream_to_file(speech_file_path)

Getting this error: AttributeError: module 'openai' has no attribute 'audio'

nikola1jankovic · November 7, 2023, 1:18pm

Well, I don’t think other languages will do here. I have tested in Italian, Spanish, Serbian and Greek.

For all but Greek, it sounds like an American actor trying to read these texts in a foreign language, but failing almost miserably. It is not a “slight accent”, as someone reported. The results are unusable - apart from maybe some comic needs.

Innovatix · November 7, 2023, 1:42pm

If you don’t like it, you can leave feedback. There are also better alternatives like ElevenLabs, which is even free for 10k tokens.

Innovatix · November 7, 2023, 1:47pm

Try checking the version with "!pip show openai". If your version is 0.27.0 or older, upgrade with the command below. My version is 1.1.1.

!pip install --upgrade openai

then try running this code:
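
A minimal sketch of the version check and a follow-up call. This is not the poster's original code. It assumes the speech endpoint first shipped in openai 1.1.0 (the thread only confirms that 1.1.1 works), and the helper names `parse_version`, `has_speech_api`, and `synthesize` are made up for illustration.

```python
import re
from importlib.metadata import PackageNotFoundError, version
from pathlib import Path

# Assumption: first release exposing audio.speech. The thread only
# confirms that 1.1.1 works and that much older versions do not.
MIN_SPEECH_VERSION = (1, 1, 0)


def parse_version(version_string):
    """Turn '1.1.1' (or '1.0.0rc1') into a tuple of three ints."""
    parts = []
    for piece in version_string.split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def has_speech_api(version_string):
    """True if this version is at least MIN_SPEECH_VERSION."""
    return parse_version(version_string) >= MIN_SPEECH_VERSION


def installed_openai_version():
    """Return the installed openai version, or None if it is missing."""
    try:
        return version("openai")
    except PackageNotFoundError:
        return None


def synthesize(text, out_path):
    """Write `text` as speech to out_path. Needs network access and an API key."""
    from openai import OpenAI  # imported lazily so the checks above run without it

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.audio.speech.create(model="tts-1", voice="alloy", input=text)
    response.stream_to_file(out_path)


if __name__ == "__main__":
    found = installed_openai_version()
    if found is None or not has_speech_api(found):
        print(f"openai {found}: upgrade with 'pip install --upgrade openai'")
    else:
        print(f"openai {found} should expose audio.speech")
```

Once the check passes, calling synthesize("Today is a wonderful day to build something people love!", Path("speech.mp3")) should write the MP3 to the current directory.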

aweb1 · November 7, 2023, 1:55pm

For Chinese, the quality is unusable (50% of the time it produces an incomprehensible abomination of audio).

nikola1jankovic · November 7, 2023, 3:32pm

I believe they are aware of that, as they did not mention languages at any point. It was probably a decision to release this and then work on other languages.

ElevenLabs is great and sounds much better in other languages, but it is around 7x to 15x more expensive, which makes it too costly for regular use.

cyrus.nemati · November 7, 2023, 5:04pm

I dream of the day they’ll reveal their TTS synthesis process! I’m amazed/horrified when my AI coughs. I’d love to record my own for my particular Assistant implementation.

aweb1 · November 9, 2023, 11:25am

Bumping this topic for visibility. Performance is very bad for non-English languages such as Chinese.
