Any plans for releasing an API for TTS?

aprendendo.next · September 27, 2023, 8:30pm

The new voice conversations feature is rolling out now, and it seems to be able to speak in different languages.

Is there a plan for making it available through the API?

I know there are other TTS solutions out there, but OpenAI has been able to provide reasonable prices overall, and it would be useful for platforms that don’t have a native TTS engine.

Foxalabs · September 28, 2023, 12:33pm

Hi and welcome to the Developer Forum!

Nothing official has been announced yet. I would imagine that if it does become an API, it may arrive alongside an official announcement of sound/image APIs. Again, no timelines yet.

Innovatix · September 28, 2023, 12:45pm

Welcome to the community!

OpenAI currently only offers speech-to-text (STT), but we as users hope to have text-to-speech (TTS) capabilities by 2024. In the meantime, you can use another TTS service, such as ElevenLabs, which offers great TTS capabilities.

_j · September 28, 2023, 3:08pm

Who is this “we” that you mention here?

romain · November 6, 2023, 1:02pm

It’s coming!

Innovatix · November 6, 2023, 1:13pm

Yes, we knew that. Thanks for the update.

msveshnikov · November 6, 2023, 9:28pm

Can I use it right now, please? Currently I get:
file:///C:/My-progs/Node.JS/tales/server/node_modules/openai/error.mjs:57
return new RateLimitError(status, error, message, headers);
^

RateLimitError: 429 You exceeded your current quota, please check your plan and billing details.
at APIError.generate (file:///C:/My-progs/Node.JS/tales/server/node_modules/openai/error.mjs:57:20)
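
A 429 with this message usually points at billing or quota rather than request pacing, so retrying does not help. A minimal sketch of how a caller might tell the two cases apart is below. It assumes the thrown error exposes `status` and `message` (as the trace above suggests) and possibly a `code` field; `classifyRateLimit` is a hypothetical helper, not part of the SDK.

```javascript
// Hypothetical helper: decide whether a 429 is worth retrying.
// Assumes the error object carries `status`, and `message` and/or `code`.
function classifyRateLimit(err) {
  if (!err || err.status !== 429) return "not-rate-limit";
  // Quota problems mention "quota" in the message (see the trace above);
  // the `code` check is an extra assumption about the error shape.
  const text = `${err.code ?? ""} ${err.message ?? ""}`.toLowerCase();
  return text.includes("quota") ? "quota-exhausted" : "retry-later";
}
```

With the error from the trace, this returns `"quota-exhausted"`, which suggests checking the plan and billing page instead of adding a retry loop.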

Foxalabs · November 6, 2023, 9:38pm

It will be rolled out “soon”. If I can get more details, I’ll post an update.

aprendendo.next · November 6, 2023, 11:02pm

Yeah, today’s keynote was incredible. I am very happy that we now have a model with more reasonable prices, and I hope the rest of the market follows OpenAI’s lead on this.

msveshnikov · November 7, 2023, 6:40am

OK, now it works! I’m not sure whether it was a problem with the key or whether the quota just got activated for my account.
Also, all languages seem to be supported and are automatically detected from the input string. It is a great improvement over Google TTS. The price is also on par (Google charges $16 per 1M characters for Neural2 models).
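
To make the price comparison concrete, here is a rough per-text cost estimate. The Google figure is the one quoted above; the `tts-1` figure of $15 per 1M characters is an assumption based on launch pricing and should be checked against the current pricing page. `estimateCostUsd` and the engine keys are hypothetical names.

```javascript
// Prices in USD per 1,000,000 characters.
// "google-neural2" is the figure quoted in this thread; "openai-tts-1" is an
// assumed launch price and may have changed.
const PRICE_PER_MILLION_CHARS = {
  "openai-tts-1": 15,
  "google-neural2": 16,
};

// Rough estimate only: text.length counts UTF-16 code units, which may not
// match how a provider counts billable characters.
function estimateCostUsd(text, engine) {
  const rate = PRICE_PER_MILLION_CHARS[engine];
  if (rate === undefined) throw new Error(`Unknown engine: ${engine}`);
  return (text.length / 1_000_000) * rate;
}
```

Under these assumptions, a 500,000-character book would cost about $7.50 with `tts-1` and $8.00 with Neural2.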

nikola1jankovic · November 7, 2023, 7:44am

How does it work in languages other than English? Does it sound natural, or does it give you a voice with a funny American accent?

msveshnikov · November 7, 2023, 8:09am

This is quite funny. Russian has a slight (really almost unnoticeable) accent on some letters. But it is better than the Google voices, I would say.

See example here: https://mangatv.shop/api/video/B7zROoMrVxgfhx_rxdl2u.mp4

colibryx · November 7, 2023, 8:33am

Italian TTS is quite good, with a slight British accent.

nikola1jankovic · November 7, 2023, 9:36am

Hm, the link you shared is in English?

parthmendapara81 · November 7, 2023, 11:12am

How can I use the TTS API? I am following the OpenAI documentation: OpenAI Platform

Could you send me a simple working code snippet? With the code in the documentation I am getting errors.

msveshnikov · November 7, 2023, 11:30am

Yep, because the accent is actually hard to spot. Let me know which language you are interested in and I will generate a sample.

msveshnikov · November 7, 2023, 11:33am

Like this (Node.js):

import fs from "fs";
import util from "util";
import textToSpeech from "@google-cloud/text-to-speech";
// Assumed sources for two helpers the original snippet used without importing:
import { nanoid } from "nanoid";
import { getAudioDurationInSeconds } from "get-audio-duration";
import { openai } from "./index.js"; // an initialized OpenAI client

// Used by getGoogleAudio (the Google fallback, not shown in this post).
const tts = new textToSpeech.TextToSpeechClient();
const OpenAiVoices = ["alloy", "echo", "fable", "onyx", "nova", "shimmer"];
const writeFile = util.promisify(fs.writeFile);

// Use OpenAI when the voice is one of its built-in voices, otherwise Google.
export const getAudio = async (text, lang, voice) => {
  const audio = OpenAiVoices.includes(voice)
    ? await getOpenAiAudio(text, voice)
    : await getGoogleAudio(text, lang, voice);
  const audioName = nanoid();
  const audioPath = `./media/audio/${audioName}.mp3`;
  await writeFile(audioPath, audio, "binary");
  return {
    url: `/audio/${audioName}.mp3`,
    duration: await getAudioDurationInSeconds(audioPath),
  };
};

// Request speech from OpenAI and return the MP3 bytes as a Buffer.
const getOpenAiAudio = async (text, voice) => {
  const mp3 = await openai.audio.speech.create({
    model: "tts-1",
    voice,
    input: text,
  });
  return Buffer.from(await mp3.arrayBuffer());
};
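
For anyone who wants a starting point without the SDK or the Google fallback, the same request can be sketched with the built-in `fetch` in Node 18+. This is a minimal sketch, not the official client: `buildSpeechRequest` and `synthesize` are hypothetical names, and the API key is assumed to be passed in by the caller.

```javascript
const SPEECH_URL = "https://api.openai.com/v1/audio/speech";

// Build the fetch() arguments for one speech request.
function buildSpeechRequest(text, voice, apiKey) {
  return {
    url: SPEECH_URL,
    options: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model: "tts-1", voice, input: text }),
    },
  };
}

// Send the request and return the audio bytes (MP3 by default).
async function synthesize(text, voice, apiKey) {
  const { url, options } = buildSpeechRequest(text, voice, apiKey);
  const res = await fetch(url, options);
  if (!res.ok) {
    throw new Error(`Speech request failed: ${res.status} ${await res.text()}`);
  }
  return Buffer.from(await res.arrayBuffer());
}
```

A caller would typically run something like `const audio = await synthesize("Hello", "alloy", process.env.OPENAI_API_KEY)` and write the returned Buffer to an `.mp3` file. Keeping the request builder separate makes it possible to inspect the payload without hitting the network.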

nikola1jankovic · November 7, 2023, 11:36am

No, I thought you had produced audio spoken in Russian via OpenAI’s TTS?

Innovatix · November 7, 2023, 11:40am

You want to use OpenAI TTS? Which language? For Python, try running the code below, or adapt it to any other language.

python Code

nikola1jankovic · November 7, 2023, 11:46am

Basically, I am looking for information on which languages are supported. There is no word about it in the docs, and nothing was mentioned in yesterday’s speech. It is probably one of:

a) only English is supported, and they are pretending other languages don’t exist
b) everything is supported perfectly and they did not find it important to mention
c) everything kind-of works, but all speech is produced with some English speaker, producing awkward results for non-English speech
