This is a new blank post im making heres is a blank post im making

anon34024923 · October 28, 2023, 1:49pm

Heres me making a blank post because i feel like it. HazahHeres me making a blank post because i feel like it. Hazah

Foxalabs · October 28, 2023, 1:52pm

I would image there will be at least some discussion of the text to speech at DevDay on November 6th, S2S will I think be up to the developer as there are so many ways to do it.

anon34024923 · October 28, 2023, 1:53pm

Right but I figured OpenAI would be releasing their own voice models and API integrations for quick ease of use to compete with eleven-labs or the like. I imagine OpenAI will have the better, cheaper tech.

Eleven Labs is OK but its not something I would feel comfortable using for long form conversations as it just bugs out too much mid sentence.

Foxalabs · October 28, 2023, 1:55pm

Indeed, I really like the natural sound of the OAI speech models, it would be interesting to see if they are going down the trainable 11labs style route or if iut will just be fixed, I also don’t know if they are using a 3rd party solution, in which case there may be no mention of it at all.

anon34024923 · October 28, 2023, 1:58pm

Am I confused? Does Open AI already offer voice models? AFAIK That was coming with voice to voice integration?

Foxalabs · October 28, 2023, 2:02pm

Right now the only offering is voice transcription via Whisper, I have not seen anything official about a text to speech option yet or any mention of a voice to voice system, much as ChatGPT’s conversation system has to be handled in code when using the API, i.e. adding the new chat to the end of the conversation list, I think voice input and output will be two unlinked objects that developers can then put together how they wish, that is assuming that a speech output API even becomes public. It is very possible that the speech output system is for phone apps only and will not be made available.

Counting the days until DevDay so these things become clearer.

anon34024923 · October 28, 2023, 2:05pm

If nothin is announced on the 6th I’ll probably just figure a way to do it with a third party.

Foxalabs · October 28, 2023, 2:09pm

Yes, there are certainly no shortage of options, but it would be nice if OpenAI provided an API for it,

anon34024923 · October 28, 2023, 2:11pm

Do you have any recommendations? I really only know eleven-labs. I am less interested in customizable voices and more interested in stability of the voice over long form conversations

Foxalabs · October 28, 2023, 2:36pm

could take a look at the link below, but it’s worth bearing in mind that if OpenAI are building something, might be worth waiting a week to see.

todaystoryhub · January 29, 2024, 4:56pm

If “Voice2Voice” is a specific technology or project, you might want to look for updates on the official website, blog posts, or announcements from the developers or companies involved. Additionally, community forums, tech news websites, and social media channels could provide insights into any recent advancements.

If you have more context or details about Voice2Voice, feel free to provide them.

itsvnk · January 29, 2024, 5:37pm

Sorry if i sound silly.

Why not do voice to text, do the usual calls to API, then do text to voice and send it back to the user?

Will it be awfully slow? Has anyone tried this yet?

Topic		Replies	Views
Implementing audio conversation with AI API	8	4016	April 29, 2024
Did OpenAI just make a new AI Voice? API	7	2901	May 16, 2024
TTS API service usability API tts	17	6770	December 16, 2023
New model, tts-2, any news on it? (new voice mode) API tts	9	1847	February 21, 2025
Best 'text-to-speech' api to plug into a chatgpt bot? API	4	1715	January 25, 2025

This is a new blank post im making heres is a blank post im making

Related topics