Timeline on Voice2Voice for Devs?

tventura94 · October 28, 2023, 1:49pm

Very excited to integrate our app with voice to voice, I’m assuming OpenAI would give us devs access - wondering if anyone has any ideas if and when this is happening?

Foxalabs · October 28, 2023, 1:52pm

I would image there will be at least some discussion of the text to speech at DevDay on November 6th, S2S will I think be up to the developer as there are so many ways to do it.

tventura94 · October 28, 2023, 1:53pm

Right but I figured OpenAI would be releasing their own voice models and API integrations for quick ease of use to compete with eleven-labs or the like. I imagine OpenAI will have the better, cheaper tech.

Eleven Labs is OK but its not something I would feel comfortable using for long form conversations as it just bugs out too much mid sentence.

Foxalabs · October 28, 2023, 1:55pm

Indeed, I really like the natural sound of the OAI speech models, it would be interesting to see if they are going down the trainable 11labs style route or if iut will just be fixed, I also don’t know if they are using a 3rd party solution, in which case there may be no mention of it at all.

tventura94 · October 28, 2023, 1:58pm

Am I confused? Does Open AI already offer voice models? AFAIK That was coming with voice to voice integration?

Foxalabs · October 28, 2023, 2:02pm

Right now the only offering is voice transcription via Whisper, I have not seen anything official about a text to speech option yet or any mention of a voice to voice system, much as ChatGPT’s conversation system has to be handled in code when using the API, i.e. adding the new chat to the end of the conversation list, I think voice input and output will be two unlinked objects that developers can then put together how they wish, that is assuming that a speech output API even becomes public. It is very possible that the speech output system is for phone apps only and will not be made available.

Counting the days until DevDay so these things become clearer.

tventura94 · October 28, 2023, 2:05pm

If nothin is announced on the 6th I’ll probably just figure a way to do it with a third party.

Foxalabs · October 28, 2023, 2:09pm

Yes, there are certainly no shortage of options, but it would be nice if OpenAI provided an API for it,

tventura94 · October 28, 2023, 2:11pm

Do you have any recommendations? I really only know eleven-labs. I am less interested in customizable voices and more interested in stability of the voice over long form conversations

Foxalabs · October 28, 2023, 2:36pm

could take a look at the link below, but it’s worth bearing in mind that if OpenAI are building something, might be worth waiting a week to see.

todaystoryhub · January 29, 2024, 4:56pm

If “Voice2Voice” is a specific technology or project, you might want to look for updates on the official website, blog posts, or announcements from the developers or companies involved. Additionally, community forums, tech news websites, and social media channels could provide insights into any recent advancements.

If you have more context or details about Voice2Voice, feel free to provide them.

itsvnk · January 29, 2024, 5:37pm

Sorry if i sound silly.

Why not do voice to text, do the usual calls to API, then do text to voice and send it back to the user?

Will it be awfully slow? Has anyone tried this yet?

Topic		Replies	Views
Multiple API calls - high latency; options / product suggestion API chatgpt	21	2383	December 25, 2023
Implementing audio conversation with AI API	8	1443	April 29, 2024
Can (custom) GPT speak and respond via voice? Community gpt-4 , api , chatgpt-plugin	13	5522	November 21, 2023
TTS API service usability API tts	17	3335	December 16, 2023
GPTs with Custom Actions by Whisper API and TTS Feedback gpts	18	4519	December 4, 2023

Timeline on Voice2Voice for Devs?

Related Topics