New OpenAI Trademark: VOICE ENGINE

Sam Altman said on the Lex Fridman podcast that OpenAI will be launching several things soon. Two days ago, a new trademark registration appeared:


Some highlights I found (which are not present in the trademarks of GPT-4/5 or Whisper):

  • “processing voice commands, and converting between text and speech”
  • “computer software for creating and generating voice and audio outputs based on natural language prompts, text, speech, visual prompts, images, and/or video
  • “computer software for building digital voice assistants
  • “computer software for generation of audio and/or voice in response to user prompts
  • “computer software for use as an application programming interface (API)
  • “computer software development tools for the development of voice service delivery and natural language understanding technology across global computer networks, wireless networks, and electronic communications networks”

Something is cooking :eyes:

Source: https://uspto.report/TM/98456635

7 Likes

Apparently they didn’t learn from past rejections that you can not just take two functional words and jam them together and call it a trademark…

1 Like

Downloadable SDK? Huhhhh

2 Likes

here’s the actual uspto case page: https://tsdr.uspto.gov/#caseNumber=98456635&caseSearchType=US_APPLICATION&caseType=DEFAULT&searchType=statusSearch

I dunno- I think the strategy is to throw enough mud at the wall to see what sticks.

Since the cost is almost negligible for them, they’ll first try a standard character claim, and if that doesn’t go through they’ll just go for a special format claim. Some stuff is bound to make it through. Whether on legitimate grounds or through clerical oversight.

1. OpenAI is working on a secretive voice engine project.

OpenAI is developing a voice engine project shrouded in secrecy, hinted at by recent trademark filings and employee statements.

  • Trademark filings for ‘voice engine’ suggest a significant upcoming project.
  • Employee statements hint at a groundbreaking personal assistant product in the works.
5 Likes

I’m really looking forward to this. I have been using the hands-free while driving and it’s really nice.

My mother is also legally blind and calls me to help her find the right spice. I would love for her to be able to have an Assistant in her life to help her find things and read back recipes that she’s found.

I know there’s Be My Eyes or whatever but it low-key just does not work.

All the current big names such as Google Home are utter trash bordering on abandon ware.

I can’t wait to see what they’re cooking. It does feel like OpenAI is going towards the physical side of things soon. Life-like robots when?

Thanks for sharing!

PS the robot face is creepy ASF :joy:

2 Likes

Voice Engine is gonna be shaped like a Furby, right? Right? A Furb-AI… Small smile.

3 Likes

:rofl: I have been imagining a modern tale on Teddy Ruxpin with Whisper, TTS, and GPT-4 in place of the tape player under the hood.

1 Like

Wouldn’t Voice Engine just be the engine giving sound to Sora vids?

Prompt = Video (Sora) + Audio (Voice Engine)?

2 Likes

My best guess is that it could be for a voice chat completion engine/model. This would drastically reduce the time between user speaking and the audio response received, which right now involves transcription, chat completion, and finally TTS.

2 Likes

A couple of thing that continue to worry me about OpenAI:

  1. Ongoing secrecy around roadmap, secrecy around datasets used. I get that they choose to interpret “Open” as not meaning open source, and that’s fine, but what is exactly open here? Almost nothing. They don’t do anything different than any non-open commercial service. I feel that is bound to lead to problems eventually.

  2. It’s a global service for a small world but the level of American bias in the products is quite astounding. For example, all the current voices are very American (except for a UK one). Even just with English there are many accents around the world. I really hope that changes in the new voice projects and that the voice choices are completely customisable and inclusive.

3 Likes