Udio: New Music Generator text2audio from Nvidia?

PaulBellow · April 9, 2024, 2:56am

Rumors have been circulating for a few days now, so I’ve decided to put my detective’s hat on once again, for an AI Mystery Investigation. In this video, you’ll not only learn what this new platform is, but we’ll hear some samples from it, which I’ll do some analysis on to see if it really IS a threat to Suno’s AI Music Throne.

anon22939549 · April 9, 2024, 6:54pm

It’s not a mystery.

Assuming it is Udio, it’s a product of Uncharted Labs, Inc which is made up of some former Google researchers.

PaulBellow · April 9, 2024, 9:39pm

A bit more research from Reddit…

https://old.reddit.com/r/singularity/comments/1bzd4bo/its_been_confirmed_the_suno_killer_is_called_udio/

omni72 · April 10, 2024, 2:08pm

open beta: https://www.udio.com/

edit: i’ve experienced a few minor hiccups with the website and suspect that i’ll see more of that as this starts popping up on people’s rAIdar

2nd edit: like i said

idk if that qualifies for /softwaregore but yeah they appear to be experiencing some growing pains

3rd edit: ~200k views in ~2.5hrs on their intro post. it may take them a moment to catch their breath:

PaulBellow · April 10, 2024, 6:03pm

Thanks for sharing!

Anyone else give it a go yet? Crazy they’re giving away so much for free … they’re gonna grab all the market share soon!

I’ve not had a chance to play around yet.

paul.fishwick · April 11, 2024, 10:25pm

It is amazing, from my trials. I finished a metal piece that goes for over 4 mimutes by extension and curation. Can we include audio links here?

paul.fishwick · April 11, 2024, 10:29pm

Here is a link: Udio | Neon Shredscape by Paul Fishwick

supershaneski · April 11, 2024, 11:26pm

I tested it. Using lyrics written in language other than English seem to be not so good yet. Other than that, it sounds good. I turned Invictus by William Henley to ska lol.

_j · April 14, 2024, 1:58am

Just starting to try it, but it is obvious that it won’t result in a top-40 smash. One generation has lyrics that are on the theme of the prompt, another go just sounds like plausible English – if you didn’t know the language. But enough prompting and the lyrics can be make clear and intelligible. There’s a bit of prompt disobedience and mentioning instructions in output, but if using DALL-E, it’s nothing new.

It’s actually pretty amazing, even when it writes its own lyrics based on a theme, and then you refine that prompt to transition the style. The only thing missing is more form to an entire piece than merely “extend”, where you might be able to highlight a chorus and have a return to what’s been written and sung before. Forcing your own lyrics on it might give form to the repetition that makes music appeal.

My masterwork of “pride” (a bit of humor), that takes a while to get there. Seeing the lyrics in the link is kind of a spoiler.

Update at 6hrs in. I was really getting the hang of this and sculpting what I want, until heartbreak at 4:22: “This song is too long to extend”. Right at the core of what was going great. Enjoy:

_j · April 15, 2024, 10:41am

Here’s a creative re-imagination of a tune you might recognize, with another style for it that suits the lyrics being a bit mashed up…

Udio | Midday Reprieve by King Krispus

Tell me that’s not crazy close to a song? And with remarkable text-to-speech.

Still, this technology seems most suited for a producer making loops or “evolution” of a beat instead of going for a whole catchy song (that I let continue with nonsense vocals), because of the 30 seconds of generation at a time that breaks audio into unnatural prompted segments, even of forgotten volume and voices, and inability to return to a chorus or melody. And I expect that like DALL-E or even ChatGPT, the perceptions will be tamed after finding out where everything output is kind of the same.

TonyAIChamp · April 17, 2024, 2:26am

Anyone knows anything if they have / when they are opening their API?

_j · April 17, 2024, 2:57am

Do they even hint there would be an API? They do plan to transition to paid-only.

The largest percentage of outputs are discardable, and continuing on the base 33 second through four different “extend” methods needs even more prompt interaction based on the impression you get of where the first seed of an idea might need to go.

The UI they have is incredibly mature and attuned to the particular product features, with few foibles. Like a better looking sharing than OpenAI “store” after half a year.

Something completely amazing like this clip I made from a whisper transcript, to then blow miles past OpenAI TTS, simply takes an hour of work to get right.

city3204 · June 16, 2024, 3:01pm

My romance with Udio is dying. It is either buggy or I don’t know what I’m doing. Here is how I try to create a song in Udio:

Add my prompt with appropriate tags.

First time around, I let Udio generate the song and lyrics

After Udio generation, I click the song title to enter the area with the large Cover Icon. Here I click edit to change the lyrics.

After changing the lyrics, I click save. Udio tells me that the lyrics have been updated.

I click play and Udio play the original lyrics - ignoring my lyrics, or it creates brand-new lyrics but still ignoring my lyric changes or even worse it starts singing in gibberish

So next I try to remix a few times with fingers crossed Udio will start using the lyric changes I made. NOPE—no such luck.

Any feedback, thoughts, and suggestions is welcomed. I haven’t checked if Udio has a YouTube account. Proper documentation is as important as the software itself. There are tons of YouTube videos about Udio but also lots of inconsistencies. Need proper YouTube videos from the creators.

Thanks for any thoughts in advance.

p.s. depending on what you want done, there may be several Udio work flows. If anyone has a good handle on Udio work flows, please post on YouTube and let us know. At the moment I would want to know a work flow for straight forward creation of a song and another work flow when you need to edit (lyrics, singer, medley, etc)

_j · June 16, 2024, 5:40pm

That’s not how lyrics work. Udio only recently allowed editing of the display lyrics because they sometimes became out of order or nonsensical with in-platform editing.

You have to provide the sung lyrics when you are creating the initial audio. The music and the lyrics are one output product. If you got an idea that has poor AI-written lyrics, you can write 30 seconds-worth of new lyrics and put those in the create lyrics box when you extend the segment for another 30 seconds.

codeyunky · June 24, 2024, 6:45am

its amazing for what it is, but i dont like the interaction, it always creates 2 versions and in a regular chatgtp you can correct the AI, udio promp is totally num when extending songs and suno has better lyrics and is superfast, but spitts out a complete song and voices on both platform are limited and the quality low.

i doubt songs like “smurfing you” are 100% udio only, those songs require allot of efforts. on the good site, it can even do local dialect songs in my native language and was 80% correct.

udio dot com /playlists/ushEJg8a6TnUbzgSvBzNpL

bottomline, its a nice toy for musicians who want a fast demo to take to the studio, if you dont publish, the songs are public domain under your own copyright claim, so thats a good thing.

Topic		Replies	Views
AI start-up Suno AI from Cambridge generates lifelike radio-quality music Community music	33	13477	October 17, 2024
Got ChatGPT to make Music 🎶 Prompting	11	23980	April 15, 2024
Anyone else using ChatGPT to make music? Prompting chatgpt , python , music	12	10666	July 30, 2024
Can LLMs be used to create unique and original music compositions? Community	7	3678	February 5, 2024
How to elevate vocals with A.i plugin? Community plugin-development , api	5	520	May 22, 2024

Udio: New Music Generator text2audio from Nvidia?

Related topics