Udio: New Music Generator text2audio from Nvidia?

Rumors have been circulating for a few days now, so I’ve decided to put my detective’s hat on once again, for an AI Mystery Investigation. In this video, you’ll not only learn what this new platform is, but we’ll hear some samples from it, which I’ll do some analysis on to see if it really IS a threat to Suno’s AI Music Throne.

3 Likes

It’s not a mystery.

Assuming it is Udio, it’s a product of Uncharted Labs, Inc which is made up of some former Google researchers.

3 Likes

A bit more research from Reddit…

https://old.reddit.com/r/singularity/comments/1bzd4bo/its_been_confirmed_the_suno_killer_is_called_udio/

2 Likes

open beta: https://www.udio.com/

edit: i’ve experienced a few minor hiccups with the website and suspect that i’ll see more of that as this starts popping up on people’s rAIdar :smiley:

2nd edit: like i said :laughing:

idk if that qualifies for /softwaregore but yeah they appear to be experiencing some growing pains

3rd edit: ~200k views in ~2.5hrs on their intro post. it may take them a moment to catch their breath:

3 Likes

Thanks for sharing!

Anyone else give it a go yet? Crazy they’re giving away so much for free … they’re gonna grab all the market share soon!

I’ve not had a chance to play around yet.

It is amazing, from my trials. I finished a metal piece that goes for over 4 mimutes by extension and curation. Can we include audio links here?

Here is a link: Udio | Neon Shredscape by Paul Fishwick

1 Like

I tested it. Using lyrics written in language other than English seem to be not so good yet. Other than that, it sounds good. I turned Invictus by William Henley to ska lol.

Just starting to try it, but it is obvious that it won’t result in a top-40 smash. One generation has lyrics that are on the theme of the prompt, another go just sounds like plausible English – if you didn’t know the language. But enough prompting and the lyrics can be make clear and intelligible. There’s a bit of prompt disobedience and mentioning instructions in output, but if using DALL-E, it’s nothing new.

It’s actually pretty amazing, even when it writes its own lyrics based on a theme, and then you refine that prompt to transition the style. The only thing missing is more form to an entire piece than merely “extend”, where you might be able to highlight a chorus and have a return to what’s been written and sung before. Forcing your own lyrics on it might give form to the repetition that makes music appeal.

My masterwork of “pride” (a bit of humor), that takes a while to get there. Seeing the lyrics in the link is kind of a spoiler.


Update at 6hrs in. I was really getting the hang of this and sculpting what I want, until heartbreak at 4:22: “This song is too long to extend”. Right at the core of what was going great. Enjoy:

1 Like

Here’s a creative re-imagination of a tune you might recognize, with another style for it that suits the lyrics being a bit mashed up…

Udio | Midday Reprieve by King Krispus

Tell me that’s not crazy close to a song? And with remarkable text-to-speech.

Still, this technology seems most suited for a producer making loops or “evolution” of a beat instead of going for a whole catchy song (that I let continue with nonsense vocals), because of the 30 seconds of generation at a time that breaks audio into unnatural prompted segments, even of forgotten volume and voices, and inability to return to a chorus or melody. And I expect that like DALL-E or even ChatGPT, the perceptions will be tamed after finding out where everything output is kind of the same.

Anyone knows anything if they have / when they are opening their API?

Do they even hint there would be an API? They do plan to transition to paid-only.

The largest percentage of outputs are discardable, and continuing on the base 33 second through four different “extend” methods needs even more prompt interaction based on the impression you get of where the first seed of an idea might need to go.

The UI they have is incredibly mature and attuned to the particular product features, with few foibles. Like a better looking sharing than OpenAI “store” after half a year.


Something completely amazing like this clip I made from a whisper transcript, to then blow miles past OpenAI TTS, simply takes an hour of work to get right.

1 Like

My romance with Udio is dying. It is either buggy or I don’t know what I’m doing. Here is how I try to create a song in Udio:

Add my prompt with appropriate tags.

First time around, I let Udio generate the song and lyrics

After Udio generation, I click the song title to enter the area with the large Cover Icon. Here I click edit to change the lyrics.

After changing the lyrics, I click save. Udio tells me that the lyrics have been updated.

I click play and Udio play the original lyrics - ignoring my lyrics, or it creates brand-new lyrics but still ignoring my lyric changes or even worse it starts singing in gibberish

So next I try to remix a few times with fingers crossed Udio will start using the lyric changes I made. NOPE—no such luck.

Any feedback, thoughts, and suggestions is welcomed. I haven’t checked if Udio has a YouTube account. Proper documentation is as important as the software itself. There are tons of YouTube videos about Udio but also lots of inconsistencies. Need proper YouTube videos from the creators.

Thanks for any thoughts in advance.

p.s. depending on what you want done, there may be several Udio work flows. If anyone has a good handle on Udio work flows, please post on YouTube and let us know. At the moment I would want to know a work flow for straight forward creation of a song and another work flow when you need to edit (lyrics, singer, medley, etc)

That’s not how lyrics work. Udio only recently allowed editing of the display lyrics because they sometimes became out of order or nonsensical with in-platform editing.

You have to provide the sung lyrics when you are creating the initial audio. The music and the lyrics are one output product. If you got an idea that has poor AI-written lyrics, you can write 30 seconds-worth of new lyrics and put those in the create lyrics box when you extend the segment for another 30 seconds.

its amazing for what it is, but i dont like the interaction, it always creates 2 versions and in a regular chatgtp you can correct the AI, udio promp is totally num when extending songs and suno has better lyrics and is superfast, but spitts out a complete song and voices on both platform are limited and the quality low.

i doubt songs like “smurfing you” are 100% udio only, those songs require allot of efforts. on the good site, it can even do local dialect songs in my native language and was 80% correct.

udio dot com /playlists/ushEJg8a6TnUbzgSvBzNpL

bottomline, its a nice toy for musicians who want a fast demo to take to the studio, if you dont publish, the songs are public domain under your own copyright claim, so thats a good thing.