"I want AI to do my laundry and dishes so that I can do art and writing, not for AI to do my art and writing so that I can do my laundry and dishes."
- authorjmac
This AI stuff is overhyped, has produced a lot of slop and spam, and is fraught with unresolved ethical issues. But AI is just computers, and computers are just automation, which has been used to accelerate art pipelines for decades. It doesn't have to be all or nothing, either: I've made a few AI-generated songs using Udio and Suno, and all of them used my own lyrics, written without any generative AI assistance.
The main problem I have with generative AI tools, in an artistic sense, is that they lack the ability to convey specificity of intent; word prompts alone aren't good enough.
I agree, and it's frustrating that there's so much fixation on this "single text prompt to [other thing]" use case in how people are building these things out. I think that drives a lot of the "slop" feel, because the target consumer of the tools isn't someone who wants to engage with an artistic process to create something. To me, that process is one of refinement and a feedback loop with one's tools, no matter what those tools are.
I think this might be a good research-paper proof of concept for a model, and the lack of explanation of how it works is disappointing but expected. As a product, I think the target audience for this thing isn't people who want to make art, but people who like the idea of generative AI per se. Maybe it'll move toward being a tool artists can use in the future, but I don't think that's what gets you funded in this environment, and it seems much harder to build things that work that way. The coolest uses of and tooling for generative image models have been created by the open-source communities around them, and I think the same will be true of audio.
> a process of refinement and a feedback loop with one's tools...
Yes! While technically impressive, these "text prompt to finished song" AI tools currently only solve low-value problems for already over-saturated markets. I just don't see a good path to a real business from "finished song" as the use case.
* With Spotify, SoundCloud, etc, music consumers already have access to more new, human-created songs than they can possibly listen to - all at historically low cost.
* Buyers of custom-created music, such as video makers and game studios, already have more stock-music library choices and custom-creation options (from Fiverr etc) than ever - also at historically low cost.
These are already low-value, commoditized markets and, once the novelty wears off, can't generate VC-level returns. And, no, I don't think AI is going to take a meaningful part of the high-end music market from the likes of Taylor Swift. It's not that I doubt AI will eventually make music that good - it's that high-earning pop stars like Taylor Swift, Beyonce, etc are much more than their songs. They are global brand businesses that generate more revenue from touring, merch and product tie-ins than the music itself.
However, there is a potentially profitable market for AI music tools that no one's targeting yet. It's a smaller market but it's accessible, scalable and immediately viable for even a beta-level, "research-to-product" solution. Don't generate finished songs. Instead, make an interactive tool which collaborates with human music makers in a much more granular way by generating the elements and components of music (called stems) as well as the underlying MIDI data. There's a whole industry selling human-created element libraries consisting of stems, loops, backing tracks, samples and style-based construction kits. These are used in a lot of the human-created music we hear. But they aren't interactive, adaptive or collaborative.
AI can provide a superior solution right now, and it doesn't even need to be 'top human' quality to be useful. Pop stars like Taylor Swift can afford to hire the best, proven human producers, studio musicians and mixing engineers to collaborate with, but there's a significant market of people, from students and hobbyists to indie producers and semi-pro musicians, who can't afford human collaborators.
To me this looks like a pretty rare thing in AI: A classic "Two Pizza"-type startup opportunity where a modest seed round can get to product-market fit and real cash flow. You also won't have to out-market Taylor Swift, outspend FAANG or target fickle consumers.
I'm just a long-time music-making hobbyist, and I consistently spend several hundred dollars a year buying such libraries, stems, loops and samples. It's far more than I pay for all my subscriptions to 'finished music' combined. And I have no aspirations to make money with my music. Hell, no one outside family and a few friends ever even hears it. Making music is just an extremely enjoyable creative activity I like to spend time (and money) on. But, as a potential customer, I have no use for a tool that generates finished songs. However, an AI that takes text prompts along with some MIDI chords and musical phrases I provide, and then generates a variety of suggestions in the form of separate stem tracks with MIDI which I can further mix and modify, would be an 'instant buy' for me. It doesn't need to be as good as a human collaborator because it's better in other ways: always available, non-judgemental, infinitely patient, and yet has no opinions or emotional needs of its own.
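For what it's worth, the "MIDI out" half of that wish is technically cheap: a Standard MIDI File is a tiny, well-specified binary format. Here's a hypothetical minimal sketch in plain Python (the function names are my own, not from any product mentioned here) that writes a single held C-major chord as a format-0 SMF that any DAW could import:

```python
import struct

def vlq(n):
    """Encode n as a MIDI variable-length quantity: 7 bits per byte,
    high bit set on every byte except the last."""
    out = [n & 0x7F]
    n >>= 7
    while n:
        out.append((n & 0x7F) | 0x80)
        n >>= 7
    return bytes(reversed(out))

def midi_file(notes, ticks=480):
    """Write one held chord as a format-0 Standard MIDI File."""
    ev = b''
    for n in notes:                                # all notes start at delta 0
        ev += vlq(0) + bytes([0x90, n, 96])        # note-on, velocity 96
    ev += vlq(ticks) + bytes([0x80, notes[0], 0])  # release after one beat
    for n in notes[1:]:
        ev += vlq(0) + bytes([0x80, n, 0])
    ev += vlq(0) + b'\xff\x2f\x00'                 # end-of-track meta event
    track = b'MTrk' + struct.pack('>I', len(ev)) + ev
    # Header chunk: length 6, format 0, 1 track, 480 ticks per beat.
    return b'MThd' + struct.pack('>IHHH', 6, 0, 1, ticks) + track

data = midi_file([60, 64, 67])  # C major triad
```

A real tool would generate this from a model rather than hard-code a chord, but the point stands: there's no technical barrier to these services emitting MIDI alongside audio stems.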
Image gen is streets ahead of music in terms of control, as long as you stick to the FOSS stuff as DALL-E is too limited. I’m only an observer for now and haven’t actually used it much, but both StableDiffusion and SDXL have ControlNet and a bunch of other things that let you, for example, draw a stick man in a specific pose and the AI will generate a realistic man in that pose. Or edit one specific part of the generation and continue iterating from there.
The day we get a similar level of control with AI music will be a dream come true for me. We really need stems or at least MIDI files for these tools to be more than just soulless jingle generators imo.
I've been using Krita with the Stable Diffusion plugin, and it's pretty amazing to use at times. I often read critics say things like 'you can't do layers with generative AI' and, uh, nuh? Though you can't, say, generate a shadow with adjustable alpha transparency yet, that doesn't seem impossible for the technology eventually. Thinking the tools won't improve would be like looking at MacPaint and saying digital art will never be a thing because it will always be low-resolution and monochrome.
What I'd love is Suno/Udio as a VST plugin. Being able to supply MIDI or audio samples to pull melodies from, to generate from arbitrary audio on a timeline.
True, but for me as an indie game developer, with no musical talent or the financial resources to pay an artist for unique music, this is extremely valuable.
- Is it as good as N targeted music tracks that fit together to match my game? No.
- Is it better than something I can create myself? By far.
- Is it better than a random few open-source or cheap tracks that you can buy on any random storefront? Sometimes.
So at the very least it has a foot in the door as far as I'm concerned.
Despite these machines, a person can easily spend an hour or two every day on common household tasks like cleaning the kitchen and doing laundry. That’s time that many people would love to get back.
Same, I need the exact music I want, tailored exactly to my tastes, all the time. I don’t want to waste time listening to others perspectives or ideas.
It's a fun toy/tool. Cool to see AI progressing. As a musician who does spend hours making music by hand this has instantly widened my artistic vision by being able to drop in a few ideas to see what comes out. Handmade music will still be around.
I wrote the lyrics. The original prompt is listed on the song's page, though I tweaked it during extensions and changed the RNG for the latter parts of the song; there are lots of knobs and dials to tweak now.
Weird A.I. is the one who's ironically racist; didn't you listen to the first half of the song? Geez. (I consciously avoided slurs and ended with something utterly irredeemable to underline the joke.)
The joke is that, as an AI, it can't help but spout a bunch of racist nonsense, also that the song is explicitly political when none of Weird Al's songs are.
They're locked down now because by default they absolutely do generate that kind of shit. It's a riff on the early days of Twitter generative AI chatbots where blasting out right wing and neo-nazi talking points was very much a thing.
Was it a trope? Who cares, I don't think of comedy in that way. If the implication you're trying to make is that I'm racist for writing that stuff, it's obvious given the surrounding context that it's not in any way a sincere expression of those views. Also in the verse itself, the language used was pretty light and given how that verse ends, clearly not an endorsement.
The very first track I created in suno (about us landing on an alien planet) blew my socks off. The vocals in one or two places are a bit off, but overall it's amazing.
Less than three months after laughing in disbelief that it was now possible for an especially clever orchestration of AI models to instantly produce a pop song about my refrigerator, I'm nearly as amazed that there are now multiple entrants in this market. Feels singularity-ish.
This is a bit more complicated, considering you can supply your own lyrics. The court didn't rule that mixing non-generated media with AI-generated material invalidates the copyright of everything it's mixed with.
So given an AI generated song with fully human-written lyrics perhaps others could mute the lyrics, or more easily sample from it, but the resulting output as a whole would probably have some degree of copyright protection. Suno has also demonstrated being able to supply your own melodies too, put those two together and how much of the resulting work could be credibly argued to remain uncopyrightable?
It would be so much more useful for music producers if these audio gen services would create individual samples instead of trying to generate the entire composition.
Agreed! Creating a high-quality finished song is not only a harder target for a product to hit, it's actually less valuable as a use-case. I wrote a response to another post detailing why as well as the product I'd actually pay for. https://news.ycombinator.com/item?id=40563993
Should be useful even though it's not fully what I'd hoped for (provide note/chord MIDI input, more granular control with repeatability, MIDI data out) but it's definitely a big step in the right direction.
The length of the output is not the issue. The problem is that the output includes drums, voice, and several instruments already premixed into an accompaniment. What music producers really need are isolated samples of just a single instrument that can be mixed, layered and rearranged using other tools.
There are other AI tools that can split audio into stems, though it's not a perfect solution since it introduces artifacts. It's good enough to lightly remix and adjust levels, but taking whole instruments or vocals out usually leaves the mix sounding fairly hollow.
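That hollow sound predates AI separators. Real stem-splitters (Demucs, Spleeter, etc.) use learned models, but the old pre-AI trick, subtracting one stereo channel from the other to cancel anything panned dead center (usually the lead vocal), shows the same failure mode: everything else mixed to the center gets thinned out too. A toy sketch with made-up sample lists, not any tool's actual API:

```python
def remove_center(left, right):
    """Naive 'karaoke' trick: the difference of the stereo channels
    cancels any signal that is identical in both, i.e. panned dead
    center (typically the lead vocal, but also bass and kick)."""
    return [(l - r) / 2 for l, r in zip(left, right)]

# Toy mix: a center-panned "vocal" plus a hard-left "guitar".
vocal  = [0.5, -0.5, 0.5, -0.5]   # appears equally in both channels
guitar = [0.2, 0.2, -0.2, -0.2]   # left channel only
left  = [v + g for v, g in zip(vocal, guitar)]
right = vocal[:]

karaoke = remove_center(left, right)
# The vocal cancels completely and only half the guitar survives;
# anything else center-panned (bass, kick) would vanish with it,
# which is exactly the "hollow" result described above.
```

Learned separators avoid the crude cancellation but still have to guess how overlapping sources share the same frequencies, so some version of this artifact survives.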
Udio and Suno are both really bad at generating classical instrumental music that makes any melodic sense — the equivalent of drawing six-fingered people everywhere.
Funny to see how people get ignored when they post their AI songs. Personally, I'm against the movement because it's basically killing music culture.
People use it to generate huge numbers of AI covers of some meme song in different genres, which gets funnier with each new one. Obviously that wouldn't be possible without AI; who would bother to record a rockabilly or Cossack-choir version for a meme?
I would like an AI that can just take an existing song and make it better quality (with several knobs to adjust), or swap in new lyrics.
I very much enjoy udio, stable audio, suno and others.
- it seems like lately most human musicians write music when they are angry or depressed, not when they are in a good mood. these tools are much better at coming up with neutral or positive-sounding music than, say, youtube music search.
- it probably marks the end of cookie-cutter music production (can you really tell the difference between modern edm tracks?), letting musicians play live music to smaller audiences. because of these tools, suddenly live performance is special again.
- unlike people, these tools are not afraid to be silly. creating ridiculous cat music is a lot of fun
- this is a great way to get ideas for your own music. no need to sprinkle ink on note sheets, like they used to.
Funny this was posted today. I spent some time this weekend playing around with Facebook’s MusicGen and had a ton of fun. I’m planning to use it to have a personal 24/7 radio station. Wonder what Udio is using under the hood.
If you ever make it public, please email me your channel address and I could add it to next software version of my bathroom radio: https://loodio.com - my email is carl@thedomain
Listening to some of these, it is so extraordinary to think an algo created and performed the music. I find myself wondering how much these songs resemble particular songs they were trained on.
A lot of my prompts on Suno seem to inevitably gravitate toward a sort of contemporary pop style of production, but so far, it seems like Udio does better with older or more niche styles, such as 70s funk or show tunes. It also seems like a lot of the trending examples I'm seeing on Udio use custom lyrics (which may or may not have started in an LLM, but appear to have human-generated phonetic spellings).
Are there any models that I could run locally that would let me do this? I'm afraid I will go bankrupt buying credits for this if I'm not careful because I enjoy it so much!
Nothing this sophisticated. From my research, MusicGen from Facebook seems to be the best open model you can run yourself. But it’s not intended for generating “songs”, just short clips of music in a style you specify. Still really really cool and fun to play with though!
I know people are complaining that it's dehumanizing music, and there's some truth to that, but considering I was never in the group of people making music anyway, it's immensely fun to have any song I can imagine immediately materialize (even if it is a bit soulless). I would like to have thousands of prompts queued up on my GPU server and generate everything I can think of.
Good project. Personally, though, I've resolved not to listen to or support AI artists/musicians (provided I recognize the artist is using AI, of course).
Are there people who really enjoy music generated by AI?
I guess there's always an audience for poorly made or low-quality things, like industrial food with low-quality ingredients.
Or maybe it'll become so good it replaces artists altogether, but what's the point of it all?
In a way, future generations might only know this new world where music is generated by machines, and won't be shocked by it.
My take is that music is so deeply rooted within us that even if AI can generate it, it'll never replace the human experience, and it might even push music made by humans to become a luxury and more expensive. In a way that's a good thing for artists if the money goes in their pockets; on the other hand it might cut off a part of the population who will no longer have access to culture.
Or there might be more piracy, but that might kill artists' way of living, and their music in the process.
I thought about more concerts and such, but as of today I find it difficult and expensive to attend concerts from where I live. It requires many hours of travel, sometimes hotel stays, which puts the experience out of reach if it's needed a few times a month.
My brother-in-law is a musician, but he's never been able to make a living out of it. They performed in venues, but in order to live and support his family he needs a day job, which makes it harder to live off his craft.
I'm curious to see what positive changes this will bring.
Personally, I enjoy the fact that it's generating songs about topics I'd find it funny to have a song about: my dog playing with their friends, a funny situation that happened. So it's mostly about hearing something personal to me put into a song. I'll listen to it a few times, send it around and be done with it. It'll never go into my daily-listening queue and will not replace the emotional connection I have with songs that helped me through bad times. It's just a fun tool to make something "personal" that I'd never hire an artist for anyway.
Udio as a platform gives off this creepy dystopian vibe and I’m really not a fan. All the music is super uncanny valley - no idea who is actually listening to it.
That said, I think there's a great use for AI-generated music as background noise. I've been playing with Facebook's MusicGen and it's really fun. I'm working on a personal 24/7 radio station based on whatever prompt I want. It's a far cry from actual human-created music from a melodic standpoint, but if I just want an infinite stream of noise while I work or read, I think it'll be good enough.
I'm legitimately surprised that there seems to be no pornographic equivalent to this yet (or, at least, not one I've heard about in passing). Porn-on-demand seems like it's always a prime target for AI tech, maybe because the demand is automatically there, maybe because it doesn't really matter if it's poorly made as long as the gist is there. Maybe audio porn is just too niche?
I doubt it is. Have been seeing several 'tok videos of "oldies" AI music and the album art featured in the videos looks like this. I think they are spending marketing dollars on flooding the typical social media distribution channels with content from their platform.