Google's MusicLM: Text Generated Music & It's Absurdly Good
Vložit
- čas přidán 27. 01. 2023
- Google's MusicLM that uses AudioLM may have just changed the whole text to music AI landscape. Without using any diffusion, MusicLM creates extremely high (24 kHz) audio quality with consistent result that had my jaw dropped. Probably the first working and direct text to music that is accurate and fully synthesized. 2023 has started amazingly. Now I am only 1 day late to this cuz I spent the last 10 hours making this video please like and sub ♡
Riffusion: Riff + Diffusion
[Project Page] www.riffusion.com/about
[Code] github.com/riffusion/riffusio...
[Model Checkpoint] huggingface.co/riffusion/riff...
Mubert-Text-to-Music
[Website] mubert.com/
[GitHub] github.com/MubertAI/Mubert-Te...
MusicLM: Generating Music From Text
[Paper] arxiv.org/abs/2301.11325
[Project Page] google-research.github.io/sea...
MusicCaps: 5.5k high-quality music captions written by musicians
[Dataset] www.kaggle.com/datasets/googl...
This video is supported by the kind Patrons & CZcams Members:
🙏Andrew Lescelius, Chris LeDoux, Shawn77134, Panther Modern, Jake Disco, Demilson Quintao, Tony Jimenez, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony
[Discord] / discord
[Twitter] / bycloudai
[Patreon] / bycloud
[PayPal] paypal.me/bycloudai
[Profile & Banner Art] / pygm7 - Věda a technologie
FINALLY I am only 1 day late to AI news cuz I spent the last 10 hours making this video so please like and sub ♡
Suggestion: place your avatar in thumbnails - its very recognisable and catch attention
Okay that's the greatest thing I've ever seen in my entire life.
you got mine and mine respect
cause* and i subscribe to every youtuber i watch!
You know Google never releases there ai to puplic pretty sure this music one is no different than those other ai
Training a text-to-image AI to generate spectograms was really clever
It's a Galaxy brain idea
I typed "Farts and burps" into riffusion with the negative prompt "Guitar" and those hellish sounds will haunt me for weeks.. also try "Monkey screams" if you need some nightmare fuel
@@legendaryra3590 I did, thanks... I will not sleep tonight
@@legendaryra3590 I also typed "Dog screams" and got some crazy weird vocals.
Is there a way to convert pre-existing audio files into the spectrograms SR uses?
The vocals remind me of human dreams where you hear a melody or see that you're reading something, but if you try to focus on it, you can not comprehend what has actually been said.
Totally. Like trying to read a book or even just a clock in your dreams. Superficially represents some information, but it is meaningless.
Aping art. Aping Intelligence. AI.
Sounds like K-pop to me lol
Am I the only one who feels it sounds like chinese? Like that the training sample had too many chinese music in it and it all ended up sounding chinese.
@@videodaniel8945 yeah to me the vocals sound pretty authentic like a human could be singing it. i don’t get any creepy or dream like feeling from it tbh, it just sounds like someone singing in a foreign language to me.
They could totally transform this into an AI audio upscaler, to make low quality music sound crystal clear! I've been wanting something like that for years
Fix some of those crappy mp3s we burnt from cds back in the day!
This already exist, people even remastered Half-Life audio.
Not sure how this video’s content relates to that, everything in this video has sounded like so-so vinyl or a somewhat worn Tape…so unless you’ve got some **really** bad MP3s idk where you got that idea
@@The_CGA yeah but these sound like that because all the ‘vocals’ are robot made. It would be much clearer over actual vocals
audio upscaler... throw it through a fourier transform, take the mag bins as a series and extend that series. haven't done it but i figure a cepstral transform may be used to "advance" the fft frame and double the bandwidth.
There's something a little uncanny about an AI trying to imitate human speech.
Eventually it will imitate so well that you can't tell the difference.
@@Miranox2 That's already the case. There are a lot of AI TTS papers where you can't tell.
@@laupoke There's a big difference between a research paper and a commercial product. TTS software still has a ways to go.
@@Miranox2 wym bro? there's literally multiple voice gen AIs lol
You say that now because you're human still.
As a listener, i'm not impressed... As a composer and producer, i'm impressed that this even works.
Perfect comment. My thoughts exactly.
SAME
Stan(ley) Pine's dad: I'm not impressed and impressed at the same time.
a lot of people aren't even impressed by marvel movies so.. it's a scale. we saw how good ai images got in
Warning. I would not dismiss this technology based on your personal biases. AI learns at an exponential rate and only gets better over time. It seems like AI developers are hell-bent on cutting people out of the art and music they consume.
I like the conversion of voice expression to guitar. That is an actual useful tool in human composition.
agreed, this would be a fantastic tool!
There are already some audio tools that do something similar - with audio to MIDI convertors - but that example was strikingly good and serves as an example of something that will become commonplace. It will be unnecessary to have guitar lessons when you can just hum a tune and then get a computer to play it back and sound realistic.
Melodyne is great for this
That has been in Logic DAW for a while. Audio to midi
Theres a vst plugin called Vochlea that does this already, although I imagine AI would do it much better
It kinda has an eerie, surreal kind of liminal lostwave analog horror quality to it.
Crazy that the model learned this from just ~5K labeled songs. If I didn't know that, I'd guess the dataset had to be hundreds of thousands of such datapoints at least.
no, it's less.
sincerely,
Scary
wtf
WAIT WHAT? Just 5k songs?
Actually apparently (copy-pasted from comment on Fireship's video on the topic): the 5.5k music samples are only the *evaluation* dataset, which they've published along with the paper, but the paper states that "the semantic and acoustic modeling stages are trained on a dataset containing five million audio clips, amounting to 280k hours of music"
I wonder if part of the reason for creating this was the aggressive music industry copyright claims on CZcams. Now future videos can have custom music without getting their earnings taken by some random rights management company.
I think the opposite may happen: lots of false flagging.
New technologies always threaten the hegemony, so it will be interesting to see how the Big 3 record companies react to the destabilisation of the industry that AI can cause. I could imagine the big record companies doing things like threatening to pull their songs off Spotify or CZcams if the streaming services allowed people to upload AI-created "fakes" of real artists. Any "democratic" technology that puts creative power in the hands of consumers scares the hell out of existing rights holders. I wonder whether Universal and Sony will see AI as an enabler that it can harness for its own profit, or as a potential destroyer of the industry. (Years ago, the record industry of America basically prevented minidiscs taking off as a recording medium, and it also tried to shut down online piracy via napster, so it will be interesting to see how it reacts to random people being able to produce and share "good" music without paying a record company for it).
I doubt that was the sole reason, but likely a considered use case. A good idea none the less.
Nah man there's enough people who make music and enough free music on the internet to not be flagged
*Good. Those companies need to finally burn in hell.*
As a music producer, I'm really impressed by the track at 2:01 !
It's coherent, stays on track and is catchy as well. It's better than some people who have made musics for a while
Some of the tracks are straight out of tune or just use wrong accord that makes it unmelodic sometimes
@@joemama-bu5ue What's a bit ironic with that comment is that these are often the songs that sticks the most. It's often tracks that what feels off on the first couple sets of listening that get addictive and can be listened to over and over again.
As a meddling guitar player, playing in key with scales is easy; it's what you learn when you start off. Not saying the AI is genius-level composer, but don't dismiss such creations so fast, what seems worse to you at first might be something you're just not 'getting' on the first couple of plays.
@@joemama-bu5ue Or just take jazz...... But yeah I don't mean that your comment is wrong, I'm sure there's going to be a lot of garbage generated, if it's like image AI, what is it, maybe one out of 10 is usable? 50? 100? You'd expect music to be similar... Can't all be homeruns all the time! But a bit of off-key off-scale might be a great thing; not a bad thing.
The jazz examples are an interesting contrast. They're completely incoherent even though the superficial elements that might make something "sound jazzy" are present.
Jazz is often called "musician's music" because of its technical complexity and the level of music theory involved. Maybe that makes it particularly challenging to recreate convincingly.
If you're not into jazz, the best way I can describe it is it kind of sounds like a non-musician with a really superficial understanding tried to imagine some jazz in their head. The overall vibe's right, there's drums and electric piano playing chords and a solo guitarist noodling away. But the harmony and changes are confused, the melodic lines are incoherent random notes, there's no groove to speak of and no real structure or idiomatic jazz vocabulary.
what? that sounds like trash :D i would say any newbe can do it better in one week, with the right teacher in one day
LOL that '' BELLA CIAO'' italian song, on E-guitar, holy moly this is amazing
Whoa, that story mode is incredible, so many ideas you can generate alone just from being able to transition music from style to another.
»The book of the generation of Jesus Christ, the son of David, the son of Abraham.
Abraham begat Isaac; and Isaac begat Jacob; and Jacob begat Judas and his brethren;
And Judas begat Phares and Zara of Thamar; and Phares begat Esrom; and Esrom begat Aram;
And Aram begat Aminadab; and Aminadab begat Naasson; and Naasson begat Salmon;
And Salmon begat Booz of Rachab; and Booz begat Obed of Ruth; and Obed begat Jesse;
And Jesse begat David the king; and David the king begat Solomon of her that had been the wife of Urias;
And Solomon begat Roboam; and Roboam begat Abia; and Abia begat Asa;
And Asa begat Josaphat; and Josaphat begat Joram; and Joram begat Ozias;
And Ozias begat Joatham; and Joatham begat Achaz; and Achaz begat Ezekias;
And Ezekias begat Manasses; and Manasses begat Amon; and Amon begat Josias;
And Josias begat Jechonias and his brethren, about the time they were carried away to Babylon:
And after they were brought to Babylon, Jechonias begat Salathiel; and Salathiel begat Zorobabel;
And Zorobabel begat Abiud; and Abiud begat Eliakim; and Eliakim begat Azor;
And Azor begat Sadoc; and Sadoc begat Achim; and Achim begat Eliud;
And Eliud begat Eleazar; and Eleazar begat Matthan; and Matthan begat Jacob;
And Jacob begat Joseph the husband of Mary, of whom was born Jesus, who is called Christ.
So all the generations from Abraham to David are fourteen generations; and from David until the carrying away into Babylon are fourteen generations; and from the carrying away into Babylon unto Christ are fourteen generations.
Now the birth of Jesus Christ was on this wise: When as his mother Mary was espoused to Joseph, before they came together, she was found with child of The Holy Ghost.
Then Joseph her husband, being a just man, and not willing to make her a publick example, was minded to put her away privily.
But while he thought on these things, behold, the angel of the Lord appeared unto him in a dream, saying, Joseph, thou son of David, fear not to take unto thee Mary thy wife: for that which is conceived in her is of The Holy Ghost.
And she shall bring forth a son, and thou shalt call his name JESUS: for he shall save his people from their sins.
Now all this was done, that it might be fulfilled which was spoken of the Lord by the prophet, saying,
Behold, a virgin shall be with child, and shall bring forth a son, and they shall call his name Emmanuel, which being interpreted is, God with us.
Then Joseph being raised from sleep did as the angel of the Lord had bidden him, and took unto him his wife:
And knew her not till she had brought forth her firstborn son: and he called his name JESUS.«
The Gospel of Jesus Christ - Matthew - Chapter 1
It would be quite interesting to transform this type of generated music into actual sheet music (or midi in a DAW), and analyze what it did from a "traditional" music theory perspective.
Also could be interesting to compare how those tracks would sound if played by a real band.
This! At that point we're basically reverse-reverse-engineering music, which is sort of meaningless by itself, but still interesting.
That sort of thing was done at least thirty years ago (and possibly earlier) when primitive AIs were fed the manuscripts of Beethoven and Mozart and prompted to take bits and create new tunes. According to taste, some of the pieces were "quite good", but lacked the emotional qualities of the human-created pieces (and in parts sounded "weird") when played by orchestral musicians. With today's deep learning, the bots can create music that would be indistinguishable from the work of the all-time greats, but humans just don't want to accept it. They will though. The market will dictate that "Beethoven" should release a new symphony. The AI that creates the best one will be top of the classical charts.
You could just make the AI run the midi through a plugin like Kontakt where it'll play real sounding violins, bass, piano etc etc it'll sound much more convincing that what we have so far. Something like classical music or Jazz will sound very convincing soon.
@@LAFELIXMUSIC @AutPen38 This is a different technology, it does not work with "notes", this thing analyzes spectrograms and does not work with melodies and harmony in any comparable way to that. It does not generate midi nor can it be fed with manuscripts. It goes from finished sound wave to finished new sound wave, without ever going trough writing parts or anything like that.
@@koraamis5568 ableton can already do audio to midi. if its one simple instrument it converts it very well and accurately. soo all you need to do is take the audio generated and convert to midi then run that midi through a good sounding plug in.
I was waiting for you to make a video about this, you're great at always keeping us informed.
Damn Google has some sick beat
Whenever I was listening to that Drake- Greece song, I always suspected it being AI generated from beat to vocals.
Sounds very computerized. I think these labels have been doing this kind of stuff for a minute now.
There are several music creation tools that already use AI, but nothing quite as comprehensive as this. There was a lot of hate for "Autotuned" vocals, but the record sales spoke for themselves. (Young) people like hearing new sounds and they don't care if a computer was used. In Korea they have pop bands that exist in virtual avatar form, and there are "vocaloid" plugin instruments that are virtual singers that you can program to sing your own lyrics. There's bound to me more and more of this over time, as listeners want to hear exciting new sounds and the best way to get them is to use the latest technology.
For sampling this amazing, for listening not so much, but for samples this is endless.
This AI music revolution will truly be having an impact, not when a company makes a breakthrough (like this), but when they make that breakthrough public and usable by everyone.
It won't change much.
When tools are available, they are available for everyone, music producers as well.
Who do you think is going to make the best out of it? The same people who are making the best out of the billions tolls that already exist today.
The AI revolution will have a huge impact, that's true, but I would be much more concerned if I was delivering stuff in a van, moving box in a warehouse on answering the phone for a company.
Musicians will be fine, you won't go to a concert to see an empty stage with a laptop in the middle, randomizing notes.
@@ChristianIce Disagree. There are a lot of talented people who just dont have the equipment to do it. That AI provides this opens it up to numerous people. Same for AI pictures. Its more accessible and removes a lot of barriers. It will change a lot.
@@TheH1st0ry
You must be unaware of the infinity of tools available to make music for free.
Anybody with a computer and a microphone can, for decades.
Yes, this tool will add to that, but you could produce loops and songs without knowing anything about music since the Amiga Tracker.
To think that the effect would be the same in music industry compared to visual art means not knowing either.
@@ChristianIce but it is going to place where you even do not need microphone - soon AI will sing for you with your voice, and after that it will be able to create any song in any style, e.g. "play me new Elvis song" - you still think it will not change anything and there is no danger for artists who will lost their audience?
@@bzdr lol
i guess the truth is that people not only listen to the music of other people but also they follow the personality, how them act, look etc. AI will make a revolution but only in production making, like any already existing VST. you’re not gonna listen to robot lol because it can do billions of GoOd songs and blah blah blah the perception of art will change
This is miles ahead of other music generating programs.
No wonder
And it is still lightyears behind real music, and it will never ever catch up
@@isaacnewtech nah
@@Milk_Delivery yes, because you're unable to discern what is missing, unlike millions of people who can.
@@isaacnewtech you won't be able to distinguish ai music from real music in 5-10 years mark my word
I can already imagine generating an infinite piece of music that exactly adhere to one's own music taste.
Have it generate music based on your heart rate, body temp, and brainwave frequency. Spend your life living in 'the zone' of perfectly appropriate music set to every life experience you have.
And you’ll get so tiered and bored of it
@@vmusatov I don't mean just the same beat and melody, but like everchanging, possibly with mood or whatever. I doubt it'll ever get boring if done correct.
@@astrovation3281 Itll def get boring
@@snakeyeslp How though? The entire point of this imaginary thing is to make an indefinitely ongoing song (maybe based on some you already like?) that keeps it interesting. Neither you or me know what will be possible, but I know that if it becomes a commercial product there will be a lot of effort to make it not boring.
Just saw the video o the same research paper by another youtuber last night, but man this video feels something so "natural" (don't know if it's the right word,maybe soothing¿) and I don't know how to put it but "lofi like", yeah.
Bycloud, your videos have a hard to define soothing feel to it, can't put a finger on what it is.
I really appreciate all the time and effort you put into this video! 🙌🏻
Great video! I’ve been checking out the various music AI’s and this one never popped up in my searches. Thanks!
I need to get my hands on this software! Really wanna try the humming feature 😂
did you manage? i'm trying to do the same
@@NPJGlobal never tried
Google is doing what Google does best, make something great and then bury it.
@@GamingDad lol, I remember calling Google about 10 years ago trying to see if we could pay them to license use of Google Maps and got what sounded like an intern who said wow, we aren’t really set up for that, you see we’ve got this advertising model and we don’t have a mechanism to sell things, so when our license with Delorme ended, we switched to Microsoft maps, then Google finally woke up and figured out how to sell things and we switched to them, was such a big deal the CFO got them to throw in a Google Glass kit. We gave it to a colleague as a going away present when he left to work for Indeed.
What a time to be alive!
Hehehe! 'Hold on to your 2 Minute Papers'! That's probably how this ended up on my Home page recommendations!
Interesting stuff! That hum conversion really impressed me!
You are the first (ive heard of ai music from) thanks for the spectrograph talk up front. Makes so much sense off images idea
I had a lot of struggle getting it to make anything I asked it to, it always defaults to some modern EDM crap with my requests, but surely it will get better. Pretty impressive.
Unless you're a researcher working on this model, as far as I know no-one else has access to this. Were you using some other AI-Music-Generating site? There's a whole bunch of them now, but none of them seem to work the way MusicLM (THIS video's AI) does. A lot of them generate midi songs from building blocks it's given; not making songs from scratch. They both have their pros and cons, but the ones available to the public are often much simpler and same-y sounding.
Besides the 24kHz sample rate it actually sound quite good to me. Humans are really going to have to start thinking about things like intent vs technical skill.
this was a really good explainer on how audiolm works! thx for making this!
I don't get why not train AI on music notation like midi or sheet music, and then use synth/samples to actually play the sounds?
You would be able to put the notation output into a music program and have total control over tweaking it, instead of getting a raw audio output which is basically set in stone...
the same reason image ai's haven't been trained to make use of separate layers for cohesion/motion that would work well with an adobe/gimp/krita-based workflow
answer: that tech leans closer to proceduralism & well-tuned algorithms, the sort of boring yet effective work that won't get the same "ai" marketing label
That workflow is quicker using midi packs, you don't need AI to reinvent the wheel.
But I agree that in this way you can just use some loop, which again, you can make for yourself quicker through libraries.
Let's get back to it next year and see if they managed to double the bitrate, at least :)
for starters musical notation completely fails to capture what's necessary to recreate the complexity of the human voice or rap flow. but also popular music is increasingly moving away from things that can be expressed in western musical notation and the creative focus is more on tone/timbre and subtle unquantizeable rhythms/grooves. so it'd be completely inadequate for that as well as international/ethnic music that never used it in the first place.
Well that's actually been done by openAI a few years ago, it is called MuseNet
This is exactly what the company AIVA did. Imo as a musician/ producer, it was the most compelling one of all the music generation AI that I've looked into. Some genres come out terrible but a lot of them are pretty impressive. This Google MusicLM is one of the first ones that piqued my interest since AIVA tho. Way more impressive than Riffusion.
This reminds me of when art Ai started to get good.
Let’s wait a year and the music will be indistinguishable from actual music
not on this training data it wont and the thing about music that's different to visual art which is actually rendered as data in the form of pixels is the training data at present they have access to is comps, not stems, not to mention they can only deal with comps that have already been written or melody lines and harmony's, they cannot come up with new ones because every individual is different and makes different choices, can probably write a pop song eventually about the limits of it.
@@123Andersonev I was mostly exaggerating on the time-frame, and I don’t imagine this specific AI picking off like you said.
But I will disagree when it comes to the ceiling of AI music. I do think it’ll get to a point where you really have to break down the music in order to tell if its AI or not. Let’s be honest, AI is already getting insanely impressive, music isn’t going to be its hurdle. Even if it takes a decade to get to human level.
But just like AI art, a talented human touch will always stand out from something that’s a product of previous work
@@RealityRogue that's not the problem, the problem is in how AI works, it can never be original, it can only work on what it's been taught, it cannot go beyond the realms of convention if that makes sense, so it's actually artless, it only knows how to replicate what already exists because people affirm the output as what they expect, in order for it to go outside convention you need someone sat there telling it no.
Are these "original" creations its creating? Or aggregating things that exist hybrid kind of way like how does the copyright works?
8:00 is from DooM 2 "Nobody Told Me About id"
that humming to instrument functionality needs to become available, please just release it so that we can start using it....
I require it
2023 is a key year
Y’all are all so messed. Like why do we need this? Anyone could’ve come up that! Life is a journey. Not an end result. What are we trying to do here make it to where we just have to sit back and press start? What’s the point to life if we aren’t even going to live it ourselves? I’m honestly baffled why we feel a need to do this? Is this what we sacrifice in the name of science and advancement? I thought the point of automation was that it was going to free us to spend our time doing things like making music and art.
Couldn't have said it better myself! I'm a professional music producer, it's my main source of income. But it's also my main source of joy. Without creating I fear I might be nothing. Hope people still care about human made stuff in the future.
Exactly!
Because tech bros don’t have any clue about art and its process. The same goes for the lazy people who worship these new toys. If they are satisfied with these results, good for them. Just another market niche.
What all these Algorithmic generators have seemingly neglected is AI self-consumption, that at some point the models will break down as AI generated content gets into datasets. What we have currently will be the purest sets from here on out. Soon the boundaries will blur until the AIs start to break and not function so well.
Yes this exactly this.
I especially worry about the implications of using these models in the medical industry, manufacturing, and to write software, for exactly the reason you are describing. They accuracy is bad as it is. I expect it will get a lot better over the next couple of years, but then they will have exhausted the relevant/usable datasets and AI-generated content will start to feed into the training data, at which point the results will likely start degrading. I just hope society hasn't become too dependent on the technology by the time that happens!
Training data will become unbelievably critical and valuable. All of the massive public repositories of information and art will become locked behind paywalls at best, if not privatized and locked down entirely. Hackers might even started breaking into corporate networks to secretly insert data INTO their databases in order to influence the results generated by their AI models. Google has already acknowledged the potential for models like ChatGPT to replace traditional search engines. And we all know that companies are willing to pay good money to ensure their sites appear at the top of search results.
Interesting! Never heard about that.
Been thinking about that for a while now. Glad others have, too.
AlphaGo used AI vs AI games to train itself to top level play. There's nothing wrong with AI training on AI generated content, as long as the training data is curated for good results (in this case, good sounding music).
9:43 this is literally the essence of 2000's music and i don't know how to feel
That fact that it already sounds like staticky music from a radio scares the daylights out of me as an aspiring music producer.
I dunno, the music industry is so awful, this won’t be the worst thing you see!
@@TheCALMInstitute The awful part about the music industry is its robotic repetitiveness and regurgitation of whatever gets streams. Having an algorithm that's literally programmed to do just that would only make the situation worse. What we need is more creativity.
@@TheCALMInstitute Though you're right, that is not the only awful part of the music industry, of course. 😅
These musical snippits remind me of AI Art a year ago, not quite there yet but showing dramatic improvement. And now with Dalle2 and Stable-diffusion, human artists are reduced to this czcams.com/video/zx3ROK9nOYE/video.html. If you are in the music industry, this is your 1 year warning to prepare accordingly and get out while you still can!
Subbed. Thanks for sharing!
Incredible how far along that's already come!
Instead of breaking up Google. Let’s open google up to the public. They made all these algorithms and AI tech using our data anyways.
What people say: I don't want algorithms and AI dictating how to live my life.
What the data says: Oh yes you do, you predictable humans.
What really would be helpful is an AI, that can generate a note sheet from a song. Sometimes it's really hard to figure out what's going on.
Check frettable
just learn how to do it yourself jesus christ
You had my sub at the "Hold on to your two minute papers" reference.
Okay the humming demo was actually insane.
Yeah that shit was hard
The video is just soooo high quality man.
I foresee a future where each human lives in their own AI generated audiovisual bubble that creates just the optimal moods & thought patterns for manking to keep the servers going those algorithms run on.
Matrix was more than an artistic metaphor.
Humans will be equivalent to hamsters in wheels, but with headphones on.
Counterpoint: how do u even know what kind of media ur into without exposing urself to it? What abt genres that grow on u with time? Ud just get bored otherwise
how do you know thats not what you're already experience just on a much much larger scale?
@@nicf1555 Using feedback from a brainscan? Ai is already able to take brain scans of people that are seeing certain pictures, and then accurately guessing what objects the pictures contain.
It's absolutely possible that an AI could estimate how much you like a song, (and also what emotions and mental imagery comes along with it) and then generate music according to that input.
You could have it make music that makes you sad or angry, energizer you or make you calm down.
The possibilities are endless.
I’ve been trying to find this for a couple years and it didn’t exist back then, when I used weed for like a year and my brain started making these crazy Medly-mashup things that were stuck in my head for a year
So thanks for making this video so I’d find out.
This sounds exactly like the inside of my head, nonsense vocals, vagueness but catchy
that's really cool... this is really impressive... 🥰
The vocals feel like one of those "name 1 thing in this picture" but as audio
This is amazing stuff and I think it just adds to our ability to be more creative. To me, it's still slightly off and doesn't sound quite right, almost like music in a dream or nightmare.
I think at this stage it can be a great way to get ideas for tracks
@@TracksWithDax I think that's how it will be used. Quickly iterating on ideas.
yes, because the best way to teach some creativity is to give them access to a machine that will literally do everything for them. It's why our teachers teach our students by doing all of their work for them isn't it?
I think it'll be really cool as a tool for making new sounds and really weird shit, but that most people will still prefer royalty free human stuff because it'll be a bit more reliable (and is already soulless).
@@johncasey9544 So true, lol.
From what I've seen following AI stuff for the past few years, I think it'll be a long while before we see AI music reach the heights that image generation is at... It's so hard for them to legally get huge amounts of training data (music) because of the huge labels that have an iron grip on stuff they own
That said, I also feel like the nuances of human-created music are going to be hard for an AI to capture
Gaddamn! this will be amazing
This is dope can't wait to see more ai in the future
Dude...the humming thing...I've been imagining some kind of gear or software that could pull this off but I didn't think it was possible...I come up with ideas when alone without instruments a lot and this is just an amazing solution....I didn't catch whether this is open to the public and/or free to use....I must learn to wield this powerful sorcery!
Just a heads up that audio to MIDI has been available for many years now using software like Melodyne or Ableton Live. You might want to check those out!
I'm impressed at the quality of the stuff this produces, and can see a lot of potential for it as a companion tool for songwriters and producers, but, jeeze, it's already hard enough getting music composition gigs because of all of the royalty-free loops out there that people think are Good Enough for their games and short films. With this I don't see why anyone would ever hire a soundtrack musician ever again.
I'm still fairly optimistic that not much would be worse from today's industry, partly since I still believe human artistry will always be superior. Similar to the digital art community, in the end they will still have their following. People follow artists for their art and no amount of 'good quality' AI works will change that.
If anything, there will be less tedious souless corporate music jobs since they'll just use AI for their corporate music. This will probably apply to Top 40 stuff as well.
On the bright side, maybe we'll see a rise of a counterculture dedicated to subvert 'good AI music', and the whole world of AI art by extension. Kinda' like the Dadaists.
The "best" or most successful musicians are those that harness the latest technologies, whether it's Bach with the harpsichord, the Beatles with the electric guitar and the four-track recorder, or some rapper or other with Autotune in 2010. The general public (and old people that grew up with older technology) won't like the "new music" ("It's just noise", they'll say) but technology is completely unstoppable. Just as the folk musicians couldn't stop Dylan going electric, and the "Keep Music Live" members of the Musicians Union in the 1980s couldn't hold back the tide of synths, drum machines, and samplers, no one around today will be able to stop AI. The future of music belongs to the kids that will use and love this new technology, even if it means that every human musician goes extinct.
@@AutPen38 AI is gonna be your new idea buddy, just like a little creative spark to get your musical juices flowing. With its help, you can bring your musical dreams to reality in no time. No more struggling with certain parts of the production process, AI's got your back. It's gonna level the playing field and let everyone share their ideas. But let's not forget the important role of the music producer. They still have turn those generated ideas into a killer song, add their own touch with effects like reverb, delays, and chorus, and make it sound exactly how they want it to with mixing and mastering. In the end, music producers are selling their ideas and their own personal style, that's what makes their music unique
Low budget indie games will likely be using this tool, but the ones with a bigger audience/revenue will still keep paying musicians to do better.
They won't. And in 15 years, nearly every single "creative" or "artistic" endeavor will be monetized via AI.
Photography, painting, drawing, singing, sculpting, etc...
All of these will be replaced by AI (ironically, more than likely by the same creative-types that enjoy these passions, LOL).
Heck, we're about the same time away from most labor being replaced too, IMO. The only reason it hasn't yet is due to the "old guard" that literally cannot process that it is cheaper to buy a maintain a machine than to keep hiring humans, or are threatened by tech they don't understand... And they aren't going to be around much longer, LOL.
So... Yeah.
:P
Very interesting video, kept me fully focused the whole time
Wow !!! I'm so impressed!!!
I fell in love with the Starry Night piece. It's beautiful in a very perplexing, horrifying kinda' way.
Maybe those of us without synesthesia can finally *listen* to pictures. I'm curious what the future will bring.
AI is becoming more and more scary and mind-blowing at the same time..
the hummin part blew my mind! Wow! great video @bycloud 👍
4:37 I've watched practically every TMP videos and squeezed my papers before but this is the first time my jaw has dropped!!
AI Generated music: czcams.com/users/shortsy8Io2r-Lwps?feature=share
This is amazing to be honest, great if you need some inspiration whenever in a pinch
It all sounds a bit like muzak. There's no real life in it, but.. that makes it perfect for background music in a youtube or instructional video. Now , if only they would share it.
For now
@@spydaboiii You mean you think that in time the music generated will be more complex and full of life? I hope that's true but I feel like , just with generated imagery, its easy to get the details but much harder for these machines to get the overall concept down just right. With any artform, the entire thing tells a story. That overarching theme is hard, if not impossible for this kind of AI to accomplish because it requires a higher level of understanding. perhaps they can combine these generators with another AI who's purpose is a higher level of concept?
Wow. Imagine the possibilities of this technology. Producers can now create custom sample packs - specific to a project idea or style.
🤯Story Mode! Wow! Thank you for this. 👍
I want this for my table top games. My players would freak if I had a theme for characters/npcs. I could make music from different regions... 🤯
What about generating music directly from an image without the text description of the image. Any AI formats out there that do this?
What were the prompts you used to get those samples from? Very cool
I can't wait to feed an A.I. my unfinished short story from middle school and see how it ends.
And then turns it into a full fledged Disney Channel movie, 2000's style.
Some people really just do not understand how incredible AI is, like yes, this is probably not something we will listen to but just wait a year or 2. Why does everyone expect it to be perfect instantly?That yall weren't there to see the slow and stead progress of text and image generation is a big issue.
I've seen so many people bash on this technology which i feel is a little undeserved imo.
It’s gonna replace human artist, that’s why they bash it
That almost sounds like it's a thing to look forward to... Do you want your music to be made by a soulless computer?
@@tomasviane3844 Isn't every computer soulless?
So much of music is already made and automated by computers, why not finish the job and let the computer do the entire thing? Or at least a lot more. If it means more music I see that as a win.
Anyone wants to see listen to this? This is worse enough for music, surprised so many people want it to improve
Just imagine what it will be a couples more papers down the line !! What a time to be alive !!
AAAAA THIS IS SO COOOL
The transition from Heavy Metal to Indian Rap was crazy
"What a time to be alive!! "
Lol
FYI Some of these tasks are not completely new and don't require AI. For example tools like ableton have the ability to create midi data from melodies since a coupe of years. It used to be a manual step for us to apply them to a new instrument. Same goes for the generation of melodies. There are useful websites to create melodies out of simple paraments with the UI. So a lot of the things this tool does is applying these existing technologies by beeing able to match a task to a text command. The really interesting part is how it comes all together and the interpretation of the AI.
Pretty much my thoughts as well.
Geez! This is nuts!!
wonderful! Hold music will never be the same!
5:21 Story mode would have use in games. To accentuate what is happening, a game of today plays from a bank of soundtracks, and I suppose uses some heuristics to mix in tracks from that bank. Story mode could probably produce nearly infinite, seamless, adaptive music
I like how you said that
They tried something similar with Doom IIRC, but this could probably do it even better.
Games mostly use wwise, which is a middleware that let's developers provide the changing context (in combat, new area, dungeon, etc) and the musician can create stems (snippets or layers of music) that take that context into account and will be combined together on the fly.
Games have been doing essentially this since LucasArt's iMuse system, used in Monkey Island and Dark Forces.
@@SimonBuchanNz Woah
Если тренировать нейросеть с аккордами и блоками формы, то результат будет на порядок лучше. Само собой что база аудиоматериалов должна быть тоже большая.
Нужно сделать нейросеть, которая качественно выделяет аккорды с midi -партиями из треков и сделать базу аккордов, мелодий и блоков формы (вариаций) с таймингами. А потом тренировать генеративную сеть уже на основе и волнового аудиоматериала, и теоретического.
So, I could perform in electric kazoo, then turn it to every instrument?
That's amazing!
That jazz to pop transition was crazy
I see this as analogous to version 1 of midjourney. What I see here is massive potential, and I think they're finally on the right track.
> I think they're finally on the right track.
Badumtss.
Maybe, again, with AI, there needs to be a balance of how much power it can use until it gets into privacy and morality issues with machine learning Artificial programs versus human rights.
If Lyric cohesion was there this would be amazing
Fascinating video! How can I access and use google music LM? Thank you 🙏
YAYY LESS GOO NEW VID :3!
This is truly fascinating. I'd use it as an idea generator - and I'm not even making music.
The ability to create music in a specific style u have in ur mind when creating videos is very nice.
U could just learn music too
feels like Ai Dj's are only a year away.
Craaazy!! Sampling on steroids 😂😂 great video👍👍👍🙏🙏
now we're slowly entering this new era where in the near future the usual industry/man-made products will turn into *artesanal*
I really want this to be released, it’s help my channel so much lol been having writers block for months.
Unfortunately... Google... so at least OpenAI release something similar, you would have to wait like 5 years for.
cmon make some original content not AI generated :/
It will soon make musicians obsolete :(
We're living in the end times
It's been 90 years since the publication of Huxley's Brave New World, in which the term "synthetic music" was used. For those who don't know, the book was meant to be dystopian, despite the creature comforts of its characters. Unlike Orwell's 1984, where order was coerced from unwilling citizens, Huxley's vision was of people embracing and fostering the dystopia gladly. In the long reach of history, 90 years was only yesterday.
If this is what is now (and going forward) what we consider music (although I don't know where the "muse" is found here), then my guess is that a young person who is accidentally killed can be replace by an android with a similar face, and the family, friends and associates will feel that everything has been restored to the way it had been before the loss. We used to give children dolls to keep them company, but it's clear that the trick will work on adults too. (A chicken panicked by isolation will be fooled by the company of a mirror. That's not true of a chimpanzee, though.)
For me, I take no pleasure or solace in these developments. You may see things differently.
I was thinking the same thing. I'm surprised everyone is embracing the idea of all entertainment being made with no effort by a souless machine
This stuff will be improved a lot in a few years. This is like an advanced plugin. I think all musicians would like to use this in someways, to experiment.
But i find this program not scary as an ai image
This is better than riffusion, but similar to a lot of other AI stuff right now, it’s pretty much just “stuff that seems like music, but doesn’t make any sense when listened to closely”.
It’s been interesting to see that music is the item valued the least by AI devs - likely because we already have too much music and it’s basically free.
Why do you say that music is the thing AI devs value the least?
What I want is to be able to play a rough song idea on a guitar, and have the AI produce a full production, that would be fun I guess.
Like a drummer that follow me even in not perfectly in tempo
Thank you for this! Because the vocals aren't really linguistic, it made me think of eliminating them and thus a question: do they have "separate track downloads" for multi-track recordings? If not, seems a good suggestion.
The midi notes will be better output. These audio file can’t really be used
At last,.. Those were the clearest, most understandable hip hop lyrics I've ever heard!