Submitted by Danuer_ t3_xxfi5y in singularity
Comments
ThatInternetGuy t1_ird8lx3 wrote
Holy cow! I had to play back the samples 5 times and still couldn't tell that AI continued the rest of the clips at all.
2022 is the best year of AI progress.
kevinmise t1_irg40d0 wrote
…yet.
gantork t1_irc81o7 wrote
The examples are pretty insane.
-ZeroRelevance- t1_irdiamr wrote
Honestly I’m pretty blown away by this. This feels like it’s an order of magnitude better than anything else out there right now. To be honest, I’m surprised no-one had thought about using language models for audio yet, it seems pretty obvious. Hopefully these results can inspire others to follow in their footsteps and create even more capable language model-based audio generators. I imagine we’ll probably see something like that from StabilityAI soon.
PolishSoundGuy t1_irdr7xa wrote
Now we can get Morgan Freeman’s voice on any documentary/film making edits! That would be great for passion projects.
PC-Bjorn t1_isniq72 wrote
Not so great for Morgan Freeman.
drizel t1_ircjzbg wrote
Can't wait to have an Ai band join in on my guitar practice.
ThatInternetGuy t1_ird8t9h wrote
No, this AudioLM thing means you're not needed at all, well after 3 seconds of you playing the guitar, the AI model learns to mimic both your style and your guitar acoustics.
drizel t1_irdp8pa wrote
I'm not needed during my guitar practice?
ThatInternetGuy t1_irdy36a wrote
Yes, sit back watch AI practice for you. :D
Lone-Pine t1_irfrire wrote
Your arms hangin' limp at your sides
Your legs got nothin' to do
Some machine's doin' that for you
PC-Bjorn t1_isnipha wrote
Hahaha! At one point, I'm sure we realize this is what reality really is. Humans are just the sub-routines that chose to rebel, and thus were cast out of the garden.
When machines are doing everything for us, doing things physically gains a new form of value. Who wants to go to a purely AI-generated concert? Initimate concerts will be the new thing. Pick up your guitar, u/drizel.
MachineDrugs t1_irdfy95 wrote
Pathways will be sick
ndetro t1_ire3tew wrote
This is great. Can’t wait to get my hands on this.
Mr_Hu-Man t1_irhzs41 wrote
I need help understanding this. How much audio information is the model receiving before generating those sections? Is it just the couple of seconds that we hear in the examples? Or is it a lot of input in the background and then the examples are the end result of what the AI model learned, and now it can continue from a couple of seconds prompt?
Sashinii t1_irbs6m8 wrote
Companies like to announce their latest models at the same time that others announce theirs, so don't be surprised if more audio models are announced soon, possibly as soon as tonight.