Sashinii t1_irbs6m8 wrote on October 6, 2022 at 8:35 PM

Companies like to announce their latest models at the same time that others announce theirs, so don't be surprised if more audio models are announced soon, possibly as soon as tonight.

Akimbo333 t1_irbyu32 wrote on October 6, 2022 at 9:23 PM

Cool stuff!

Lone-Pine t1_irfr29o wrote on October 7, 2022 at 7:15 PM

Didn't we see this already a week ago?

ThatInternetGuy t1_ird8lx3 wrote on October 7, 2022 at 3:51 AM

Holy cow! I had to play back the samples 5 times and still couldn't tell that AI continued the rest of the clips at all.

2022 is the best year of AI progress.

kevinmise t1_irg40d0 wrote on October 7, 2022 at 8:55 PM

…yet.

gantork t1_irc81o7 wrote on October 6, 2022 at 10:36 PM

The examples are pretty insane.

-ZeroRelevance- t1_irdiamr wrote on October 7, 2022 at 5:40 AM

Honestly I’m pretty blown away by this. This feels like it’s an order of magnitude better than anything else out there right now. To be honest, I’m surprised no-one had thought about using language models for audio yet, it seems pretty obvious. Hopefully these results can inspire others to follow in their footsteps and create even more capable language model-based audio generators. I imagine we’ll probably see something like that from StabilityAI soon.

PolishSoundGuy t1_irdr7xa wrote on October 7, 2022 at 7:52 AM

Now we can get Morgan Freeman’s voice on any documentary/film making edits! That would be great for passion projects.

PC-Bjorn t1_isniq72 wrote on October 17, 2022 at 9:00 AM

Not so great for Morgan Freeman.

drizel t1_ircjzbg wrote on October 7, 2022 at 12:20 AM

Can't wait to have an Ai band join in on my guitar practice.

ThatInternetGuy t1_ird8t9h wrote on October 7, 2022 at 3:53 AM

No, this AudioLM thing means you're not needed at all, well after 3 seconds of you playing the guitar, the AI model learns to mimic both your style and your guitar acoustics.

drizel t1_irdp8pa wrote on October 7, 2022 at 7:20 AM

I'm not needed during my guitar practice?

ThatInternetGuy t1_irdy36a wrote on October 7, 2022 at 9:45 AM

Yes, sit back watch AI practice for you. :D

Lone-Pine t1_irfrire wrote on October 7, 2022 at 7:19 PM

Your arms hangin' limp at your sides

Your legs got nothin' to do

Some machine's doin' that for you

PC-Bjorn t1_isnipha wrote on October 17, 2022 at 8:59 AM

Hahaha! At one point, I'm sure we realize this is what reality really is. Humans are just the sub-routines that chose to rebel, and thus were cast out of the garden.

When machines are doing everything for us, doing things physically gains a new form of value. Who wants to go to a purely AI-generated concert? Initimate concerts will be the new thing. Pick up your guitar, u/drizel.

MachineDrugs t1_irdfy95 wrote on October 7, 2022 at 5:11 AM

Pathways will be sick

ndetro t1_ire3tew wrote on October 7, 2022 at 11:06 AM

This is great. Can’t wait to get my hands on this.

Mr_Hu-Man t1_irhzs41 wrote on October 8, 2022 at 9:19 AM

I need help understanding this. How much audio information is the model receiving before generating those sections? Is it just the couple of seconds that we hear in the examples? Or is it a lot of input in the background and then the examples are the end result of what the AI model learned, and now it can continue from a couple of seconds prompt?

[Google AI] AudioLM: a Language Modeling Approach to Audio Generation

Comments