Comments

You must log in or register to comment.

Sashinii t1_irbs6m8 wrote

Companies like to announce their latest models at the same time that others announce theirs, so don't be surprised if more audio models are announced soon, possibly as soon as tonight.

27

ThatInternetGuy t1_ird8lx3 wrote

Holy cow! I had to play back the samples 5 times and still couldn't tell that AI continued the rest of the clips at all.

2022 is the best year of AI progress.

16

gantork t1_irc81o7 wrote

The examples are pretty insane.

9

-ZeroRelevance- t1_irdiamr wrote

Honestly I’m pretty blown away by this. This feels like it’s an order of magnitude better than anything else out there right now. To be honest, I’m surprised no-one had thought about using language models for audio yet, it seems pretty obvious. Hopefully these results can inspire others to follow in their footsteps and create even more capable language model-based audio generators. I imagine we’ll probably see something like that from StabilityAI soon.

7

PolishSoundGuy t1_irdr7xa wrote

Now we can get Morgan Freeman’s voice on any documentary/film making edits! That would be great for passion projects.

5

drizel t1_ircjzbg wrote

Can't wait to have an Ai band join in on my guitar practice.

4

ThatInternetGuy t1_ird8t9h wrote

No, this AudioLM thing means you're not needed at all, well after 3 seconds of you playing the guitar, the AI model learns to mimic both your style and your guitar acoustics.

3

drizel t1_irdp8pa wrote

I'm not needed during my guitar practice?

4

ThatInternetGuy t1_irdy36a wrote

Yes, sit back watch AI practice for you. :D

1

Lone-Pine t1_irfrire wrote

Your arms hangin' limp at your sides

Your legs got nothin' to do

Some machine's doin' that for you

2

PC-Bjorn t1_isnipha wrote

Hahaha! At one point, I'm sure we realize this is what reality really is. Humans are just the sub-routines that chose to rebel, and thus were cast out of the garden.

When machines are doing everything for us, doing things physically gains a new form of value. Who wants to go to a purely AI-generated concert? Initimate concerts will be the new thing. Pick up your guitar, u/drizel.

1

ndetro t1_ire3tew wrote

This is great. Can’t wait to get my hands on this.

2

Mr_Hu-Man t1_irhzs41 wrote

I need help understanding this. How much audio information is the model receiving before generating those sections? Is it just the couple of seconds that we hear in the examples? Or is it a lot of input in the background and then the examples are the end result of what the AI model learned, and now it can continue from a couple of seconds prompt?

2