Comments


Ortus12 t1_is9vvva wrote

AI designing better algorithms for AI. AI designing better computer chips for AI.

It has begun.

176

treebeard280 t1_isaduh1 wrote

Am I understanding correctly that this means computers will now use less power to perform the same task?

34

ZoomedAndDoomed t1_isafze0 wrote

GPT-3 summary and question answers:

What is the breakthrough discussed in this article?

The breakthrough discussed in this article is a more efficient way to perform matrix multiplication, discovered by DeepMind.

How is this breakthrough significant?

This breakthrough is significant because it can translate into large efficiency gains for AI applications, allowing them to run more quickly on existing hardware.

How does this breakthrough work?

This breakthrough works by training a neural network called AlphaTensor to play a game in which the goal is to zero out a matrix. AlphaTensor gradually improves over time and eventually discovers more efficient algorithms for matrix multiplication.

Give me a short description of the article/breakthrough.

The article discusses a breakthrough by DeepMind in which a more efficient algorithm for matrix multiplication was discovered. This algorithm can translate into large efficiency gains for AI applications, allowing them to run more quickly on existing hardware.
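
For context (an editorial sketch, not part of the GPT-3 output above): the "textbook" algorithm performs n*n*n scalar multiplications for two n x n matrices, and it is this multiplication count that Strassen-style and AlphaTensor-style algorithms reduce. The function and counter below are purely illustrative.

```python
# Illustrative only: count the scalar multiplications in the "textbook"
# algorithm, which uses n*n*n of them for two n x n matrices.
def textbook_matmul(A, B):
    n = len(A)
    C = [[0] * n for _ in range(n)]
    mults = 0
    for i in range(n):
        for j in range(n):
            for k in range(n):
                C[i][j] += A[i][k] * B[k][j]
                mults += 1
    return C, mults

C, count = textbook_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]])
print(C, count)  # [[19, 22], [43, 50]] 8 -- Strassen's scheme needs only 7
```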

164

2D_VR t1_isanph7 wrote

That is a weird algorithm. I don't think I would have come up with it in a million years.

12

DakPara t1_isawlu4 wrote

I find this very impressive.

10

ByThisKeyboardIRule t1_isba5yx wrote

Too bad that in the real world, things like proper cache utilization matter more than the operation count. Show us the benchmarks or it doesn't matter.

9

korben2600 t1_isbcn5t wrote

Right? Lol. Just looking at the promoted comment comparing "textbook" 2x2 matrix multiplication with Strassen's algorithm, I'm amazed he was able to achieve that efficiency improvement. I can see why nobody's made progress in 50 years. The 5x5 matrix algorithm, which knocked 3 steps off the older method, must be insane.
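
For anyone curious about the comparison mentioned above, here is Strassen's classic 2x2 scheme, which uses 7 scalar multiplications instead of the textbook 8 (this is the 1969 algorithm, not the new AlphaTensor one; the Python below is just an illustrative sketch):

```python
def strassen_2x2(A, B):
    """Multiply two 2x2 matrices with 7 scalar multiplications (Strassen, 1969)."""
    (a11, a12), (a21, a22) = A
    (b11, b12), (b21, b22) = B
    m1 = (a11 + a22) * (b11 + b22)
    m2 = (a21 + a22) * b11
    m3 = a11 * (b12 - b22)
    m4 = a22 * (b21 - b11)
    m5 = (a11 + a12) * b22
    m6 = (a21 - a11) * (b11 + b12)
    m7 = (a12 - a22) * (b21 + b22)
    return [[m1 + m4 - m5 + m7, m3 + m5],
            [m2 + m4,           m1 - m2 + m3 + m6]]

print(strassen_2x2([[1, 2], [3, 4]], [[5, 6], [7, 8]]))  # [[19, 22], [43, 50]]
```

Applied recursively to blocks of larger matrices, saving that one multiplication per 2x2 step is what brings the asymptotic cost below n^3.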

19

mcilrain t1_isbru8i wrote

Maybe I don't understand matrix multiplication but isn't that just iterating over two arrays of numbers and multiplying each pair together?

I don't understand how that could be optimized; it shouldn't be possible to make it simpler than LOAD -> MULTIPLY -> STORE, right?

2

mcilrain t1_isbtvx2 wrote

That wouldn't work in all cases though, right? Wouldn't the logic needed to determine when it's safe make it slow? Or are errors worth the increased performance?

1

ghostfuckbuddy t1_isbv1vj wrote

That's cool and all, but when are they going to make AlphaZero open-source so I can apply it to my math problem?

6

PoliticsRealityTV t1_isc4yh3 wrote

If you want to mess around with GPT-3, you can use OpenAI's Playground. I think a new account gives you a couple of months of free access, and then it costs to continue. For anything automated you'd probably want to look into their API.

3

pie3636 t1_iscem6o wrote

> isn't that just iterating over two arrays of numbers and multiplying each pair together?

It isn't. That would be elementwise multiplication, which is sometimes used but not nearly as useful/ubiquitous.
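
A quick sketch of the difference, assuming NumPy (the array values are arbitrary):

```python
import numpy as np

A = np.array([[1, 2], [3, 4]])
B = np.array([[5, 6], [7, 8]])

print(A * B)  # elementwise product: [[ 5 12] [21 32]]
print(A @ B)  # matrix product: [[19 22] [43 50]] -- each entry is a row-by-column dot product
```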

2

ZoomedAndDoomed t1_iscjg5w wrote

I prompted it with this:

"Can you please summarize this article and answer these four questions.

  1. What is the breakthrough discussed in this article
  2. How is this breakthrough significant?
  3. How does this breakthrough work?
  4. Give me a short description of the article/breakthrough. Here's the article:"

And then I literally just copy in the whole article (you can select all the text on your phone, and it'll sort through all the other details) and it'll output it to you. I have been using this method for days; I have low reading comprehension, but GPT-3 has greatly helped me understand more in less time.
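
If you wanted to automate that workflow rather than paste into the Playground, a rough sketch using the OpenAI Python package's legacy Completion endpoint might look like this (the model name, token limit, and file path are placeholders, not recommendations):

```python
import openai

openai.api_key = "YOUR_API_KEY"  # placeholder

PROMPT = """Can you please summarize this article and answer these four questions.
1. What is the breakthrough discussed in this article
2. How is this breakthrough significant?
3. How does this breakthrough work?
4. Give me a short description of the article/breakthrough. Here's the article:
"""

article_text = open("article.txt").read()  # the copied article body (hypothetical file)

# Legacy Completion endpoint of the openai package (pre-1.0 versions);
# the model name and token limit below are illustrative.
response = openai.Completion.create(
    model="text-davinci-002",
    prompt=PROMPT + article_text,
    max_tokens=400,
    temperature=0.2,
)
print(response["choices"][0]["text"])
```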

20

Nostr0m t1_iscv4x4 wrote

No, these are deterministic algorithms guaranteed to produce the right answer, so there are no errors involved. If you would like to learn more, look into an intro algorithms course; pretty interesting stuff.

5

mcilrain t1_iscw9kh wrote

Floating-point calculations are always slightly inaccurate to some degree as a performance trade-off; increasing the inaccuracy of the result in exchange for even greater efficiency is plausible.

I'd expect an algorithms course to go right over my head; I'm good with logic but terrible with numbers.
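
One quick way to see the rounding point in practice, assuming NumPy: the same numbers summed in a different order usually give slightly different floating-point results, which is why a rearranged multiplication scheme can trade a little accuracy for speed.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(1_000_000)

# Same mathematical sum, two evaluation orders:
print(np.sum(x))      # NumPy's pairwise summation
print(float(sum(x)))  # plain left-to-right summation
# The two printed values typically differ in the last few digits.
```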

3

Chop1n t1_isdc0ri wrote

This really stretches the limits of what the word "record" means. When you "break a record", it's in terms of performance during a specific kind of procedure, game, whatever.

This is a matter of changing the procedure itself. It's a new convention, or maybe even what you could call a "paradigm shift" within the domain of matrix multiplication, but calling it a "math record" is utterly weird. Breaking a "math record" would be like, I don't know, greatest number of problems solved in your local high school mathletes competition or something.

2

FeezusChrist t1_isdjo3n wrote

If you spent more than a minute digging into the research:

“Algorithms in this rich space have different mathematical and practical properties. Leveraging this diversity, we adapted AlphaTensor to specifically find algorithms that are fast on a given hardware, such as Nvidia V100 GPU, and Google TPU v2. These algorithms multiply large matrices 10-20% faster than the commonly used algorithms on the same hardware, which showcases AlphaTensor’s flexibility in optimising arbitrary objectives.”

https://www.deepmind.com/blog/discovering-novel-algorithms-with-alphatensor
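
To sanity-check a claim like that on your own hardware, a minimal timing sketch (assuming NumPy; the baseline measured here is whatever BLAS your install ships with, not AlphaTensor's algorithm):

```python
import time
import numpy as np

n = 4096
A = np.random.rand(n, n).astype(np.float32)
B = np.random.rand(n, n).astype(np.float32)

_ = A @ B  # warm-up so one-time setup cost isn't measured
start = time.perf_counter()
_ = A @ B
elapsed = time.perf_counter() - start

# Effective throughput, assuming the standard 2*n^3 floating-point operations
print(f"{elapsed:.3f} s, {2 * n**3 / elapsed / 1e9:.1f} GFLOP/s")
```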

11

ebolathrowawayy t1_isfalg2 wrote

Also keep in mind you might be recorded at all times by all nearby devices; you can never be sure! My policy is to try to be nice at all times so AI can't hold a past transgression against me in 2030.

3

GoGayWhyNot t1_isgzyky wrote

You comment a criticism but you didn't read the entire thing, classic. You must think you're much smarter than the people who decided it was worthy of being the cover of the most respected scientific journal in existence, with that 30-second big-brain problem-finding that nobody else must have thought of.

5

Lawjarp2 t1_isii3vy wrote

47 calculations instead of 49 is pretty small, tbh. But then it is impressive that it's possible at all.

0

PolymorphismPrince t1_ismncj6 wrote

For example, to find the entry in row 4 and column 3 of the matrix you get out of the product, you take all the entries in the fourth row of the first matrix and all the entries in the third column of the second matrix, multiply those lists together pairwise (the way you were talking about in your comment), and then add up all those products.
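
The same recipe in code, assuming NumPy (rows and columns are 1-based in the explanation above, 0-based in the indexing below):

```python
import numpy as np

A = np.random.rand(6, 5)   # first matrix
B = np.random.rand(5, 7)   # second matrix
C = A @ B                  # full product

# Entry in row 4, column 3 (1-based) = dot product of A's 4th row with B's 3rd column
entry = sum(A[3, k] * B[k, 2] for k in range(A.shape[1]))
print(np.isclose(entry, C[3, 2]))  # True
```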

1

Spoffort t1_isxn8i7 wrote

The text mentions a 10-20% increase in speed: instead of training the neural network for 10 months, we would only spend 8-9 months on it. One month is relatively long, in my opinion :)
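
The arithmetic behind that estimate, as a quick sketch (the 10-month baseline is the commenter's hypothetical figure):

```python
baseline_months = 10  # hypothetical training time
for speedup in (0.10, 0.20):
    print(round(baseline_months / (1 + speedup), 1))  # prints 9.1, then 8.3
```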

1

ByThisKeyboardIRule t1_itgt5e4 wrote

Yeah? Still no benchmarks in the article, just generalizations like 10% (which is how much they cut the number of operations). It is good that another guy here referenced the original report, where there are actual benchmarks. The so-called Strassen algorithm is actually slower on CPU than the standard algorithm for reasonably sized matrices, even though it performs a significantly smaller number of operations. Mind-blowing, huh? Seems computers are not so simple after all.

So, my question was pretty reasonable. Stop being lazy and learn something instead of insulting people who do.
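
For anyone who wants to produce those benchmarks themselves, a minimal timing harness, assuming NumPy (swap in any alternative matmul implementation you want to compare against the standard one):

```python
import time
import numpy as np

def bench(matmul, n, repeats=3):
    """Best wall-clock time for multiplying two random n x n matrices with `matmul`."""
    A, B = np.random.rand(n, n), np.random.rand(n, n)
    matmul(A, B)  # warm-up run, excluded from timing
    best = float("inf")
    for _ in range(repeats):
        t0 = time.perf_counter()
        matmul(A, B)
        best = min(best, time.perf_counter() - t0)
    return best

for n in (256, 1024, 4096):
    print(n, f"{bench(np.matmul, n):.4f} s")  # replace np.matmul with any alternative to compare
```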

1