Submitted by Singularian2501 t3_zyeeks in MachineLearning

Paper: https://arxiv.org/abs/2212.13894

Abstract:

>Remarkable progress has been made on automated reasoning with knowledge specified as unstructured, natural text, by using the power of large language models (LMs) coupled with methods such as Chain-of-Thought prompting and Selection-Inference. These techniques search for proofs in the forward direction from axioms to the conclusion, which suffers from a combinatorial explosion of the search space, and thus high failure rates for problems requiring longer chains of reasoning. The classical automated reasoning literature has shown that reasoning in the backward direction (i.e. from the intended conclusion to the set of axioms that support it) is significantly more efficient at proof-finding problems. We import this intuition into the LM setting and develop a Backward Chaining algorithm, which we call LAMBADA, that decomposes reasoning into four sub-modules, each of which can be simply implemented by few-shot prompted LM inference. We show that LAMBADA achieves massive accuracy boosts over state-of-the-art forward reasoning methods on two challenging logical reasoning datasets, particularly when deep and accurate proof chains are required.
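
Reading the abstract, the control flow is essentially depth-limited backward chaining in which every primitive check is delegated to a few-shot prompted LM. Below is a minimal sketch of that reading; the four sub-module names (Fact Check, Rule Selection, Goal Decomposition, Sign Agreement) follow the paper, but the `lm_*` helpers are illustrative stand-ins, stubbed with trivial matching over `(antecedents, consequent)` rule tuples so the skeleton actually runs. This is not the authors' code.

```python
# Sketch of LAMBADA-style backward chaining (one reading of the
# abstract, not the authors' implementation). Each lm_* helper stands
# in for a few-shot prompted LM call; here they are stubbed with
# trivial matching so the control flow is runnable.

def lm_fact_check(goal, facts):
    # Module 1, Fact Check: does some fact directly settle the goal?
    return True if goal in facts else None

def lm_rule_selection(goal, rules):
    # Module 2, Rule Selection: which rules conclude something
    # matching the goal?
    return [r for r in rules if r[1] == goal]

def lm_sign_agreement(goal, rule):
    # Module 3, Sign Agreement: does the rule's conclusion agree in
    # sign (negated vs. not) with the goal? Stubbed to always agree.
    return True

def lm_goal_decomposition(goal, rule):
    # Module 4, Goal Decomposition: rewrite the goal as the subgoals
    # (the rule's antecedents) that would have to hold.
    return rule[0]

def prove(goal, facts, rules, depth=6):
    """Depth-limited backward chaining from the goal toward the facts."""
    if depth == 0:
        return False
    verdict = lm_fact_check(goal, facts)
    if verdict is not None:
        return verdict
    for rule in lm_rule_selection(goal, rules):
        if not lm_sign_agreement(goal, rule):
            continue
        subgoals = lm_goal_decomposition(goal, rule)
        if all(prove(g, facts, rules, depth - 1) for g in subgoals):
            return True
    return False

# Toy run: the fact "rain" plus the rule rain -> wet entails "wet".
print(prove("wet", {"rain"}, [(["rain"], "wet")]))  # True
```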

58

Comments

artoftheproblem t1_j272x5z wrote

So hard to keep up with progress... I'm still getting over the simple insight that asking the model to "think step by step" gave a huge boost in accuracy in the original InstructGPT model.

18

nogop1 t1_j27xexh wrote

I wonder whether the large models are better not because of their larger parameter count, but because of their greater number of layers, and are thus able to perform more computation steps and search more deeply.

I've also been wondering whether certain questions/algorithms need a variable number of steps. Leaving aside the universal function approximation theorem, wouldn't simple exponentiation require that, if I asked an LLM/transformer to perform such arithmetic operations?
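
As a concrete version of that intuition (a toy illustration, not from the paper): even fast exponentiation needs a number of multiply steps that grows with the exponent, roughly log2(n), whereas a fixed-depth transformer spends the same number of layer passes on every input.

```python
# Exponentiation by squaring: the step count grows with the exponent
# (about log2(n) squarings), while a fixed-depth network performs the
# same amount of computation per token regardless of the input.

def power(base: int, exp: int) -> tuple[int, int]:
    result, steps = 1, 0
    while exp > 0:
        if exp & 1:
            result *= base
        base *= base
        exp >>= 1
        steps += 1
    return result, steps

print(power(3, 5)[1])     # 3 steps
print(power(3, 1000)[1])  # 10 steps
```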

1

farmingvillein t1_j2awxls wrote

Yes, and the old one was named relatively sanely:

> LAnguage Modeling Broadened to Account for Discourse Aspects

Whereas the naming in the new Google paper is a horror show:

> We develop a hybrid LAnguage Model augmented BAckwarD chAining technique, dubbed LAMBADA

7

currentscurrents t1_j2by81g wrote

So, if I'm understanding right:

  • Backward chaining is an old classical algorithm for proof search in logic.

  • They've implemented backward chaining using a handful of few-shot prompted language models, so it works on knowledge stated in natural text.

  • Given a knowledge base (plenty of which are available as datasets these days), it can decompose a statement and check whether it's logically consistent with that knowledge.

  • The reason they're interested in this is to use it as a training signal to make language models more accurate.

This is effectively an old "expert system" from the 70s, rebuilt out of neural networks. I wonder what other classical algorithms you could implement this way.
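
For contrast with the backward sketch under the abstract, here is the forward-chaining loop such a 70s production system would run (same toy `(antecedents, consequent)` rule format; my illustration, not the paper's baseline code). Note that it derives every reachable fact whether or not it bears on the goal, which is exactly the search-space blow-up the abstract attributes to forward reasoning.

```python
# Classic forward chaining, the direction the paper argues against:
# keep firing any rule whose antecedents are all known, until nothing
# new can be derived, then check whether the goal was reached.

def forward_prove(goal, facts, rules):
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for antecedents, consequent in rules:
            if consequent not in known and all(a in known for a in antecedents):
                known.add(consequent)  # fire the rule
                changed = True
    # Everything derivable got derived, relevant to the goal or not;
    # that breadth is the combinatorial blow-up backward chaining avoids.
    return goal in known

print(forward_prove("slippery", {"rain"},
                    [(["rain"], "wet"), (["wet"], "slippery")]))  # True
```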

I also wonder if you could use this to build its own knowledge base from internet data. Since the internet is full of contradictory information, you'd have to compare new statements against existing ones somehow and decide which to keep.

8

currentscurrents t1_j2csenb wrote

The number of layers is a hyperparameter, and people already run optimization to find good values for hyperparameters like that.

Model size does seem to follow a real scaling law. It's possible we'll come up with better algorithms that work in smaller models, but it's also possible that neural networks simply need to be big to be useful. With billions of neurons and an even larger number of connections/parameters, the human brain is certainly a very large network.

3

xt-89 t1_j2dxg0p wrote

I’ve been thinking that we’re really leaving the domain of ‘machine learning’ and entering the domain of ‘artificial cognition’. It seems like more of these expert-system-style algorithms will be used going forward.

3