nul9090 t1_j9th1xg wrote

Well, at the moment, we can't really know. Quadratic complexity is definitely bad: it limits how far we can push the architecture, and it makes it hard to run these models on consumer hardware. But if we are as close to a breakthrough as some people believe, maybe it isn't a problem.

6

nul9090 t1_j9sqmaf wrote

In my view, the biggest flaw of transformers is that self-attention has quadratic complexity in sequence length. This basically means they will not become significantly faster anytime soon, and context window sizes will grow slowly too.
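To make the quadratic point concrete, here's a toy sketch (mine, not from the comment) of why standard attention scales badly: the score matrix for n tokens has n × n entries, so doubling the context quadruples the work and memory.

```python
import numpy as np

def attention_scores(q, k):
    """q, k: (n, d) arrays of query/key vectors -> (n, n) score matrix."""
    d = q.shape[1]
    # Every token attends to every other token: n*n entries.
    return (q @ k.T) / np.sqrt(d)

n, d = 1024, 64
rng = np.random.default_rng(0)
scores = attention_scores(rng.standard_normal((n, d)),
                          rng.standard_normal((n, d)))
print(scores.shape)  # (1024, 1024): this matrix is the quadratic bottleneck
```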

Linear transformers and Structured State Space Sequence (S4) models are promising approaches to solving that, though.
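For intuition on the linear-transformer idea, here's a minimal sketch (my own illustration, with an assumed ReLU-style feature map `phi`): replace the softmax with a kernel feature map, then reassociate the matrix product as phi(Q) @ (phi(K)^T V), which never materializes the (n, n) matrix and runs in time linear in n.

```python
import numpy as np

def phi(x):
    # Simple positive feature map; real linear transformers use various kernels.
    return np.maximum(x, 0) + 1e-6

def linear_attention(Q, K, V):
    """Q, K: (n, d); V: (n, d_v). Returns (n, d_v) without an (n, n) matrix."""
    Qp, Kp = phi(Q), phi(K)
    KV = Kp.T @ V                  # (d, d_v): cost O(n * d * d_v)
    Z = Qp @ Kp.sum(axis=0)        # per-row normalizer, cost O(n * d)
    return (Qp @ KV) / Z[:, None]  # cost O(n * d * d_v), linear in n
```

Because the per-row weights are normalized, feeding in a constant V returns that constant back, which is a quick sanity check that the reassociated product still behaves like an attention average.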

My hunch is that LLMs will be very useful in the near term but of little value to an eventual AGI architecture, though I am unable to convincingly explain why.

27

nul9090 t1_j97krdy wrote

The hostility was uncalled for. What you're asking for is a lot of work for a Reddit post. But there are plenty of tests and anecdotes that would lead one to believe it is lacking in important ways in its capacity to reason and to understand.

I'm not a fan of Gary Marcus but he raises valid criticisms here in a very recent essay: https://garymarcus.substack.com/p/how-not-to-test-gpt-3

Certainly, there are even more impressive models to come. I firmly believe that, some day, a machine will surpass human intelligence.

2