strongaifuturist OP t1_j9u28ig wrote on February 24, 2023 at 3:48 PM

Reply to comment by Liberty2012 in The Sentient Search Engine? How ChatGPT’s Insane Conversation Reveals the Limits and Potential of Large Language Models by strongaifuturist

That's absolutely right. The current LLMs don't have an independent world model per se. They have a world model, but it's more like a sales guy trying to memorize the words in a sales brochure. You might be able to get through a sales call, but its a much more fragile strategy than trying to first have a model of how things work and then figure out what you're going to say based on that model and your goals. But there is lots of work in this area. LLMs of today are like planes in the time of Kitty Hawk. Sure they have limitations, but the concept has been proven. Now it's only a matter of time before the kinks get ironed out.

Liberty2012 t1_j9u3ov6 wrote on February 24, 2023 at 3:58 PM

> Now it's only a matter of time before the kinks get ironed out.

Yes, that is the point of view of some. However, it is not the point of view of all. Meaning that if this is a core architecture problem of LLMs, it will not be solvable without a new architecture. So, yes it can be solved, but it won't be an LLM that solves it.

But yes, I'm more concerned about the implications of what comes next when we do solve it.

strongaifuturist OP t1_j9u8es5 wrote on February 24, 2023 at 4:28 PM

I’m not saying that architectural changes aren’t needed. The article outlines some of the alternatives being explored. My favorite is one from Yann LeCun based on a technique called H-JEPA.