Submitted by blacklemon67 t3_11misax in MachineLearning
harharveryfunny t1_jbjk9nb wrote
Reply to comment by harharveryfunny in [D] Why are so many tokens needed to train large language models? by blacklemon67
Just to follow up: the reason the "interact with the world" approach is so much more efficient is that it's largely curiosity driven. We proactively try to fill gaps in our knowledge rather than reading a set of encyclopedias and hoping it covers what we need to know. We learn in a much more targeted fashion.
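In ML terms this gap-filling behavior resembles active learning with uncertainty sampling: the learner queries the examples it is least certain about instead of consuming the whole corpus. A minimal sketch, with a toy pool of (example, predicted distribution) pairs; all names and data here are illustrative, not from any specific library:

```python
import math

def entropy(probs):
    """Shannon entropy (bits) of a predicted distribution; higher = bigger knowledge gap."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def pick_next(pool, k=2):
    """Rank the unlabeled pool by model uncertainty and query the top k examples."""
    ranked = sorted(pool, key=lambda t: entropy(t[1]), reverse=True)
    return [example for example, _ in ranked[:k]]

# Toy pool: each example paired with the model's current predicted distribution.
pool = [
    ("cat photo", [0.95, 0.05]),   # model is already confident
    ("blurry shape", [0.5, 0.5]),  # maximal uncertainty: the biggest gap
    ("dog photo", [0.9, 0.1]),
]
print(pick_next(pool, k=1))  # the learner asks about "blurry shape" first
```

The curiosity-driven learner spends its next label budget where entropy is highest, which is the targeted behavior the comment describes, versus a passive learner that reads examples in corpus order regardless of what it already knows.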
visarga t1_jbn5g3w wrote
On the other hand, an LLM has broad knowledge of every topic, a true dilettante. We can't keep up at that level.