Submitted by Vegetable-Skill-9700 t3_121agx4 in deeplearning
StrippedSilicon t1_jdte8lj wrote
Reply to comment by BellyDancerUrgot in Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
That's why I'm appealing to "we don't actually understand what it's doing" case. Certainly the AGI-like intelligence explanation falls apart in alot of cases, but the explanation of only spitting out the training data in a different order or context doesn't work either.
Viewing a single comment thread. View all comments