Submitted by Vegetable-Skill-9700 t3_121a8p4 in MachineLearning
harharveryfunny t1_jdmd38s wrote
Reply to comment by alrunan in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
>You should read the LLaMA paper.
OK - will do. What specifically did you find interesting (related to scaling or not) ?
Viewing a single comment thread. View all comments