[D] Do we really need 100B+ parameters in a large language model? Submitted by Vegetable-Skill-9700 t3_121a8p4 on March 25, 2023 at 4:14 AM in MachineLearning 84 comments 101
AllowFreeSpeech t1_je3rjmv wrote on March 29, 2023 at 5:00 AM Reply to comment by currentscurrents in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700 20:1 ratio of tokens:params Permalink Parent 1
Viewing a single comment thread. View all comments