[D] Do we really need 100B+ parameters in a large language model? Submitted by Vegetable-Skill-9700 t3_121a8p4 on March 25, 2023 at 4:14 AM in MachineLearning 84 comments 101
A1-Delta t1_jdl325g wrote on March 25, 2023 at 4:26 AM Reply to comment by wojapa in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700 GPT-J-6B fine tuned on Alpaca’s instruction dataset. Permalink Parent 4
Viewing a single comment thread. View all comments