[D] Do we really need 100B+ parameters in a large language model? Submitted by Vegetable-Skill-9700 t3_121a8p4 on March 25, 2023 at 4:14 AM in MachineLearning 84 comments 101
_Repeats_ t1_jdm3h7a wrote on March 25, 2023 at 12:25 PM For enterprise use cases, you might need only a small model in the 1-3 billion parameter range that answers specific queries. For general knowledge, it remains to be seen how big or small such models need to be.