Submitted by Vegetable-Skill-9700 t3_121a8p4 in MachineLearning
wrossmorrow t1_jdmsbvf wrote
Reply to comment by shanereid1 in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
Probably related: LoRA (Low-Rank Adaptation of Large Language Models), https://arxiv.org/abs/2106.09685
fiftyfourseventeen t1_jdngwum wrote
Eh... not really. That's training a low-rank update on top of the frozen pretrained weights, not actually making the model smaller.
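To put rough numbers on that point, here is a minimal sketch of the parameter count for a single weight matrix under a LoRA-style setup, where the frozen matrix W (d_out x d_in) gets an added update B @ A with B of shape d_out x rank and A of shape rank x d_in. The function name `lora_param_counts`, the 4096 x 4096 layer size, and rank 8 are illustrative assumptions, not figures from the paper or the thread.

```python
def lora_param_counts(d_out: int, d_in: int, rank: int) -> dict:
    """Count parameters for one weight matrix W (d_out x d_in) plus a low-rank update B @ A.

    Hypothetical helper for illustration only.
    """
    frozen = d_out * d_in              # W stays in the model, untouched
    trainable = rank * (d_out + d_in)  # B is d_out x rank, A is rank x d_in
    return {
        "frozen": frozen,
        "trainable": trainable,
        "total_during_training": frozen + trainable,
        "total_after_merge": frozen,   # W + B @ A has the same shape as W
    }


# Assumed example sizes: a 4096 x 4096 layer with rank 8.
counts = lora_param_counts(d_out=4096, d_in=4096, rank=8)
print(counts)
print("trainable fraction:", counts["trainable"] / counts["frozen"])
```

The trainable part is tiny compared with W, but W itself is still there in full, so the model you run at inference time is the same size as before (or slightly larger if the update is kept separate rather than merged).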