What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] Submitted by Destiny_Knight t3_118svv7 on February 22, 2023 at 8:27 AM in singularity 194 comments 493
Yngstr t1_j9jzzbv wrote on February 22, 2023 at 3:04 PM Am I reading this wrong? Is the dataset used to train this model the same dataset used to test it? Not saying that's not a valid method, but that certainly makes it less impressive vs generalist models that can still get decent scores... Permalink 7
Viewing a single comment thread. View all comments