[R][D] Overlooked AI/ML papers from 2022 Submitted by Neurosymbolic t3_1027qvv on January 3, 2023 at 1:06 PM in MachineLearning 8 comments 3
gamerx88 t1_j2vzjfx wrote on January 4, 2023 at 9:15 AM "An empirical analysis of compute-optimal large language model training" by Deepmind, suggesting that LLMs are over-parameterized or under-trained (insufficient data used in training). Permalink 2
Viewing a single comment thread. View all comments