[D] Why are so many tokens needed to train large language models? Submitted by blacklemon67 t3_11misax on March 9, 2023 at 4:35 AM in MachineLearning 17 comments 12
frequenttimetraveler t1_jbljl97 wrote on March 9, 2023 at 10:17 PM Have they tried to train the same model with half the tokens?