[D] Why do we train language models with next word prediction instead of some kind of reinforcement learning-like setup? Submitted by blazejd t3_yzzxa2 on November 20, 2022 at 9:30 AM in MachineLearning 34 comments 18
victotronics t1_ix5g254 wrote on November 20, 2022 at 10:01 PM Children pick up on rules and then extrapolate them. "He bringed this to me". I don't think an AI will generate that since it has no general rules that it tries to apply to a special case. Permalink 2
Viewing a single comment thread. View all comments