Acceptable-Cress-374 t1_j62qh5g wrote
Reply to comment by ElectronicCress3132 in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents
Thank you for putting it into words, I was having trouble understanding this myself.
Viewing a single comment thread. View all comments