DigThatData t1_j61zv3l wrote
Reply to comment by currentscurrents in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents
Compute Is All You Need
endless_sea_of_stars t1_j627a9m wrote
Just rent out an AWS region for a month and you'll be good to go. Hold a couple bake sales to defray the cost.
Viewing a single comment thread. View all comments