justheuristic t1_j02ohk6 wrote
Reply to comment by dojoteef in [D] Are there any distributed model training services similar to, e.g. Folding@Home? by genuinelySurprised
The first link (petals) is about finetuning.
Others (e.g. distributed diffusion) involve training from scratch -- but they deal with smaller models. Thing is, you need a lot of people to train a 100B model from scratch. Like, a few hundred online on average. There aren't many communities that can do that. In turn, with finetuning, you can see it work more immediately.
I've heard a talk by Colin Raffel where he proposed an alternative view where instead of training from scratch, an open-source community could gradually improve the model over time. Like github, but for large models. A contributor can fine-tune for a task, then create a "pull-request", then maintainer runs a special procedure to merge the model without forgetting other tasks. That's how I remember it, anyways.
Viewing a single comment thread. View all comments