unua_nomo t1_j2eyhnh wrote on December 31, 2022 at 7:40 PM

Reply to comment by misconfigbackspace in There's now an open source alternative to ChatGPT, but good luck running it by ravik_reddit_007

I mean there are already open source datasets available, such as the Pile.

I can't see any argument for why a model derived from open source data wouldn't likewise be open source. At that point, even if you could argue that an ML model could produce IP-infringing content, that would be the responsibility of the individual producing and subsequently distributing that content.

As for data becoming stale, that wouldn't be an issue for plenty of applications, and even where it is, there's no reason you couldn't just crowdfund 80k a year to train an updated model with newer content folded in.

unua_nomo t1_j2enydh wrote on December 31, 2022 at 6:27 PM

Reply to comment by misconfigbackspace in There's now an open source alternative to ChatGPT, but good luck running it by ravik_reddit_007

Crowdsource the funding, not the content the model is trained on.

unua_nomo t1_j2e5rp7 wrote on December 31, 2022 at 4:25 PM

Reply to There's now an open source alternative to ChatGPT, but good luck running it by ravik_reddit_007

I mean, honestly, it wouldn't even be that hard to crowdsource the training of an open source model, right?