gurenkagurenda t1_jebq66o wrote
Reply to comment by ghostinshell000 in A top AI researcher reportedly left Google for OpenAI after sharing concerns the company was training Bard on ChatGPT data by jack_lafouine
The research did, which is a bit different. I don't see why this would be a violation of the TOS though. I don't see anything in there about using model outputs to train other models. The closest would be:
> reverse assemble, reverse compile, decompile, translate or otherwise attempt to discover the source code or underlying components of models, algorithms, and systems of the Services
But that's not the same thing. Training your own model on ChatGPT outputs won't result in anything like the same source code, algorithms, or model weights as ChatGPT.
VelveteenAmbush t1_jec59yj wrote
> I don't see why this would be a violation of the TOS though.
It's this section:
> (c) Restrictions. You may not ... (iii) use output from the Services to develop models that compete with OpenAI;
Lemonio t1_jectj5y wrote
It’s interesting because ChatGPT trained their models on data of companies they will now be competing with
VelveteenAmbush t1_jecx9ar wrote
Like, Google's data? Or which OpenAI competitor are you thinking about?
Lemonio t1_jed6pv1 wrote
Sure, google, Reddit, most big sites on the internet I imagine
VelveteenAmbush t1_jegqlm8 wrote
They're competing with Google but Google doesn't publish a lot of text as far as I know.
I don't see how they're a competitor to Reddit.
Lemonio t1_jegzrpl wrote
Well people go to both google and Reddit to find answers to questions which is a use case many people could use ChatGPT for
gurenkagurenda t1_jecg2gi wrote
Ah, I see, I missed that.
ghostinshell000 t1_jebtxlv wrote
pretty much and alot of the Open source models have opensource like licenses on them.
Afaren42 t1_jedtu84 wrote
Stanford university used this exact method to train alpaca, an ai, based on llama. Google is doing the exact same thing.
Viewing a single comment thread. View all comments