Viewing a single comment thread. View all comments

gurenkagurenda t1_jebq66o wrote

The research did, which is a bit different. I don't see why this would be a violation of the TOS though. I don't see anything in there about using model outputs to train other models. The closest would be:

> reverse assemble, reverse compile, decompile, translate or otherwise attempt to discover the source code or underlying components of models, algorithms, and systems of the Services

But that's not the same thing. Training your own model on ChatGPT outputs won't result in anything like the same source code, algorithms, or model weights as ChatGPT.

25

VelveteenAmbush t1_jec59yj wrote

> I don't see why this would be a violation of the TOS though.

It's this section:

> (c) Restrictions. You may not ... (iii) use output from the Services to develop models that compete with OpenAI;

36

Lemonio t1_jectj5y wrote

It’s interesting because ChatGPT trained their models on data of companies they will now be competing with

24

VelveteenAmbush t1_jecx9ar wrote

Like, Google's data? Or which OpenAI competitor are you thinking about?

6

Lemonio t1_jed6pv1 wrote

Sure, google, Reddit, most big sites on the internet I imagine

13

VelveteenAmbush t1_jegqlm8 wrote

They're competing with Google but Google doesn't publish a lot of text as far as I know.

I don't see how they're a competitor to Reddit.

1

Lemonio t1_jegzrpl wrote

Well people go to both google and Reddit to find answers to questions which is a use case many people could use ChatGPT for

1

ghostinshell000 t1_jebtxlv wrote

pretty much and alot of the Open source models have opensource like licenses on them.

4

Afaren42 t1_jedtu84 wrote

Stanford university used this exact method to train alpaca, an ai, based on llama. Google is doing the exact same thing.

4