Submitted by besabestin t3_10lp3g4 in MachineLearning
golongandprosper t1_j67dc19 wrote
I read an article saying it’s so good because they hired “almost slaves” from some downtrodden country at the lowest possible price. $2 was the rate; I don’t know if that’s per day or per hour.
And hundreds to thousands of these serfs spent their days testing and manually training it. So they apparently got hundreds of thousands of hours of manual human training, at a price that many Americans could afford by taking out a mortgage on their house. Apparently they are still there, manually watching and reacting to queries in real time to verify that the answers are decent, while the rest of the world gives them more data for free.
So when it says the servers are busy and to wait? That could mean the humans are busy ;p
visarga t1_j6c0o3e wrote
I very much doubt they do this in real time. The model is responding too fast for that.
They are probably used for RLHF model alignment: keeping it polite, helpful and harmless, and generating more samples of solved tasks by vetting our ChatGPT interaction logs, by using the model from the console like we do, or by writing the answers themselves where the model fails.