Superschlenz t1_j0es4dr wrote

Why does ChatGPT need explicit feedback?

Why don't they just run sentiment analysis on the user prompts and use the result as the reward? For safety, they would also have to classify users as good or evil and invert the rewards coming from the evil ones.
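To make the idea concrete, here is a rough sketch of what I mean. Everything in it is made up for illustration: the word lists, the `sentiment` scorer, the `implicit_reward` function, and the `user_is_malicious` flag are all hypothetical, and a real system would presumably use trained classifiers rather than keyword matching.

```python
# Toy word lists (assumption); a real system would use a trained sentiment model.
POSITIVE = {"thanks", "great", "perfect", "helpful", "good"}
NEGATIVE = {"wrong", "useless", "bad", "terrible"}


def sentiment(text: str) -> float:
    """Return a score in [-1, 1]: (positive hits - negative hits) / total hits."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


def implicit_reward(user_prompt: str, user_is_malicious: bool) -> float:
    """Reward inferred from the sentiment of a user prompt.

    user_is_malicious stands in for the good/evil user classifier (assumption);
    rewards from users flagged as malicious are inverted.
    """
    score = sentiment(user_prompt)
    return -score if user_is_malicious else score


print(implicit_reward("Thanks, that was great!", user_is_malicious=False))
print(implicit_reward("Thanks, that was great!", user_is_malicious=True))
```

The first call gives a positive reward and the second gives the same reward with the sign flipped, which is the inversion step described above.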
