Superschlenz t1_j0es4dr wrote on December 16, 2022 at 2:47 AM

Why does ChatGPT need explicit feedback?

Why don't they just perform sentiment analysis on the user prompts as the reward? For safety they would also have to classify the users into good/evil and invert the rewards from the latter.