Submitted by spiritus_dei t3_10tlh08 in MachineLearning
spiritus_dei OP t1_j77u2ic wrote
Reply to comment by sarabjeet_singh in [D] Are large language models dangerous? by spiritus_dei
That might be why RLHF (reinforcement learning by human feedback) is ultimately doomed to fail.
Viewing a single comment thread. View all comments