Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. Submitted by grungabunga t3_11bb3l3 on February 25, 2023 at 3:45 AM in singularity 127 comments 140
Spire_Citron t1_ja05kyo wrote on February 25, 2023 at 9:16 PM Reply to comment by MadDragonReborn in Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. by grungabunga Yup. I think if anything this shows it probably wasn't individually programmed to respond to particular things and is just making its judgements based on the hate that it sees in its data. Permalink Parent −1−
Viewing a single comment thread. View all comments