Nalmyth OP t1_j2ltjg9 wrote on January 2, 2023 at 7:13 AM

Reply to comment by Ortus14 in Alignment, Anger, and Love: Preparing for the Emergence of Superintelligent AI by Nalmyth

Looking at how they currently do it (manually lobotomising) I'm not sure they are really ready, or using AI to help as much as you think they are.

Ortus14 t1_j2luhse wrote on January 2, 2023 at 7:25 AM

From their website:"Our approach to aligning AGI is empirical and iterative. We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI system that can help us solve all other alignment problems."

https://openai.com/blog/our-approach-to-alignment-research/

ChatGTP has some alignment in avoiding racist and sexist behavior, as well as many other human morals. They have to use some Ai to help with that alignment because there's no way they could manually teach it all possible combinations of words that are racist and sexist.