Submitted by Desi___Gigachad t3_126rgih in MachineLearning
There is this research post by Neel Nanda which states that Othello-GPT has a linear emergent world representation. What does it mean (as I am mostly a novice) and what do you all think about it?
link :- https://www.neelnanda.io/mechanistic-interpretability/othello
step21 t1_jearoln wrote
It means he says it has a representation of its world, not just statistics. He may or may not be right. (Also I didn’t read all of it yet, fing long.