XagentVFX t1_j5tamgf wrote

The word-predictor neural net is only part of a Transformer's architecture; it also has an attention network that produces context vectors. This is a much bigger deal, because it shows the AI is building and understanding context. That is understanding.
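To make "context vectors" concrete, here is a rough sketch of single-head scaled dot-product self-attention in NumPy. The names `self_attention`, `Wq`, `Wk`, `Wv` and the random toy inputs are all my own assumptions for illustration, not taken from any particular model; real Transformers add multiple heads, masking, and learned weights.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max first for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    # Hypothetical helper: X is (tokens, dim); Wq/Wk/Wv are projection matrices.
    Q = X @ Wq  # what each token is looking for
    K = X @ Wk  # what each token offers
    V = X @ Wv  # the content that gets mixed together
    # Similarity of every token with every other token, scaled by sqrt(key dim).
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    weights = softmax(scores, axis=-1)  # each row sums to 1
    # Each output row is a context vector: a weighted mix of all value vectors.
    return weights @ V, weights

# Toy input: 4 tokens with 8-dimensional embeddings (random, for illustration only).
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))

context, weights = self_attention(X, Wq, Wk, Wv)
print(context.shape)  # (4, 8): one context vector per token
```

Each row of `context` blends information from every token in the sequence, weighted by how relevant the attention scores say it is. That blended representation is what the prediction layers downstream actually see.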