rikiiyer t1_izew1lv wrote on December 8, 2022 at 4:35 PM

I listened to Noam’s conversation with Lex Friedman the other day and he made the point that the model had to learn human like tendencies in order to work with humans to win at Diplomacy. Do you think it would be possible to use these learned features to somehow teach other models how to act more human-like?

MetaAI_Official OP t1_izfoej8 wrote on December 8, 2022 at 7:38 PM

The learned features are specific to the game of the Diplomacy because the data we used is specific to the game of Diplomacy, but the ideas can be transferred to other domains. Rather than just learning Diplomacy by playing against itself, the AI used a model trained on human games both to guide exploration during training (sampling moves from this model during self-play) as well as during planning (consider what actions humans are likely to take). It's not always obvious exactly how to apply this, but we think there's exciting opportunities for research in this space! -AM