Better_Ad4061 t1_j7xyb2r wrote on February 10, 2023 at 4:45 AM

I'm trying to make a decision transformer, but I can't quite figure out how to prompt it. I trained it on a chess dataset of (state, reward, move) but I don't know how to "prompt" it with the reward I would like.

visarga t1_j7yc08k wrote on February 10, 2023 at 7:16 AM

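You prompt it with the reward you want. Let's say your top reward is 1.

Then at inference you predict model(past history, state, 1) -> move, i.e. you put the target reward in the slot where the training data had the actual reward.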
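
To make that concrete, here is a minimal sketch of reward prompting. It is not a real decision transformer: `ToyRewardConditionedModel`, `prompt`, and the tiny `DATASET` are all made up for illustration, and the "model" is just a lookup table that counts which move was played for each (state, reward) pair. The only point is the calling pattern: train on the observed reward, then pass in the reward you want at inference.

```python
from collections import Counter, defaultdict

# Hypothetical toy data in the (state, reward, move) layout from the question.
DATASET = [
    ("start", 1, "e4"),
    ("start", 1, "e4"),
    ("start", 1, "d4"),
    ("start", 0, "f3"),
    ("start", 0, "g4"),
    ("start", 0, "f3"),
]


class ToyRewardConditionedModel:
    """Stand-in for a trained model: counts moves per (state, reward) pair."""

    def __init__(self, dataset):
        self.counts = defaultdict(Counter)
        for state, reward, move in dataset:
            self.counts[(state, reward)][move] += 1

    def predict(self, history, state, target_reward):
        # This toy ignores history; a real sequence model would condition on it.
        moves = self.counts.get((state, target_reward))
        if not moves:
            return None  # never saw this (state, reward) pair in training
        return moves.most_common(1)[0][0]


def prompt(model, history, state, target_reward=1):
    """Ask for a move while conditioning on the reward we would like to get."""
    return model.predict(history, state, target_reward)


model = ToyRewardConditionedModel(DATASET)
best_move = prompt(model, history=[], state="start", target_reward=1)
worst_move = prompt(model, history=[], state="start", target_reward=0)
print(best_move, worst_move)
```

Swapping `target_reward` between 1 and 0 changes which move comes back, which is the whole trick: the reward is an input you choose, not something the model predicts.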