Submitted by These-Assignment-936 t3_10y2mu0 in MachineLearning
Better_Ad4061 t1_j7xyb2r wrote
I'm trying to make a decision transformer, but I can't quite figure out how to prompt it. I trained it on a chess dataset of (state, reward, move) but I don't know how to "prompt" it with the reward I would like.
visarga t1_j7yc08k wrote
You prompt it by reward. Let's say your top reward is 1.
you predict model(past history, state, 1) -> move
Viewing a single comment thread. View all comments