Submitted by lmtog t3_10zix8k in MachineLearning
lmtog OP t1_j84vc3x wrote
Reply to comment by thiru_2718 in [D] Transformers for poker bot by lmtog
Thats what I'am not quite sure about. I assume the result would not be close to the nash equilibrium.
But I don't know since I have not worked with transformers before.
I think it comes down to, can we train a transformer with feedback on what hands were good and which ones were not. Looking at other responses it seems like that is not possible.
Viewing a single comment thread. View all comments