Thats what I'am not quite sure about. I assume the result would not be close to the nash equilibrium.
But I don't know since I have not worked with transformers before.
I think it comes down to, can we train a transformer with feedback on what hands were good and which ones were not. Looking at other responses it seems like that is not possible.
But technically it should be possible to train the model on hands, in the mentioned representation, and get an output that would be a valid poker play?
lmtog OP t1_j84vc3x wrote
Reply to comment by thiru_2718 in [D] Transformers for poker bot by lmtog
Thats what I'am not quite sure about. I assume the result would not be close to the nash equilibrium.
But I don't know since I have not worked with transformers before.
I think it comes down to, can we train a transformer with feedback on what hands were good and which ones were not. Looking at other responses it seems like that is not possible.