Mr_Smartypants t1_ir7rbxp wrote
Reply to comment by harharveryfunny in [R] Discovering Faster Matrix Multiplication Algorithms With Reinforcement Learning by EducationalCicada
At the end of RL training, they don't just have an efficient matrix multiplication algorithm (sequence of steps), they also have the policy they learned.
I don't know what that adds, though. Maybe it will generalize over input size?
Viewing a single comment thread. View all comments