Viewing a single comment thread. View all comments

Singularian2501 OP t1_iwnpy8m wrote

Yes they mentioned it at the end of their blog article. But I think it was only meant as an example how better sample efficiency could be achieved and not SOTA related.

1

13ass13ass t1_iwo4lan wrote

Efficient zero is for RL with atari games though. How does it apply to things like large language models?

5

lostmsu t1_iws6anl wrote

The point is there are many models that use the same technique.

3