Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

Overview
Submissions
Comments

Chemont

[R] Is there any research on allowing Transformers to spent more compute on more difficult to predict tokens?

Submitted by Chemont t3_109z8om on January 12, 2023 at 1:07 PM in MachineLearning

16 comments

19

Chemont

Registered on January 14, 2019

t2_2ybraupw

Running Postmill