[R] Is there any research on allowing Transformers to spent more compute on more difficult to predict tokens? Submitted by Chemont t3_109z8om on January 12, 2023 at 1:07 PM in MachineLearning 16 comments 19
[deleted] t1_j42hduk wrote on January 12, 2023 at 6:49 PM Reply to comment by tdgros in [R] Is there any research on allowing Transformers to spent more compute on more difficult to predict tokens? by Chemont [removed] Permalink Parent −4−
Viewing a single comment thread. View all comments