/f/MachineLearning

[R] Is there any research on allowing Transformers to spend more compute on harder-to-predict tokens?

Submitted by Chemont t3_109z8om on January 12, 2023 at 1:07 PM in MachineLearning

16 comments

19

[deleted] t1_j41r6kf wrote on January 12, 2023 at 4:08 PM

[deleted]
