[D] Trying to find paper about n-grams in early transformer layers
Submitted by soraki_soladead on December 15, 2022 at 4:13 PM in MachineLearning
prohitman wrote on December 15, 2022 at 10:02 PM (reply to comment by Rabrg):
This is a really interesting article!