[D] Trying to find paper about n-grams in early transformer layers Submitted by soraki_soladead t3_zmoxp7 on December 15, 2022 at 4:13 PM in MachineLearning 9 comments 28
soraki_soladead OP t1_j0gemzb wrote on December 16, 2022 at 1:30 PM Reply to comment by 2600_yay in [D] Trying to find paper about n-grams in early transformer layers by soraki_soladead It isn’t but interesting paper! Permalink Parent 2
Viewing a single comment thread. View all comments