[R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python
Submitted by bo_peng on March 5, 2023 at 1:11 PM in MachineLearning
estrafire wrote on March 13, 2023 at 5:15 PM, in reply to a comment by bo_peng:
Any particular reason for moving from CNN to RNN?