I_will_delete_myself t1_jb532v5 wrote
Reply to comment by Philpax in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng
Intelligence is the ability to turn complex information into a simple explanation that a child can understand.
It makes me skeptical when someone doesn't explain anything beyond performance reasons. Most people just use the cloud because ML networks, regardless of size, use a lot of battery.
Philpax t1_jb53nhe wrote
As far as I can tell, the sparse documentation is just because they've been in pure R&D mode. I've played around with it in their Discord server and can confirm it does perform well, but I've struggled to get it working locally.