bo_peng OP t1_jcjuhix wrote
Reply to comment by blueSGL in [R] RWKV 14B ctx8192 is a zero-shot instruction-follower without finetuning, 23 token/s on 3090 after latest optimization (16G VRAM is enough, and you can stream layers to save more VRAM) by bo_peng
Yes, ChatRWKV v2 supports that :)
Take a look at the "strategy" guide: https://pypi.org/project/rwkv/