ortegaalfredo OP t1_jbov7dl wrote on March 10, 2023 at 4:23 PM

I tried the 8-bit version; 4-bit doesn't work for me yet, for some reason.

The problem is that those are very slow, about 1 token/s, whereas with 13B I'm getting 100 tokens/s.