Viewing a single comment thread. View all comments

SpaceCockatoo t1_jblj2so wrote on March 9, 2023 at 10:13 PM

4bit quant already out

ortegaalfredo OP t1_jbov7dl wrote on March 10, 2023 at 4:23 PM

Tried the 8bit, 4bit for some reason don't work yet for me.

Problem is, those are very very slow, about 1 token/sec, compared with 13B I'm getting 100 tokens/s