Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

/f/MachineLearning

[P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM

github.com

Submitted by Amazing_Painter_7692 t3_11pmz69 on March 12, 2023 at 7:13 PM in MachineLearning

51 comments

320

Viewing a single comment thread. View all comments

APUsilicon t1_jc0zbtj wrote on March 13, 2023 at 6:27 AM

oooh, I've been getting trash responses from opt-6.7b hopefully this is better.

Permalink

1

0 points (+0, −0)

Short URL:

http://forum.junglegym.ai/120502

MachineLearning

t5_2r3gv

Created October 1, 2022
Subscribe via RSS

Toolbox

Bans
Moderation log

Running Postmill