csreid t1_iwlyxnz wrote

>Also, I can request up to 372GB VRAM, is there any large language model (#parameters > 100B) that I can actually download and run "locally"?

I've never done anything non-trivial with LLMs, but even with 32-bit floats, 100B parameters should take 400 gigs of RAM, right?
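
The arithmetic checks out. A quick back-of-the-envelope in Python (the precisions listed are just the common ones, and this counts only the weights, not activations or other overhead):

```python
# Memory needed just to hold the model weights, ignoring
# activations, KV cache, and framework overhead.
BYTES_PER_PARAM = {"fp32": 4, "fp16/bf16": 2, "int8": 1}

n_params = 100e9  # 100B parameters

for dtype, nbytes in BYTES_PER_PARAM.items():
    gb = n_params * nbytes / 1e9
    print(f"{dtype}: {gb:,.0f} GB")

# fp32: 400 GB, fp16/bf16: 200 GB, int8: 100 GB
```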

SJ5125 t1_iwwfez9 wrote

Most would use bfloat16 for LLMs, which halves that footprint to ~200 GB.
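
For reference, a sketch of how that typically looks with Hugging Face Transformers (the model name is a placeholder, not a real checkpoint):

```python
# Sketch: loading weights in bfloat16 instead of fp32.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "some-org/some-100b-model",   # placeholder checkpoint name
    torch_dtype=torch.bfloat16,   # 2 bytes per parameter instead of 4
    device_map="auto",            # shard across available GPUs (needs accelerate)
)
```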
