Submitted by lifesthateasy t3_11ktxjl in MachineLearning
I've finally pulled the trigger on a 4090 that'll arrive by the end of this week after ages with a 1050, and besides throwing everything ray traced at it, I also want to use it to train some deep learning models.
I do know the talk of the town, LLMs, are waaay too big to be trained on such a card (iirc ChatGPT was trained on over a thousand datacenter GPUs), but I was wondering if there are some neat DIY projects I could set up and train in a human amount of time (something that's not neural style transfer, which already ran on the 1050 too).
FYI I'm not specifically looking for language modeling, ChatGPT was just an example of a model that'd definitely be too big.
Disastrous_Elk_6375 t1_jb8y5r2 wrote
GPT-NeoX should fit with 8-bit quantization and short prompt sizes. GPT-J-6B should fit as well with 16-bit inference. On smaller models you might even be able to do some finetuning for fun.
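To see why those models fit on a 24 GB card, a rough back-of-envelope estimate is bytes-per-parameter times parameter count for the weights alone (this sketch ignores activations, KV cache, and optimizer state, which matter a lot for finetuning):

```python
def weight_vram_gb(n_params_billion: float, bytes_per_param: float) -> float:
    """Rough VRAM needed just for model weights, in GiB."""
    return n_params_billion * 1e9 * bytes_per_param / 1024**3

# GPT-J-6B in fp16 (2 bytes/param): ~11 GiB -> fits on a 24 GB 4090
print(round(weight_vram_gb(6, 2), 1))

# GPT-NeoX-20B in int8 (1 byte/param): ~19 GiB -> tight but fits for inference
print(round(weight_vram_gb(20, 1), 1))
```

Note this is a lower bound: inference also needs room for activations and the attention KV cache, which is why the comment above stresses keeping prompt sizes small for the 20B model.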
There are a couple of coding models from Salesforce (the CodeGen family) that you could fit comfortably. Check out FauxPilot for a Copilot "clone".