Submitted by faker10101891 t3_10cxuo2 in MachineLearning
I'm aware transformers are pretty VRAM-hungry and a 4080 only has 16 GB, so I'm guessing a lot of transformer-based models will be out of the question, at least the interesting ones.
Not sure about other models though. Is there anything I can do with a 4080 that's beyond just some toy experiment?
currentscurrents t1_j4ijlqv wrote
You can fine-tune image generator models and some smaller language models.
You can also do tasks that don't require super large models, like image recognition.
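For a rough sense of what "smaller" means on a 16 GB card, here is a back-of-the-envelope sketch. The byte counts are rule-of-thumb assumptions, not measurements: about 2 bytes per parameter for fp16 inference, and about 16 bytes per parameter for full fine-tuning with Adam in mixed precision. Activations, batch size, and framework overhead are ignored, so real usage will be higher. The names `estimate_gib` and `fits` are made up for this example.

```python
# Back-of-the-envelope VRAM estimate.
# All byte counts are assumptions (rules of thumb); activations and
# framework overhead are NOT included, so treat results as lower bounds.

GIB = 1024 ** 3

# Assumed bytes of GPU memory per model parameter.
BYTES_PER_PARAM = {
    # fp16 weights only
    "fp16_inference": 2,
    # fp16 weights + fp16 grads + fp32 master weights + two fp32 Adam moments
    "adam_mixed_precision": 16,
}


def estimate_gib(n_params, mode):
    """Approximate memory in GiB for weights (and optimizer state, if training)."""
    return n_params * BYTES_PER_PARAM[mode] / GIB


def fits(n_params, mode, vram_gib=16.0):
    """True if the estimate is within the given VRAM budget (default: 16 GB card)."""
    return estimate_gib(n_params, mode) <= vram_gib


if __name__ == "__main__":
    for n_params in (125e6, 1e9, 7e9):
        for mode in BYTES_PER_PARAM:
            gib = estimate_gib(n_params, mode)
            verdict = "fits" if fits(n_params, mode) else "does not fit"
            print(f"{n_params / 1e9:.3f}B params, {mode}: {gib:.1f} GiB ({verdict})")
```

Under these assumptions, a model around 1B parameters is near the limit for full fine-tuning, while a 7B model only fits for fp16 inference. Parameter-efficient methods or quantization would shift those numbers, but they are outside this sketch.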
>that's beyond just some toy experiment?
Don't knock toy experiments too much! I'm having a lot of fun trying to build a differentiable neural computer or memory-augmented network in PyTorch.