Submitted by l33thaxman t3_11ryc3s in deeplearning

Recently, the LLaMA models by Meta were released. What makes these models so exciting is that, despite being over 10X smaller than GPT-3 and small enough to run on consumer hardware, popular benchmarks show they perform as well as or better than GPT-3!

This increased performance appears to come from training on a much larger number of tokens.

By following along with the video tutorial and open-source code, you can now fine-tune these powerful models on your own dataset to further extend their abilities!

https://youtu.be/d4Cnv_g3GiI
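For anyone who wants a rough idea in writing before watching: below is a minimal sketch of one common approach, LoRA fine-tuning with the Hugging Face transformers, peft, and datasets libraries. The model path, dataset file, and hyperparameters here are illustrative placeholders, not necessarily what the video uses.

```python
# Minimal LoRA fine-tuning sketch for a LLaMA-style causal LM.
# Assumes: transformers, peft, datasets installed, and local access to
# LLaMA weights (the path below is a placeholder).
import torch
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
    DataCollatorForLanguageModeling,
)
from peft import LoraConfig, get_peft_model
from datasets import load_dataset

model_name = "path/to/llama-7b"  # placeholder: your converted LLaMA weights

tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA has no pad token by default

# Loaded in full precision here for simplicity; memory-constrained setups
# typically load in 8-bit via bitsandbytes instead.
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

# LoRA trains small low-rank adapters instead of all base weights,
# which is what makes fine-tuning feasible on consumer GPUs.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # LLaMA attention projections
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Placeholder dataset: a plain-text file, one training example per line.
dataset = load_dataset("text", data_files={"train": "my_data.txt"})["train"]

def tokenize(example):
    return tokenizer(example["text"], truncation=True, max_length=512)

dataset = dataset.map(tokenize, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="llama-lora-out",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        num_train_epochs=3,
        learning_rate=2e-4,
        fp16=True,
        logging_steps=10,
    ),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("llama-lora-out")  # writes only the small adapter weights
```

Saving the PEFT model stores just the adapter weights (a few MB), which you later load on top of the base model for inference.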

40

Comments


vini_2003 t1_jcb90zy wrote

You wrote that description with the model, didn't you?

3

DingWrong t1_jcc3axk wrote

Is there a written version? I like reading.

2