Submitted by Destiny_Knight t3_11tab5h in singularity
Yomiel94 t1_jcj6i7w wrote
Reply to comment by Intrepid_Meringue_93 in Those who know... by Destiny_Knight
That’s not the whole story. Facebook trained the model, its weights were leaked, and the Stanford guys fine-tuned it to make it function more like ChatGPT. Fine-tuning is easy.
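For anyone curious what "easy" means in practice: below is a minimal sketch of the kind of parameter-efficient (LoRA) fine-tuning people apply to LLaMA-class checkpoints. This is not the Stanford recipe; the checkpoint name, data file, and hyperparameters are illustrative placeholders.

    # Minimal sketch of LoRA fine-tuning on a LLaMA-style checkpoint.
    # Checkpoint name, data file, and hyperparameters are illustrative placeholders.
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer, TrainingArguments)
    from peft import LoraConfig, get_peft_model
    from datasets import load_dataset

    base = "huggyllama/llama-7b"  # illustrative checkpoint name
    tok = AutoTokenizer.from_pretrained(base)
    tok.pad_token = tok.eos_token
    model = AutoModelForCausalLM.from_pretrained(base, device_map="auto")

    # Freeze the base model; train only small low-rank adapter matrices.
    model = get_peft_model(model, LoraConfig(
        task_type="CAUSAL_LM", r=8, lora_alpha=16, lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"]))

    def to_features(ex):
        # Instruction/response pairs flattened into one training string.
        text = f"### Instruction:\n{ex['instruction']}\n\n### Response:\n{ex['output']}"
        return tok(text, truncation=True, max_length=512)

    train = (load_dataset("json", data_files="instructions.json")["train"]
             .map(to_features, remove_columns=["instruction", "output"]))

    Trainer(
        model=model,
        args=TrainingArguments(output_dir="out", per_device_train_batch_size=4,
                               num_train_epochs=3, learning_rate=2e-4),
        train_dataset=train,
        data_collator=DataCollatorForLanguageModeling(tok, mlm=False),  # labels = input_ids
    ).train()

Only the adapter weights get updated, which is why this kind of run fits on a single consumer GPU rather than a training cluster.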
CypherLH t1_jcjakya wrote
All You Need Is Fine-Tuning
vegita1022 t1_jcks65e wrote
Imagine where you'll be two more papers down the line!
[deleted] t1_jcob97a wrote
I hope so; that would mean it could run on 16GB of RAM with a CPU or a consumer GPU 😍
cartmanOne t1_jcof1cw wrote
What a time to be alive!!
CellWithoutCulture t1_jcjku3z wrote
The specific type of fine-tuning was called Knowledge Distillation, I believe. ChatGPT taught LLaMA to chat, "stealing" OpenAI's business edge in the process.
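In practice, the Alpaca-style version of this amounts to collecting ChatGPT's answers to a set of instructions and using the (instruction, answer) pairs as supervised fine-tuning data for the smaller model. A minimal sketch, assuming the legacy (pre-1.0) OpenAI Python client; the seed instructions and output file are illustrative.

    # Sketch of Alpaca-style "distillation": use ChatGPT outputs as supervised
    # training data for a smaller student model. Prompts and file name are illustrative.
    import json
    import openai  # legacy (<1.0) client interface

    seed_instructions = [
        "Explain what a hash table is in two sentences.",
        "Write a haiku about gradient descent.",
    ]

    examples = []
    for instruction in seed_instructions:
        resp = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=[{"role": "user", "content": instruction}],
        )
        examples.append({
            "instruction": instruction,
            "output": resp["choices"][0]["message"]["content"],
        })

    # The resulting (instruction, output) pairs become the fine-tuning set
    # for the student model (e.g. LLaMA 7B in Alpaca's case).
    with open("instructions.json", "w") as f:
        json.dump(examples, f, indent=2)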
visarga t1_jcjornh wrote
Everyone does it; they all exfiltrate valuable data from OpenAI. You can use it directly, as Alpaca did, or for pre-labelling, or for detecting mislabeled examples.
They train code models by asking GPT-3 to explain code snippets, then training a model in the other direction to generate code from the description. That data can be used to fine-tune a code model for your specific domain of interest.
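A rough sketch of that reverse-direction trick, again assuming the legacy OpenAI Python client; the snippet, prompt wording, and file names are illustrative.

    # Sketch of the "reverse direction" trick: ask a large model to explain existing
    # code, then flip the pairs so a smaller model learns description -> code.
    import json
    import openai  # legacy (<1.0) client interface

    snippets = [
        "def dedupe(xs):\n    return list(dict.fromkeys(xs))",
    ]

    pairs = []
    for code in snippets:
        resp = openai.Completion.create(
            model="text-davinci-003",
            prompt=f"Explain what this code does in one sentence:\n\n{code}\n\nExplanation:",
            max_tokens=80,
        )
        description = resp["choices"][0]["text"].strip()
        # Flip the pair: the description becomes the prompt, the code the target.
        pairs.append({"prompt": description, "completion": code})

    with open("code_pairs.jsonl", "w") as f:
        for p in pairs:
            f.write(json.dumps(p) + "\n")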