Submitted by [deleted] t3_11v4h5z in MachineLearning
Smallpaul t1_jcsah9r wrote
Reply to comment by RoyalCities in [P] The next generation of Stanford Alpaca by [deleted]
I think the new model gets most of its knowledge from its original model and the training is mostly about how to act like a RLHF model.
philipgutjahr t1_jctbs35 wrote
which can make a huge difference: GPT-3 + RLHF = Chat-GPT
Viewing a single comment thread. View all comments