Viewing a single comment thread. View all comments

Smallpaul t1_jcsah9r wrote on March 19, 2023 at 4:13 AM

I think the new model gets most of its knowledge from its original model and the training is mostly about how to act like a RLHF model.

philipgutjahr t1_jctbs35 wrote on March 19, 2023 at 12:18 PM

which can make a huge difference: GPT-3 + RLHF = Chat-GPT