Submitted by fintechSGNYC t3_1095os9 in MachineLearning
--algo t1_j40j84w wrote
Reply to comment by GeoLyinX in [D] Microsoft ChatGPT investment isn't about Bing but about Cortana by fintechSGNYC
We are both right and wrong. To be pedantic, both come from this paper, but with different training data: https://arxiv.org/abs/2203.02155
Hyper1on t1_j43crwx wrote
That's the InstructGPT paper, which is right for ChatGPT, but Copilot is based on Codex, which does not use RLHF.
--algo t1_j43rpre wrote
Are you sure? This implies otherwise: https://openai.com/blog/instruction-following/
But maybe it's only for the non-Codex models.
Hyper1on t1_j43wyf3 wrote
You can see the full details here: https://beta.openai.com/docs/model-index-for-researchers
Copilot itself is the 12B Codex model, with further refinements.