Fast-for-a-starfish t1_jc3w3xd wrote on March 13, 2023 at 9:15 PM

Very impressive work, thank you very much for sharing.

I have a few question regarding the training precedure:

did you train using a next token prediction scheme or something else?
do you think RLHF would further improve the model using your instructions?
why did you choose to do the differentiation between Instruction and Input?
How do you create the string the model is trained on? just concat Input and Instruction?

Thank you very much

dojoteef OP t1_jc3wcg9 wrote on March 13, 2023 at 9:16 PM

It's not my work, so I can't answer your questions. Helpfully the authors see this post and can answer your questions.