Fast-for-a-starfish t1_jc3w3xd wrote

Very impressive work, thank you very much for sharing.

I have a few questions regarding the training procedure:

  • Did you train using a next-token prediction scheme or something else?
  • Do you think RLHF would further improve the model using your instructions?
  • Why did you choose to differentiate between Instruction and Input?
  • How do you create the string the model is trained on? Do you just concatenate Input and Instruction?

Thank you very much

5

dojoteef OP t1_jc3wcg9 wrote

It's not my work, so I can't answer your questions. Hopefully the authors see this post and can answer them.

3