Fast-for-a-starfish t1_jc3w3xd wrote
Very impressive work, thank you very much for sharing.
I have a few questions regarding the training procedure:
- did you train using a next token prediction scheme or something else?
- do you think RLHF would further improve the model using your instructions?
- why did you choose to differentiate between Instruction and Input?
- How do you create the string the model is trained on? just concat Input and Instruction?
Thank you very much
dojoteef OP t1_jc3wcg9 wrote
It's not my work, so I can't answer your questions. Hopefully the authors see this post and can answer them.