Submitted by AutoModerator t3_100mjlp in MachineLearning
emmytau t1_j2r7cbc wrote
Is it problematic that my BART summarization model's training loss drops below the validation loss? I could, for example, stop training after just 2 epochs, but it would be nice to train for more. Maybe that would just require more data, or do you have any suggestions for training arguments (e.g. early stopping, as sketched below)?
See the graph of training and validation loss: https://imgur.com/mF7Frfd
Model here: https://huggingface.co/emmyapi/distilbart-podimo-data-eval-2
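In case it helps frame the question: here's a minimal sketch of how early stopping plus some regularization arguments could be wired up with the Hugging Face `Seq2SeqTrainer`. The base checkpoint, the dataset variables (`train_ds`, `val_ds`), and all hyperparameter values are placeholders for illustration, not my actual setup.

```python
# Sketch: early stopping + mild regularization with the HF Trainer API.
# Checkpoint, datasets, and hyperparameter values are assumptions.
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    EarlyStoppingCallback,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

model_name = "sshleifer/distilbart-cnn-12-6"  # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

args = Seq2SeqTrainingArguments(
    output_dir="out",
    num_train_epochs=10,              # upper bound; early stopping can cut it short
    evaluation_strategy="epoch",      # evaluate once per epoch
    save_strategy="epoch",            # must match evaluation_strategy
    load_best_model_at_end=True,      # required by EarlyStoppingCallback
    metric_for_best_model="eval_loss",
    greater_is_better=False,          # lower eval_loss is better
    weight_decay=0.01,                # mild regularization (assumed value)
    label_smoothing_factor=0.1,       # softens targets; may narrow the train/val gap
)

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=train_ds,           # hypothetical tokenized datasets
    eval_dataset=val_ds,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    # stop if eval_loss fails to improve for 2 consecutive evaluations
    callbacks=[EarlyStoppingCallback(early_stopping_patience=2)],
)
trainer.train()
```

With this setup the early-stopping callback would pick the checkpoint at the validation-loss minimum automatically, so overshooting `num_train_epochs` is harmless.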