Viewing a single comment thread. View all comments

Kroutoner t1_j9vzb0b wrote

There are scenarios where you would be totally fine not using a validation set, or even any sort of sample splitting whatsoever, but you definitely need to know what you’re doing and know why it’s okay that you’re not using them. If you can’t provide an explicit justification for why it’s okay you’re probably best off using a validation set.

1

osedao OP t1_j9waf9t wrote

Could this approach be enough to justify not using validation: i have 8 features and if i have equal/same distributions of each of these features in both training and test set, would this be enough?

1

Kroutoner t1_j9wfyz1 wrote

This does not seem like suitable justification.

2