Submitted by hopedallas t3_zmaobm in MachineLearning
trendymoniker t1_j0acn6e wrote
Reply to comment by Far-Butterscotch-436 in [D] Dealing with extremely imbalanced dataset by hopedallas
👆
1e6:1 is extreme. 1e3:1 is often realistic (think views to shares on social media). 18:1 is actually a pretty good real-world ratio.
If it were me, I’d just change the weights for each class in the loss function so that the classes contribute more or less equally.
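A minimal sketch of that class-weighting idea, assuming binary 0/1 labels and plain NumPy. The helper names (`balanced_class_weights`, `weighted_log_loss`) and the toy data are hypothetical, not from the thread. The weight formula N / (K * n_c) is one common heuristic, the same one scikit-learn documents for `class_weight="balanced"`; other weightings would work too.

```python
import numpy as np

def balanced_class_weights(y):
    """Weight each class by N / (K * n_c) so each class has equal total weight."""
    classes, counts = np.unique(y, return_counts=True)
    weights = len(y) / (len(classes) * counts)
    return dict(zip(classes.tolist(), weights.tolist()))

def weighted_log_loss(y_true, p_pred, class_weights, eps=1e-12):
    """Binary cross-entropy with each example scaled by its class weight."""
    p = np.clip(p_pred, eps, 1 - eps)  # avoid log(0)
    per_example = -(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))
    w = np.where(y_true == 1, class_weights[1], class_weights[0])
    return float(np.sum(w * per_example) / np.sum(w))

# Toy labels at an 18:1 negative-to-positive ratio (made-up data).
y = np.array([0] * 1800 + [1] * 100)
weights = balanced_class_weights(y)

# A constant prediction of 0.5, just to exercise the loss function.
loss = weighted_log_loss(y, np.full(len(y), 0.5), weights)
```

With these weights the 100 positives carry the same total weight as the 1800 negatives, which is the "more or less equal" outcome described above. Most training libraries expose an equivalent knob, so in practice you would usually pass the weights to the framework's loss rather than hand-roll it.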
190M examples isn’t that many either, so don’t worry about it. Compute is cheap; it’s OK if training takes more than one machine and/or more time.