Submitted by shingekichan1996 t3_10ky2oh in MachineLearning
koolaidman123 t1_j5ujfpv wrote
Reply to comment by altmly in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
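Contrastive methods require in-batch negatives, and you can't replicate that with grad accumulation: each micro-batch's loss only sees the negatives inside that micro-batch, so accumulating gradients over k small batches is not the same as training on one batch k times larger.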
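To make that concrete, here is a rough pure-Python sketch, assuming an InfoNCE-style loss where each anchor's negatives are the other positives in the same batch. The helper names (`info_nce`, `accumulated_info_nce`) and the random toy embeddings are made up for illustration and are not from any particular library or paper.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def info_nce(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss; negatives for anchor i are positives[j] for j != i."""
    n = len(anchors)
    total = 0.0
    for i in range(n):
        logits = [
            sum(a * p for a, p in zip(anchors[i], positives[j])) / temperature
            for j in range(n)
        ]
        # Numerically stable log-sum-exp over the whole batch.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / n


def accumulated_info_nce(anchors, positives, micro_batch_size, temperature=0.1):
    """What grad accumulation optimizes: the mean of per-micro-batch losses.

    Assumes the batch splits into equal-size micro-batches.
    """
    losses = []
    for start in range(0, len(anchors), micro_batch_size):
        stop = start + micro_batch_size
        losses.append(info_nce(anchors[start:stop], positives[start:stop], temperature))
    return sum(losses) / len(losses)


# Hypothetical toy data: 8 unit-norm "embeddings" of dimension 4,
# with each positive a slightly perturbed copy of its anchor.
rng = random.Random(0)
anchors = [normalize([rng.gauss(0, 1) for _ in range(4)]) for _ in range(8)]
positives = [normalize([x + 0.1 * rng.gauss(0, 1) for x in a]) for a in anchors]

full_loss = info_nce(anchors, positives)
accumulated_loss = accumulated_info_nce(anchors, positives, micro_batch_size=2)
print("full batch of 8:      ", full_loss)
print("4 micro-batches of 2: ", accumulated_loss)
```

The accumulated loss comes out lower than the full-batch loss because every softmax denominator is missing the negatives from the other micro-batches. In the extreme case of micro-batches of size 1 there are no negatives at all and the loss is zero, so accumulation changes the objective rather than just the memory footprint.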