Submitted by von-hust t3_11jyrfj in MachineLearning
astrange t1_jb6hn1a wrote
Reply to comment by AuspiciousApple in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust
StableDiffusion claims they also dedupe following this, in SD2.X at least.
Though, deduplicating images feels incomplete to me - what if the same thing appears in different images? That's kind of what you want, but also not what you want.
Viewing a single comment thread. View all comments