Submitted by von-hust t3_11jyrfj in MachineLearning
AuspiciousApple t1_jb6gzcd wrote
Reply to comment by currentscurrents in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust
Can't wait to see this replicated!
astrange t1_jb6hn1a wrote
StableDiffusion claims they also dedupe following this, in SD2.X at least.
Though, deduplicating images feels incomplete to me - what if the same thing appears in different images? That's kind of what you want, but also not what you want.
Viewing a single comment thread. View all comments