Submitted by TheCockatoo t3_10m1sdm in MachineLearning
bloc97 t1_j63q1nk wrote
Reply to comment by HateRedditCantQuitit in [D] Why are GANs worse than (Latent) Diffusion Models for text2img generation? by TheCockatoo
>It's simpler (which leads to progress)
I wouldn't say current diffusions models are simpler, in fact they are much more complex than even the most "complex" GAN architectures. However it's exactly because of all the other points that they have become this complex. A vanilla GAN would never be able to endure this much tweaking without mode collapse. Compare that to even the most basic score-based models, which are always stable.
Sometimes, the "It just works™" proposition is much more appealing than pipeline simplicity or speed.
Viewing a single comment thread. View all comments