Viewing a single comment thread. View all comments

juggarjew t1_iyrtq5p wrote

And I can generate an image in a few second on my Nvidia A4000, this is a meaningless statement given that you can tweak so many settings such that there is no apples to apples comparison going on.

58

AkirIkasu t1_iyu4y2e wrote

From the github page:

> The image generation procedure follows the standard configuration: 50 inference steps, 512x512 output image resolution, 77 text token sequence length, classifier-free guidance (batch size of 2 for unet).

12