Viewing a single comment thread. View all comments

AkirIkasu t1_iyu4y2e wrote

From the github page:

> The image generation procedure follows the standard configuration: 50 inference steps, 512x512 output image resolution, 77 text token sequence length, classifier-free guidance (batch size of 2 for unet).

12