The model starts from a 64x64 8-bit RGB image of noise (random pixels), so technically the odds of two identical starting images are 1 in 256^(64*64*3) — each of the 64*64*3 channel values can take one of 256 levels, which is roughly 10^29592 combinations — but most of them will probably be perceptually very close to each other, as the color differences won't amount to much. The image is then further upsampled by two other models which will change some details, but shouldn't affect the general composition of the image.
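A quick back-of-envelope in Python to show the scale (using uniform 8-bit noise as a stand-in; real diffusion models usually start from continuous Gaussian noise):

    import numpy as np

    # Each of the 64*64*3 channel values can take one of 256 levels
    # independently, so the number of distinct starting images is
    # 256 ** (64*64*3), not 64*64*256*3.
    channels = 64 * 64 * 3
    distinct_images = 256 ** channels
    print(len(str(distinct_images)))  # 29593 digits, i.e. about 10^29592

    # Drawing one such starting image.
    rng = np.random.default_rng()
    noise = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)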
Maybe I'm wrong, but with these diffusion models there is randomness in every sampling step too, not just in the initialization, and they can take up to 1000 steps to generate a single image.
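Roughly what a DDPM-style ancestral sampler looks like; `model` and the schedule tensors here are placeholders rather than any particular implementation:

    import torch

    # Sketch of DDPM ancestral sampling: fresh Gaussian noise is injected at
    # every step except the last, on top of the random initialization.
    def sample(model, alphas, alphas_bar, sigmas, steps=1000, shape=(1, 3, 64, 64)):
        x = torch.randn(shape)                          # random init
        for t in reversed(range(steps)):
            eps = model(x, t)                           # predicted noise
            mean = (x - (1 - alphas[t]) / (1 - alphas_bar[t]).sqrt() * eps) / alphas[t].sqrt()
            x = mean + sigmas[t] * torch.randn(shape) if t > 0 else mean  # per-step randomness
        return x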
Ah, good point, that would introduce more variation even if the initial noise is close. But if the initial noise is exactly the same, it probably means the generator was initialized with the same seed, and then the rest of the generation will be the same too, since the pseudo-random number generators are deterministic given a seed.
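For example, with a fixed seed the same noise comes out every time (PyTorch here purely as an illustration):

    import torch

    # Two generators seeded identically produce bit-identical noise, so the
    # whole sampling trajectory is reproducible given the same seed, prompt,
    # and software/hardware stack.
    g1 = torch.Generator().manual_seed(1234)
    g2 = torch.Generator().manual_seed(1234)
    a = torch.randn((1, 3, 64, 64), generator=g1)
    b = torch.randn((1, 3, 64, 64), generator=g2)
    print(torch.equal(a, b))  # True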
In other words, if another person needed a logo and used the same phrase, how long on average would it take until they got a duplicate of your image?
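Back-of-envelope, assuming the service draws a fresh 32-bit seed per request (an assumption, the real seed space is unknown) and that a duplicate requires hitting the exact same seed with the same phrase:

    # Probability of matching one specific seed per attempt is 1 / 2**32, so
    # the expected number of generations before a duplicate (geometric
    # distribution) is simply the size of the seed space.
    SEED_SPACE = 2 ** 32
    expected_attempts = SEED_SPACE
    print(f"about {expected_attempts:,} generations on average")  # ~4.3 billion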