This gets interesting. One approach that I've used with image generation before ...

whywhywhywhy · on Aug 2, 2024

Current-generation image generators don’t understand text as instructions the way you’re trying to use it: describing an object, then placing it, then setting the scene.

It’s more like a giant telescope with many lenses (the latents from the prompts), and you’re adjusting the lenses to bring one of many possible realities into focus.

Taek · on Aug 2, 2024

It looks like imgur is blocking Mullvad VPN connections.