
I mean, that's multimodality, but fine-grained editing of a previously generated text->image prompt is an entirely distinct thing, no?


I'm pretty sure that's still the same multimodal LLM, and it's considered a form of prompting?



