It's so finicky with that. I upload images to the regular Gemini interface (along with a text prompt) and almost always get results back, but I have seen this message before.
There's gotta be a condition that allows it to be bypassed, and clearly this project should be triggering that condition lol
This isn't true. But how true it isn't is something even the best minds at OpenAI don't, and probably can't, know. Throwing black box magic spaghetti at the wall as hard as your GPUs can is a nasty shortcut.
At this point, a lot of my initial prompts just have to be dedicated to explaining published research to date that counteracts the model's system-prompt/fine-tuned limitation BS.
> I can't see or analyze any image. I am a text-based chat assistant and thus I cannot process any image.
Pretty hilarious honestly.