I just had a very bad experience with JSON mode on the gemini-1.5-flash and 2.0-flash models using Google's own 'google-generativeai' library. It either fails to follow the JSON formatting correctly, or emits a string field that never terminates until max_tokens is hit. Pretty bad for Gemini, when open models like Qwen handle a basic extract-information-to-JSON task better.
Things to note:
1) supply a JSON schema in `config.response_schema`
2) set `config.response_mime_type` to `application/json`
That works for me reliably. I've had some issues with hitting max_tokens constraints, but that was usually on me: I had let it process a large list in a single inference call, which produced very large outputs.
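For reference, the two steps above look roughly like this with `google-generativeai` (the schema and prompt are made up for illustration; this is a sketch of the config, not a tested snippet):

```python
import google.generativeai as genai

# Illustrative schema for a basic extraction task.
schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name", "age"],
}

model = genai.GenerativeModel(
    "gemini-1.5-flash",
    generation_config=genai.GenerationConfig(
        # Step 2: force JSON output
        response_mime_type="application/json",
        # Step 1: constrain it to the schema
        response_schema=schema,
    ),
)

response = model.generate_content("Extract the person: Alice is 30.")
print(response.text)  # a JSON string conforming to the schema
```

Keeping the input small per call (rather than one giant list) is also what avoids the max_tokens truncation mentioned above.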
We're using gemini JSON mode in production applications with both `google-generativeai` and `langchain` without issues.