I'm not convinced. I had Gemini convert a bunch of charity forms yesterday, and ...

timschmidt · 2025-06-20T02:13:23 1750385603

I've seen similar. I wonder if traditional organizational solutions, like those employed by the US Military or IBM, might be applicable. Redundancy is one of their tools for achieving reliability from unreliable parts. Instead of asking a single LLM to perform the task, ask 10 different LLMs to perform the same task 10 different times and count them like votes.

Normal_gaussian · 2025-06-22T10:12:03 1750587123

Yeah, what I did to "solve" my issue was to use several models (4), then where there was any disagreement farm out to humans (2). 60% went to humans in the end.

I suspect if I'd done some corrective transformations before LLM scanning the success rate would have been higher, but the cost threshold of the project didn't warrant it.

latentpot · 2025-06-20T02:42:42 1750387362

Why complicate? One LLM works, another reflects and then a decision engine to review would be cheaper.

nojito · 2025-06-20T17:06:44 1750439204

Not sure I believe this.

I just quickly took a scanned document and the transcription looks good.

https://19january2021snapshot.epa.gov/sites/static/files/201...

https://g.co/gemini/share/d315b4047224

It even got the faded partial date stamp.

Normal_gaussian · 2025-06-22T10:09:28 1750586968

Well bully for you accusing people of lying.

Thats one of the best scanned documents I've seen in years. Most scanning now is via phone.

simonw · 2025-06-20T02:28:48 1750386528

Did you out as much work into it as Derek did? He spent a full hour with Gemini to process the longer document.

7moritz7 · 2025-06-20T08:05:13 1750406713

Use 2.5 Pro on ai studio, not the gemini app

Normal_gaussian · 2025-06-22T10:10:12 1750587012

I did. I was scanning about 400 forms.

dwillis · 2025-06-20T16:06:13 1750435573

That's what I did.