Hacker News

I recently needed to scan hundreds of low-quality invoices and run them through OCR to extract invoice numbers and dates. I had really taken for granted how seamless this is in some applications, and was shocked at how much work went into producing decent results.

I was obviously really naive. Either way, it gets me excited any time I see progress with OCR. I should give this a try against my (small) dataset.



I don't think I understand your comment. What were your results for Qwen? Or is that what you meant by how much work was needed?


I just ran Qwen against some of the invoices that my gnarly algorithm really struggled with (with OpenAI fallback), and Qwen was able to extract all relevant data without any issues. I'm pretty damn impressed, to be honest.


What is your point about Qwen? Or is it just a general statement regarding LLMs?


I did it the old-school way, with OpenAI as the fallback.

All I'm saying is I'm excited to try Qwen to see if it outperforms my gnarly algorithm.



