Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The problem with PDFs is that text isn’t necessarily text. Most RAG implementations that support them don’t do any sort of OCR or use local offline OCR implementations that have really low accuracy.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: