Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Oh, I know. The markup is incomprehensible, but not to a render engine. It doesn't even seem to impact loading speed. It generates amazing machine-text.

As for scraping... Parsing the hell that is wikitext is all you can do. Or apparently, pipe it through a text browser.



> pipe it through a text browser

That's an interesting idea, and one that I hadn't thought of, but I'd place it closer to matching HTML with regex than actual "parsing". I might use it if I'm really desperate though.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: