Oh, I know. The markup is incomprehensible, but not to a render engine. It doesn't even seem to impact loading speed. It generates amazing machine-text.
As for scraping... Parsing the hell that is wikitext is all you can do. Or apparently, pipe it through a text browser.
That's an interesting idea, and one that I hadn't thought of, but I'd place it closer to matching HTML with regex than actual "parsing". I might use it if I'm really desperate though.
As for scraping... Parsing the hell that is wikitext is all you can do. Or apparently, pipe it through a text browser.