Nice! I worked on something similar a year ago but for the Ruby world. If you're on a Unix with Ruby installed (e.g. OS X), you can mostly repeat the linked demo like so:
gem install pismo
pismo http://techcrunch.com/2011/06/09/twitter-ios/ title lede author datetime body
And then enjoy the output. No image pick out, but it's the first IMG tag in the 'html_body'.. just never got around to implementing it as I didn't need that feature.
The downside is I haven't worked on it for months and it's in sore need of improvements. For its current in-production use though, it's proving sufficient and a reasonable option for Rubyists. More info at https://github.com/peterc/pismo
Not knocking Jim's work on Plush, btw, he's actively working on it so if Java works out for you, stick to him! :)
One thing I've always wanted is something to extract multi-page forum threads and render them in a normalized readable way. For example, Reddit comment threads like IAMAs.
Anyone know of a service or library that does that?
I actually started playing with the concept one weekend for IAmAs specifically. I was trying to do it all client-side and the issue I was running up against was reddit's jsonp responses get VERY slow on large threads.
The downside is I haven't worked on it for months and it's in sore need of improvements. For its current in-production use though, it's proving sufficient and a reasonable option for Rubyists. More info at https://github.com/peterc/pismo
Not knocking Jim's work on Plush, btw, he's actively working on it so if Java works out for you, stick to him! :)