Just curious, where did you execute ab from? I've uploaded your code and ran ab from my laptop - I've tried it under 3 different internet connections - and I only get about 100 req/s.
1 Unicorn worker is roughly equivalent to a Thin instance. You can only put a single Thin instance on a dyno, meaning that the first benchmark is what you previously could only attain in terms of throughput.
I might've not explicitly said this in my article, but see the first benchmark where I say:
"1 Unicorn Worker (Roughly equivalent to a single Thin instance)" - stating that this is Thin's (and was Heroku's) previous limit.
With Unicorn you can gain 3 to 4 times more throughput.
No problem. I doubt there is anything app-server specific to the Celadon Cedar stack. Maybe the Bambidious Bamboo stacks there is Thin-specific configuration. But Cedar is meant to run anything you want through the Procfile, you could also use Mongrel or Rainbows or maybe even Passenger. But Unicorn gets my preference as of late! Amazing app server.