For a post titled "How Ravelry Scales to 10 Million Requests Using Rails", the only scaling advice they mention is the technical specs of the site, like:
Tokyo Cabinet/Tyrant is used instead of memcached in some places for caching larger objects. Specifically markdown text that has been converted to HTML.
and this one tip:
The database is the problem. Nearly all of the scaling/tuning/performance related work is database related. For example, MySQL schema changes on large tables are painful if you don’t want any downtime. One of the arguments for schemaless databases.
Not much "how" in that.
It should be illuminating that a site of this size doesn't need a lavish description of arcane scaling strategies. The scaling is fairly straightforward, so how they built the site becomes the most interesting part.
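To illustrate how modest that quoted caching trick is, here's a rough sketch of what caching Markdown-rendered HTML in Tokyo Tyrant might look like from Ruby. This is just my guess at the shape of it (using the rufus-tokyo and rdiscount gems; the names, key scheme, and port are illustrative), not Ravelry's actual code:

    require 'digest/sha1'
    require 'rdiscount'             # Markdown -> HTML
    require 'rufus/tokyo/tyrant'    # hash-like client for a remote ttserver

    CACHE = Rufus::Tokyo::Tyrant.new('127.0.0.1', 1978)   # default Tokyo Tyrant port

    # Render Markdown once, then serve the cached HTML on later requests.
    def rendered_html(markdown_text)
      key = "md:#{Digest::SHA1.hexdigest(markdown_text)}"
      CACHE[key] ||= RDiscount.new(markdown_text).to_html
    end

The point being: swapping Tokyo Tyrant in for memcached for larger values is a one-library change, not an exotic scaling strategy.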
I suspect that was prior to the initial launch. The architecture has presumably evolved to the state described in the HS article since then.
Edit: yes, Casey says in the Tim Bray interview that "As soon as we could, we got alpha testers in to try it out... 4 months later, we had a site that we were ready to announce."
10 million server requests per day sounds kind of impressive, until you actually do the math: divided by how much physical iron they're using, that's a little less than 9 requests per second per server.
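(For anyone checking the math: 10,000,000 requests / 86,400 seconds ≈ 116 requests per second site-wide, which at a little under 9 req/s per server implies the load is spread over roughly 13 machines.)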
It makes me wonder: if they were using something other than Rails, would they need that much iron?
I strongly suspect a flaw in your statistics: I'm willing to put money on this site having a spiky workload, not a constant one. There are probably hours in a row when 6 of those servers sit idle.
Rant: I wish technical sites would stop using req/day as a metric. It leads to the OP's type of analysis. At the very least, such articles could use a format of "X req/day, peaking at Y/s". Maybe if the NYT were writing it would be OK to use req/day, but a site whose tagline is
"High Scalability - Building bigger, faster, more reliable websites" should know better.
Sorry for the late reply. IMO, that title was fine; it did its job well. My rant was about the stats section in the article itself, which still uses a flat time model on the scale of N things/day, instead of a more representative "N things/day (X things per <smaller-than-a-day unit> at peak)".
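For example, using this article's own headline figure, something like "10M req/day, averaging ~116 req/s, peaking at N req/s" would be far more useful; N is the number that actually matters for provisioning, and it's the one the article doesn't give.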
How much development time can you afford to spend to save the cost of four or five servers?
(You can't save the sixth server if you want your site to be up while the seventh one is rebooting or being replaced.)
If the other comments are to be believed, this site was built by one person, working part-time, in four months. He can't afford to lavish time on unimportant problems, like desperately trying to conserve server resources that he could otherwise afford and that cost far less than a programmer's time is worth.
The "small team" is his wife and some community managers -- no other engineers or admins. He builds & runs the whole stack himself, and he has duties outside of the admin/development pieces of the business. So he's still essentially a part time developer/part time admin.
If this were a Java app, I definitely would have been able to get away with less (mostly because of lower memory consumption, but lower CPU consumption wouldn't hurt either).
However, I'd probably still want 2 machines for redundancy.
Wait... they have Nginx out front passing requests to HAProxy and THEN to Apache + mod_rails? That just seems like a bit much, given that mod_rails can be installed with Nginx straight up. Why would you want a setup like this?
Having Nginx in front is a lot more flexible than just having HAProxy listen on port 80. For example, it can serve static files and do redirects, neither of which needs to pass through the whole load-balanced stack.
They could use Nginx -> HAProxy -> Nginx (w/ Passenger), but the Apache version feels slightly more mature (e.g. it has some config options that the Nginx version lacks) and it's likely they were already using it before the Nginx version came out.
HAProxy is better at load balancing. For example, it handles app servers that have gone down more gracefully. It also generates an awesome stats page that gives you way more info about what's going on than you can get from Nginx.
Yep! We used nginx's fair balancing module before switching to haproxy. It also helps me do rolling restarts/hot deployments in a nice way.
It's really a great piece of software. Kudos to Willy.
PS - you're also correct about the nginx->haproxy->apache. nginx makes a fabulous front end and I just plugged in Apache/Passenger where Mongrel used to be. I like that 1) I can easily plug in something else in the future and 2) Passenger on Apache is very stable. Nginx support is newish and I'm running stripped down Apaches that only do Passenger, so I'm not too fussed about it.
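If it helps to picture the whole chain described in this thread, the request path comes out to roughly this (my summary of the comments above, not an official diagram):

    browser -> nginx (static files, redirects) -> HAProxy (balancing, down-server handling, stats) -> Apache + Passenger (the Rails app)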
Purely anecdotal, but does anyone else notice, while browsing around, that the generation times are a bit lousy, even on the (largely static) unauthenticated pages?