I don't have anything against Cassandra. I will take a look at your link, and see how easy it is to integrate with Pig. I would be pretty excited to have another painless option available. The fact that Cassandra works with Whirr is very, very cool. However:
Cassandra's documentation: http://wiki.apache.org/cassandra/GettingStarted
"Cassandra is an advanced topic, and while work is always underway to make things easier, it can still be daunting to get up and running for the first time."
Change your docs, or demonstrate a one-liner that pushes data to Cassandra, and I will happily update my post. Shadow puppet docs do not count.
Your statement about Hadoop being complex illustrates EXACTLY the problem I'm trying to solve: 'big data' usability. ;) Running Pig on Amazon EMR against records in S3 is not hard. Publishing data from S3 via EMR to Mongo in Heroku... that is not hard either. Wow, suddenly 'big data' is open to anyone using Heroku. That is a big deal.