Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'd love to hear from someone who has implemented this in production. Seems like really cool tech, but haven't had a chance to use it on a project yet.


Using it in production currently with dual-write and dual-read to compare perf. I'll do a write-up showing how Cockroach performs to Citus and Cassandra for my use case.


We use Citus and Memsql (big data analytics use cases). How does Cockroach handle joins and other OLAP style queries?


CockroachDB is not (yet) ready for use on OLAP-style workloads. Our performance work has focused on OLTP workloads so far. That said, we do great on OLTP joins (which is a stressed in the TPC-C workload).


You're not going to get better performance for OLAP than MemSQL's columnstore and in-memory rowstore for reference tables to join.

Citus is great if you want the Postgres interface but is still using standard rowstore tables. CockroachDB is similar with rowstore performance but with added distributed consensus overhead. They are both much better for OLTP and sharding. CockroachDB also provides easy high-availability and replication.


MemSQL is one of those "ask for a quote" products. What are some rule of thumb estimates for what it costs?


Licensed by total RAM of all nodes. $25k/year minimum license now, but you should still talk to them if you're a small company. Regardless of price, I highly recommend the product as one of the most polished data warehouses available for on-prem/self-managed operations.


$25K/year/box from previous comments


Yeah this is what we do. Citus is our single source of truth and powers a few interactive apps and admin panels. These sync hourly to our Memsql cluster which is cstore + ref tables and works amazingly.


kdb+?


Sure, but it's far more expensive and not as generally usable as the mysql-flavored MemSQL for common data warehouse scenarios. Performance will be similar but there are differences in functionality like kdb's asof joins that can't really be compared.

kdb+ is much better for numeric/financial analysis apps, especially when used with the integrated query language and interpreter environment.


The statement was that you will not get better performance than memsql, not about which is cheaper.


As written: "Performance will be similar but there are differences in functionality like kdb's asof joins that can't really be compared."

Also the original post only mentioned MemSQL, Citus, and CockroachDB. With those, MemSQL is the fastest.


I'm using MemSQL's columnstore myself and the performance is nothing short of amazing. I migrated from Citus DB to it for OLAP workloads.

But for what reasons are you using Citus as well? Would like to know if I am missing something or hear another perspective.

Can you explain your use case? Thanks


We use clickhouse cluster with 1000 nodes and 50000 GB clickstream data.


That's only 50gb per node. Why do you/Clickhouse need so many nodes?


Maybe some space is dedicated to replication? Or for query execution temp space like Redshift. Or he could be trying to keep everything in memory.


Please post your write-up on HN! I would love to read CockroachDB performing in real world.


Please do, that would be a very popular read I think.


Please do!


Works great, just a tad slow. Hopefully this improves things. Deploying with Kubernetes is pretty seamless as well.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: