We are working with a Cassandra database that will store data in the petabyte range. We are thinking of using either ElasticSearch or Solandra, but we are having a fun time deciding between which to use. I'm wondering if the our database might get too large. I know ElasticSearch is scalable, but to what extent - especially with a Cassandra database.
Solandra on the other hand is made for Cassandra and is highly scalable, but again, to what extent?
Both are scalable, but how scalable using Cassandra?
Solandra is being used in the 10s of Terabytes range.
Are you saying you want to index a PB of data in solandra or a subset? I think if you want 1 big index with a PB of data you are stretching the limits. but If you want a PB of indexes, then this will scale the same as Cassandra.
How many nodes are you planning to run? how much disk per node?
Have a look into this nice discussion:
http://groups.google.com/a/elasticsearch.com/group/users/browse_thread/thread/3f99e682887f98e4
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With