 

Riak vs GlusterFS

I need to set up a data store that can hold petabytes of files. Most of them are small JSON, image and CSV files, but some are binary files of around 100 MB.

I am looking into distributed storage systems that are masterless and have no single point of failure.

So far I have found Riak and GlusterFS.

Has anyone here used both of them?

I know that their interfaces are very different (Riak is a key/value database, GlusterFS is a file system). But it seems to me that both rely on hashing and similar distributed techniques. Will they have similar performance, consistency and availability?
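To make the "both use hashing" point concrete, here is a minimal, generic sketch of hash-based placement on a ring. It is not the actual placement logic of either Riak or GlusterFS; the class name `HashRing`, the node names, the number of ring points per node and the replica count are all assumptions chosen for illustration.

```python
import bisect
import hashlib


class HashRing:
    """Toy consistent-hash ring (illustrative only, not Riak or GlusterFS code)."""

    def __init__(self, nodes, points_per_node=64):
        # Each node contributes several points on the ring so load spreads out.
        self._ring = sorted(
            (self._hash(f"{node}#{i}"), node)
            for node in nodes
            for i in range(points_per_node)
        )
        self._points = [point for point, _ in self._ring]

    @staticmethod
    def _hash(key):
        # SHA-1 is used here only as a stable, well-distributed hash.
        return int.from_bytes(hashlib.sha1(key.encode("utf-8")).digest(), "big")

    def owners(self, key, replicas=3):
        """Walk clockwise from the key's position and collect distinct nodes."""
        start = bisect.bisect_right(self._points, self._hash(key))
        found = []
        for offset in range(len(self._ring)):
            node = self._ring[(start + offset) % len(self._ring)][1]
            if node not in found:
                found.append(node)
                if len(found) == replicas:
                    break
        return found


ring = HashRing(["node-a", "node-b", "node-c", "node-d"])
print(ring.owners("images/cat.png"))
```

Any client can compute where a key lives without asking a central master, which is what makes this family of designs masterless. Performance, consistency and availability still depend on everything built on top of the placement step, so similar hashing does not by itself imply similar behaviour.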

Eric Fong asked Dec 13 '11 02:12



1 Answer

We are running a 17-node Riak cluster (24 GB RAM and 2 TB disk per node) with the Bitcask backend, storing around 1 billion 3 KB objects. This setup performs well but is very resource intensive. We are considering moving away from Riak to GlusterFS, as performance is not that important for us. Perhaps using LevelDB as a backend would also mitigate our worries.
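As a rough back-of-the-envelope check of these numbers, the sketch below estimates disk and memory use. Bitcask keeps an in-memory index entry for every key, which is one reason such a cluster can be RAM hungry. The replica count of 3 (Riak's default `n_val`), the 32-byte average key length and the 40-byte per-key index overhead are assumptions for illustration, not measurements from our cluster.

```python
def estimate_cluster(num_objects, object_bytes, replicas, nodes,
                     disk_bytes_per_node, key_bytes, index_overhead_bytes):
    """Rough capacity estimate; every input is a guess to be replaced by real figures."""
    raw = num_objects * object_bytes
    replicated = raw * replicas
    capacity = nodes * disk_bytes_per_node
    # One in-memory index entry per stored copy of each key.
    index_total = num_objects * replicas * (key_bytes + index_overhead_bytes)
    return {
        "raw_tb": raw / 1e12,
        "replicated_tb": replicated / 1e12,
        "capacity_tb": capacity / 1e12,
        "disk_utilisation": replicated / capacity,
        "index_gb_per_node": index_total / nodes / 1e9,
    }


estimate = estimate_cluster(
    num_objects=1_000_000_000,   # from the description above
    object_bytes=3_000,          # "3 KB" taken as 3,000 bytes
    replicas=3,                  # assumption: default replication
    nodes=17,
    disk_bytes_per_node=2e12,    # "2 TB" taken as 2e12 bytes
    key_bytes=32,                # assumption: hypothetical average key length
    index_overhead_bytes=40,     # assumption: placeholder per-key overhead
)
for name, value in estimate.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the data itself uses only a fraction of the disks, while the key index takes a large share of each node's 24 GB of RAM. That is the kind of cost a backend that does not hold every key in memory, such as LevelDB, might reduce.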

At the moment the self-healing properties of Riak seem stronger and its configuration seems a tad easier. In your case, though, I'd be more comfortable storing the 100 MB files on GlusterFS.

harm answered Sep 22 '22 18:09
