Is HBase stable and production-ready?

Tags:

hbase

For folks who have deployed HBase on their own clusters, do you feel that it's sufficiently stable for production use? What types of troubles or issues have you run into?

I do see a bunch of companies listed as using HBase in production (http://wiki.apache.org/hadoop/Hbase/PoweredBy), but I'm curious as to whether a lot of maintenance, patching, and firedrills goes into keeping the HBase cluster up and running.

798

asked Jun 20 '09 18:06

Edmond Lau

1 Answers

HBase is about to hit a major milestone with HBase-0.20. There's is an alpha and soon to be a RC. It has had very major performance improvements. StumbleUpon reportedly serve their site live out the trunk version of HBase, with no additional caching layer, as do others. So I'd say it's definitely ready for production use.

Ryan Rawson (of StumbleUpon) gave a nice talk on it at the nosql conference recently, which mostly is about how far it's come over the last 6 months. There are slides if you don't want to watch the whole thing. Apart from performance improvements the other major addition is it integrates with zookeeper now, so the master isn't a single point of failure anymore.

HBase used to fall over with small cell sizes with memory issues because of a limitation of the file format. This has been addressed too with a new custom file format, which also gave performance gains.

I've been experimenting with HBase for about a year now, I'm ready to trust 0.20 with a production service, I wasn't quite with older versions. I recommended at least a 4 or 5 node devcluster when experimenting.

I can't really comment on what it's like care-taking a production cluster, because we only just started with a production one. An aspect that helps is the mailing list is extremely active and irc is in constant use so there's a very strong community for helping out at least.

answered Oct 19 '22 18:10

Tim

Related questions
                            
                                Simple oozie example of hive query?
                            
                                Pig, how to refer to a field after a join and a group by
                            
                                In Hive, how can I add a column only if that column does not exist?
                            
                                Should the HBase region server and Hadoop data node on the same machine?
                            
                                Hadoop 2.6 Connecting to ResourceManager at /0.0.0.0:8032
                            
                                could only be replicated to 0 nodes instead of minReplication (=1). There are 4 datanode(s) running and no node(s) are excluded in this operation
                            
                                how to tune mapred.reduce.parallel.copies?
                            
                                How oozie handle dependencies?
                            
                                What is the HDFS Location on Hadoop?
                            
                                Hive: Fast concatenate two tables into one?
                            
                                How to save a file in hadoop with python
                            
                                Change hadoop version using spark-ec2
                            
                                Hive inserting values to an array complex type column
                            
                                How does the percentile function work in Hive?
                            
                                How to import from sql dump to MongoDB?
                            
                                Generating Separate Output files in Hadoop Streaming
                            
                                (Hadoop) MapReduce - Chain jobs - JobControl doesn't stop
                            
                                Yarn JobHistory Error: Failed redirect for container_1400260444475_3309_01_000001
                            
                                how to find HADOOP_HOME path on Linux?
                            
                                Hive: Is it possible to rename an existing hive database?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is HBase stable and production-ready?

Tags:

hadoop

hbase

Edmond Lau

People also ask

1 Answers

Tim

Recent Activity

Donate For Us