How does HBase partition a table across regionservers?

Please tell me how HBase partitions a table across regionservers.

For example, let's say my row keys are integers from 0 to 10M and I have 10 regionservers.
Does this mean that the first regionserver will store all rows with keys 0 - 1M, the second 1M - 2M, the third 2M - 3M, ... the tenth 9M - 10M?

I would like my row key to be a timestamp, but in that case most queries would apply to the latest dates, so all queries would be processed by only one regionserver. Is that true?

Or maybe this data would be spread differently?
Or can I somehow create more regions than I have region servers, so that (following the example above) server 1 would hold keys 0 - 0.5M and 3M - 3.5M? That way my data would be spread more evenly. Is this possible?

update

I just found that there's an option, hbase.hregion.max.filesize. Do you think this will solve my problem?

asked Aug 05 '10 00:08

wlk

1 Answer

WRT partitioning, you can read Lars' blog post on HBase's architecture or Google's Bigtable paper, which HBase "clones".

If your row key is only a timestamp, then yes, the region with the biggest keys will always be hit with new requests (since a region is only served by a single region server).
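To make that concrete, here is a rough sketch of how a row key maps to a region when regions are sorted by start key. It is only a model: the split points (`REGION_START_KEYS`) and the helper `region_for` are made up for illustration, and real HBase row keys are byte arrays compared lexicographically rather than Python integers.

```python
from bisect import bisect_right
from collections import Counter

# Hypothetical start keys for 10 regions covering integer keys 0..10M.
# The first region really starts at the empty key; it is modeled here as 0.
REGION_START_KEYS = [i * 1_000_000 for i in range(10)]


def region_for(row_key: int) -> int:
    """Return the index of the region whose [start, next_start) range holds row_key."""
    return bisect_right(REGION_START_KEYS, row_key) - 1


# Keys spread over the whole key space touch every region...
uniform_hits = Counter(region_for(k) for k in range(0, 10_000_000, 100_000))

# ...but "latest timestamp" style keys all land in the last region.
recent_hits = Counter(region_for(k) for k in range(9_900_000, 10_000_000, 1_000))

print("uniform keys:", sorted(uniform_hits.items()))
print("recent keys: ", sorted(recent_hits.items()))
```

Each region is served by one region server at a time, so in this model every "recent" request ends up on whichever server hosts region 9.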

Do you want to use timestamps in order to do short scans? If so, consider salting your keys (search Google for how Mozilla did it with Socorro).
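A minimal sketch of what salting can look like, assuming a one-byte bucket prefix derived from a hash of the timestamp. The names `NUM_BUCKETS`, `salted_key` and `scan_ranges` are hypothetical, and this is not necessarily the scheme Mozilla used.

```python
import hashlib
from collections import Counter

NUM_BUCKETS = 10  # hypothetical; often chosen close to the number of region servers


def salted_key(timestamp_ms: int) -> bytes:
    """Prefix a fixed-width timestamp with a one-byte bucket derived from its hash."""
    # Big-endian encoding keeps numeric order under byte-wise comparison.
    ts = timestamp_ms.to_bytes(8, "big")
    bucket = hashlib.sha256(ts).digest()[0] % NUM_BUCKETS
    return bytes([bucket]) + ts


def scan_ranges(start_ms: int, stop_ms: int):
    """A time-range scan now needs one (start, stop) key pair per bucket."""
    start = start_ms.to_bytes(8, "big")
    stop = stop_ms.to_bytes(8, "big")
    return [(bytes([b]) + start, bytes([b]) + stop) for b in range(NUM_BUCKETS)]


base = 1_280_966_400_000  # an arbitrary epoch-millisecond value
keys = [salted_key(base + i) for i in range(10_000)]
buckets = Counter(k[0] for k in keys)
print(sorted(buckets.items()))
```

The trade-off is visible in `scan_ranges`: consecutive timestamps are spread over all buckets, so a short time-range scan turns into `NUM_BUCKETS` smaller scans whose results have to be merged on the client side.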

Can you prefix the timestamp with any ID? For example, if you only request data for specific users, then prefix the ts with that user ID and it will give you a much better load distribution.
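As an illustration of the prefixing idea, the sketch below builds a composite key from a user ID and a timestamp. The fixed 8-byte encodings and the helper names (`user_ts_key`, `user_scan_range`) are assumptions for the example, not anything HBase prescribes.

```python
def user_ts_key(user_id: int, timestamp_ms: int) -> bytes:
    """Composite row key: fixed-width user ID followed by fixed-width timestamp."""
    return user_id.to_bytes(8, "big") + timestamp_ms.to_bytes(8, "big")


def user_scan_range(user_id: int, start_ms: int, stop_ms: int):
    """Start (inclusive) and stop (exclusive) keys for one user's time range."""
    return user_ts_key(user_id, start_ms), user_ts_key(user_id, stop_ms)


# A few (user_id, timestamp) events arriving in no particular order.
events = [(42, 3000), (7, 1000), (42, 1000), (7, 2000), (42, 2000)]

# Sorting the byte keys mimics how rows are ordered within the table.
keys = sorted(user_ts_key(u, t) for u, t in events)

start, stop = user_scan_range(42, 1000, 3000)
in_range = [k for k in keys if start <= k < stop]
print(len(in_range), "rows for user 42 in [1000, 3000)")
```

Rows for one user stay contiguous and time-ordered, so a per-user scan is still short, while writes from different users fall into different parts of the key space.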

If not, then use UUIDs or anything else that will randomly distribute your keys.

About hbase.hregion.max.filesize

Setting the max file size on that table (which you can do with the shell) doesn't mean that each region will be exactly X MB (where X is the value you set). Say your row keys are all timestamps, so each new row key is bigger than the previous one. Every new row is then inserted into the region with the empty end key (the last one). At some point one of that region's files will grow bigger than the max file size (through compactions), and the region will be split around the middle: the lower keys go to one region, the higher keys to another. But since each new row key is still bigger than the previous one, you will only ever write to the new upper region (and so on after every split).
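The behaviour described above can be imitated with a toy simulation. It is a sketch under loud simplifications: a row-count threshold (`MAX_ROWS_PER_REGION`) stands in for the max file size, splits happen immediately at the median key, and the function `simulate` is invented for this example.

```python
import random
from bisect import bisect_right, insort

MAX_ROWS_PER_REGION = 1000  # stand-in for the max file size threshold


def simulate(keys, max_rows=MAX_ROWS_PER_REGION):
    """Insert keys one by one, splitting any region that grows past max_rows.

    Returns (rows per region, number of writes that hit the last region).
    """
    starts = [float("-inf")]  # region start keys, sorted; -inf models the empty start key
    rows = [[]]               # sorted row keys held by each region
    writes_to_last = 0
    for key in keys:
        idx = bisect_right(starts, key) - 1  # region whose range contains key
        insort(rows[idx], key)
        if idx == len(starts) - 1:
            writes_to_last += 1
        if len(rows[idx]) > max_rows:
            mid = len(rows[idx]) // 2        # split around the middle
            upper = rows[idx][mid:]
            rows[idx] = rows[idx][:mid]
            starts.insert(idx + 1, upper[0])
            rows.insert(idx + 1, upper)
    return [len(r) for r in rows], writes_to_last


total = 10_000
mono_sizes, mono_last = simulate(range(total))  # timestamp-like keys
shuffled = random.Random(0).sample(range(total), total)
rand_sizes, rand_last = simulate(shuffled)      # randomly distributed keys

print("monotonic:", len(mono_sizes), "regions,", mono_last, "writes to last region")
print("random:   ", len(rand_sizes), "regions,", rand_last, "writes to last region")
```

With increasing keys the table ends up with many regions, yet every single write went to whichever region was last at the time, and the older regions sit half full and idle.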

tl;dr: even if you have more than 1,000 regions, with this schema the region with the biggest row keys will always get the writes, which means that the region server hosting it will become a bottleneck.

answered Sep 25 '22 09:09

jdcryans

Tags: parallel-processing, hadoop, hbase
