I am studing couchbase, can anyone exlain what exactly is bucket and vbucket?

Tags:

I am studing couchbase now, I am really confused by the official description of the term 'bucket' and 'vbucket', can anybody explain what exactely a bucket or vbucket is ? what's the difference? Better to make some analogies and give some examples.

442

asked Nov 20 '13 09:11

user3012468

3 Answers

Short answer

Bucket is a logical keyspace of uniquely keyed documents, evenly distributed across all nodes in a cluster.

vBucket is a subset of a bucket which is located on a single node. Union of all vBuckets is a bucket.

Slightly longer answer

Imagine you have three nodes:

+----------+         +----------+        +----------+
|          |         |          |        |          |
|          |         |          |        |          |
|          |         |          |        |          |
|          |         |          |        |          |
|          |         |          |        |          |
|          |         |          |        |          |
|          |         |          |        |          |
+----------+         +----------+        +----------+
   node1                node2               node3

A bucket is a set of documents (that can be different in structure and attributes) that is distributed over all three nodes but it shares the same key space.

   +----------+         +----------+        +----------+
+---------------------------------------------------------------+
|  |          |         |          |        |          |        |
|  |          |         |          |        |          |      Bucket
|  |          |         |          |        |          |        |
+---------------------------------------------------------------+
   |          |         |          |        |          |
   |          |         |          |        |          |
   +----------+         +----------+        +----------+
      node1                node2               node3

Note that a key must be unique within a bucket, which is kind of different compared to a database concept in RDBMS where a key is unique within a table.

The bucket is divided into 1024 segments which are evenly distributed across all the nodes in the cluster. These segments are virtual buckets, or vBucketes. So, in this case, on each node there are 1024/3 vBuckets.

   +----------+         +----------+        +----------+
+---------------------------------------------------------------+
|  |          |         |          |        |          |        |
|  |  341 vBs |         |  341 vBs |        |  342 vBs |      Bucket
|  |          |         |          |        |          |        |
+---------------------------------------------------------------+
   |          |         |          |        |          |
   |          |         |          |        |          |
   +----------+         +----------+        +----------+
      node1                node2               node3

Each vBucket has its associated set of documents. So when the lookup is performed, clusterMap calculates the hash of the searched document's key and identifies the node and the vBucket where the document is located.

references: http://training.couchbase.com/online

answered Nov 18 '22 14:11

Milan

Bucket is like database at RDBMS. It contains documents, views and some configurations. VBucket is like shard at RDBMS. All keys at CB mapped to #VBucket and #VBucket mapped to server-name. Thanks to these hash functions results in an even distribution of documents on multiple nodes and fast get operation of the document by its id.

answered Nov 18 '22 12:11

Vladislav Koroteev

You can start with Couchbase documentation, section "Architecture and Concepts" http://docs.couchbase.com/admin/admin/Concepts/concept-intro.html

For more information about buckets, see http://docs.couchbase.com/admin/admin/Concepts/concept-dataStorage.html.

For more information about vBuckets, see http://docs.couchbase.com/admin/admin/Concepts/concept-vBucket.html.

In short, bucket is an abstraction, which describes certain resources on the cluster (like RAM and disk space) and also from the API standpoint it is namespace for the documents stored in the system, similar to database in SQL world.

answered Nov 18 '22 14:11

avsej

Related questions
                            
                                ElasticSearch or Couchbase or something else
                            
                                Couchbase connection timeout with Java SDK
                            
                                Callback in Golang
                            
                                Should I treat Couchbase bucket as table, or more like a schema
                            
                                How to use spring data with couchbase without _class attribute
                            
                                N1QL Query times out when Using parameterized IN clause
                            
                                Does couchbase actually support datasets larger than memory?
                            
                                can't stop docker couchbase-community
                            
                                Couchbase Lite pull replication fails with error in a sample Couchbase Mobile End to End testing project
                            
                                Couchbase Lite on Android L
                            
                                Best practice to store couchbase views
                            
                                Trying to install Couchbase, with gcc command fails, Python
                            
                                Alternate Couchbase UI [closed]
                            
                                Using an Increment counter for unique key generation in a Couchbase cluster
                            
                                Couchbase REST API for inserting/updating a document [closed]
                            
                                Couchbase - retrieving multiple documents using key prefix
                            
                                Couchbase concurrent timeout exception : Java SDK
                            
                                max number of couchbase views per bucket
                            
                                ElasticSearch - Using FilterBuilders

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With