Cassandra has a limit of 2 billion cells per partition, but what's a partition?

1 Answers

With the advent of CQL3 the terminology has changed slightly from the old thrift terms.

Basically

Create Table foo (a int , b int, c int, d int, PRIMARY KEY ((a,b),c))

Will make a CQL3 table. The information in a and b is used to make the partition key, this describes which node the information will reside on. This is the 'partiton' talked about in the 2 billion cell limit.

Within that partition the information will be organized by c, known as the clustering key. Together a,b and c, define a unique value of d. In this case the number of cells in a partition would be c * d. So in this example for any given pair of a and b there can only be 2 billion combinations of c and d

So as you model your data you want to ensure that the primary key will vary so that your data will be randomly distributed across Cassandra. Then use clustering keys to ensure that your data is available in the way you want it.

Watch this video for more info on Datmodeling in cassandra The Datamodel is Dead, Long live the datamodel

Edit: One more example from the comments

Create Table foo (a int , b int, c int, d int, e int, f int, PRIMARY KEY ((a,b),c,d))

Partitions will be uniquely identified by a combination of a and b.

Within a partition c and d will be used to order cells within the partition so the layout will look a little like:

(a1,b1) --> [c1,d1 : e1], [c1,d1  :f1], [c1,d2 : e2] ....

So in this example you can have 2 Billion cells with each cell containing:

A value of c
A value of d
A value of either e or f

So the 2 billion limit refers to the sum of unique tuples of (c,d,e) and (c,d,f).

149

answered Sep 21 '22 07:09

RussS

Related questions
                            
                                Primary key in cassandra is unique?
                            
                                What are the implications of R + W > N for Cassandra clusters?
                            
                                Executing CQL through Shell Script?
                            
                                Cassandra "no viable alternative at input"
                            
                                Why don't you start off with a "single & small" Cassandra server as you usually do it with MySQL?
                            
                                Cassandra: Generate a unique ID?
                            
                                alter composite primary key in cassandra CQL 3.0
                            
                                How does cassandra find the node that contains the data?
                            
                                Cassandra: Exiting due to error while processing commit log during initialization
                            
                                How to do a join queries with 2 or more tables in cassandra cql
                            
                                Is there any query for Cassandra as same as SQL:LIKE Condition?
                            
                                Cassandra and Java 9 - ThreadPriorityPolicy=42 is outside the allowed range
                            
                                Cassandra: File "cqlsh", line 95 except ImportError, e:
                            
                                Check CQL version with Cassandra and cqlsh?
                            
                                Is Cassandra production ready for Ruby on Rails?
                            
                                cassandra get all records in time range
                            
                                Getting Cassandra datacenter name in cqlsh
                            
                                How to connect Cassandra using Java class
                            
                                Cassandra - transaction support
                            
                                What's the difference between creating a table and creating a columnfamily in Cassandra?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cassandra has a limit of 2 billion cells per partition, but what's a partition?

Tags:

cassandra

limit

column-family

Benoit Thiery

People also ask

1 Answers

Edit: One more example from the comments

RussS

Recent Activity

Donate For Us