How does cassandra find the node that contains the data?

Tags:

cassandra-2.0

I've read quite a few articles and a lot of question/answers on SO about Cassandra but I still can't figure out how Cassandra decides which node(s) to go to when it's reading the data.

First, some assumptions about an imaginary cluster:

Replication Strategy = simple
Using Random Partitioner
Cluster of 10 nodes
Replication Factor of 5

Here's my understanding of how writes work based on various Datastax articles and other blog posts I've read:

Client sends the data to a random node
The "random" node is decided based on the MD5 hash of the primary key.
Data is written to the commit_log and memtable and then propagated 4 times (with RF = 5).
The 4 next nodes in the ring are then selected and data is persisted in them.

So far, so good.

Now the question is, when the client sends a read request (say with CL = 3) to the cluster, how does Cassandra know which nodes (5 out of 10 as the worst case scenario) it needs to contact to get this data? Surely it's not going to all 10 nodes as that would be inefficient.

Am I correct in assuming that Cassandra will again, do an MD5 hash of the primary key (of the request) and choose the node according to that and then walks the ring?

Also, how does the network topology case work? if I have multiple data centers, how does Cassandra know which nodes in each DC/Rack contain the data? From what I understand, only the first node is obvious (since the hash of the primary key has resulted in that node explicitly).

Sorry if the question is not very clear and please add a comment if you need more details about my question.

Many thanks,

396

asked Jul 28 '15 07:07

kha

1 Answers

Client sends the data to a random node

It might seem that way, but there is actually a non-random way that your driver picks a node to talk to. This node is called a "coordinator node" and is typically chosen based-on having the least (closest) "network distance." Client requests can really be sent to any node, and at first they will be sent to the nodes which your driver knows about. But once it connects and understands the topology of your cluster, it may change to a "closer" coordinator.

The nodes in your cluster exchange topology information with each other using the Gossip Protocol. The gossiper runs every second, and ensures that all nodes are kept current with data from whichever Snitch you have configured. The snitch keeps track of which data centers and racks each node belongs to.

In this way, the coordinator node also has data about which nodes are responsible for each token range. You can see this information by running a nodetool ring from the command line. Although if you are using vnodes, that will be trickier to ascertain, as data on all 256 (default) virtual nodes will quickly flash by on the screen.

So let's say that I have a table that I'm using to keep track of ship crew members by their first name, and let's assume that I want to look-up Malcolm Reynolds. Running this query:

SELECT token(firstname),firstname, id, lastname  FROM usersbyfirstname  WHERE firstname='Mal';

...returns this row:

 token(firstname)     | firstname | id | lastname ----------------------+-----------+----+-----------   4016264465811926804 |       Mal |  2 |  Reynolds

By running a nodetool ring I can see which node is responsible for this token:

192.168.1.22  rack1       Up     Normal  348.31 KB   3976595151390728557                          192.168.1.22  rack1       Up     Normal  348.31 KB   4142666302960897745

Or even easier, I can use nodetool getendpoints to see this data:

$ nodetool getendpoints stackoverflow usersbyfirstname Mal Picked up JAVA_TOOL_OPTIONS: -javaagent:/usr/share/java/jayatanaag.jar  192.168.1.22

For more information, check out some of the items linked above, or try running nodetool gossipinfo.

answered Sep 21 '22 12:09

Aaron

Related questions
                            
                                com.datastax.driver.core.exceptions.InvalidQueryException: unconfigured table schema_keyspaces
                            
                                cqlsh connection error: 'ref() does not take keyword arguments'
                            
                                Apache Cassandra remote access
                            
                                Apache Cassandra vs Datastax Cassandra [closed]
                            
                                What is the batch limit in Cassandra?
                            
                                Cassandra: can I have default value for a column like sql
                            
                                Spatial data with mongodb or cassandra
                            
                                How to rename table in Cassandra CQL3
                            
                                Error while connecting to Cassandra using Java Driver for Apache Cassandra 1.0 from com.example.cassandra
                            
                                difference between exactly-once and at-least-once guarantees
                            
                                problem on starting cassandra
                            
                                Why are super columns in Cassandra no longer favoured?
                            
                                How to load Spark Cassandra Connector in the shell?
                            
                                Primary key in cassandra is unique?
                            
                                What are the implications of R + W > N for Cassandra clusters?
                            
                                Executing CQL through Shell Script?
                            
                                Cassandra "no viable alternative at input"
                            
                                Why don't you start off with a "single & small" Cassandra server as you usually do it with MySQL?
                            
                                Cassandra: Generate a unique ID?
                            
                                alter composite primary key in cassandra CQL 3.0

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does cassandra find the node that contains the data?

Tags:

cassandra

cassandra-2.0

kha

People also ask

1 Answers

Aaron

Recent Activity

Donate For Us