I read this in the official DSE documentation, but it did not go into depth on how. Can someone explain how, or provide links that do?

How is Cassandra designed to avoid the need for load balancers?

2 Answers

For this kind of information, it's better to consult the architecture guide.

Several parts of the system act as a kind of load balancer. First, you can send a request to any node in the cluster; that node acts as the "coordinator" and forwards the request to the nodes that actually own the data. Because this extra hop isn't optimal, the drivers provide a so-called token-aware load balancing policy: the driver infers from the data (the partition key) which nodes are responsible for it, and sends the request to one of those nodes, chosen based on other information (contributed by other load balancing policies).
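To make the coordinator and token-aware ideas concrete, here is a toy model in Python. It is only a sketch: the four-node ring, the token values, the MD5-based `token_for` hash, and the function names are all assumptions made for illustration (real clusters use a Murmur3 partitioner over a far larger token range, and real drivers expose this through their own policy classes).

```python
import bisect
import hashlib

# Hypothetical ring: (token, node) pairs sorted by token. The tokens and node
# names are made up; real clusters use Murmur3 tokens over a much larger range.
RING = [(0, "node-a"), (64, "node-b"), (128, "node-c"), (192, "node-d")]
TOKENS = [token for token, _ in RING]


def token_for(partition_key):
    """Stand-in hash mapping a key to a token in [0, 256)."""
    return hashlib.md5(partition_key.encode("utf-8")).digest()[0]


def owner_index(token):
    """Index of the first node whose token is >= the given token (wrapping)."""
    return bisect.bisect_left(TOKENS, token) % len(RING)


def replicas_for(partition_key, replication_factor=2):
    """Owner of the key's token plus the next nodes clockwise on the ring."""
    start = owner_index(token_for(partition_key))
    return [RING[(start + i) % len(RING)][1] for i in range(replication_factor)]


def pick_coordinator(partition_key, live_nodes):
    """Token-aware choice: prefer a live replica, else fall back to any live node."""
    for node in replicas_for(partition_key):
        if node in live_nodes:
            return node  # the request lands directly on a node that owns the data
    if not live_nodes:
        raise RuntimeError("no live nodes")
    return sorted(live_nodes)[0]  # a non-replica coordinator must forward the request
```

Calling `pick_coordinator("user:42", {"node-a", "node-b", "node-c", "node-d"})` returns the first replica for that key, which skips the extra hop a non-replica coordinator would need.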

With multiple data centers, the drivers and Cassandra itself can send requests to "remote" DCs if the "local" one isn't available (what counts as remote and local is specific to each consumer). Other factors also play a role here: for example, if you use LOCAL_ consistency levels (such as LOCAL_QUORUM), your requests won't be sent to a "remote" data center.
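The effect of LOCAL_ consistency levels can be sketched with a small availability check. This is a simplified model, not driver code: the `can_satisfy` function, its parameters, and the DC names are hypothetical, and only four consistency levels are covered. The quorum formula (a majority of replicas) is the standard one.

```python
def quorum(replica_count):
    """Majority of replicas: floor(n / 2) + 1."""
    return replica_count // 2 + 1


def can_satisfy(level, local_dc, rf_per_dc, live_per_dc):
    """Return True if enough live replicas exist for the consistency level.

    rf_per_dc:   replication factor per data center, e.g. {"dc1": 3, "dc2": 3}
    live_per_dc: live replicas of the requested partition per data center
    """
    live_local = live_per_dc.get(local_dc, 0)
    live_total = sum(live_per_dc.values())
    if level == "LOCAL_ONE":
        return live_local >= 1  # remote replicas do not count
    if level == "LOCAL_QUORUM":
        return live_local >= quorum(rf_per_dc[local_dc])
    if level == "ONE":
        return live_total >= 1  # any DC may answer
    if level == "QUORUM":
        return live_total >= quorum(sum(rf_per_dc.values()))
    raise ValueError("unsupported level in this sketch: " + level)
```

With a replication factor of 3 in each of two DCs and only one live replica left in dc1, `LOCAL_QUORUM` from dc1 cannot be satisfied in this model, while `QUORUM` still can, because the remote replicas count toward it.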

On the application design side: you can put a load balancer in front of your application layer, with each application instance connecting to the Cassandra cluster in its "local" data center and using LOCAL_ consistency levels for its operations. If one of the DCs goes down, the load balancer should stop sending traffic to the application layer in that DC.
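The failover behavior described for the front load balancer might look roughly like the sketch below. The `FrontLoadBalancer` class, its methods, and the endpoint names are hypothetical; a real deployment would rely on the health checks of whatever load balancer product is in use.

```python
class FrontLoadBalancer:
    """Routes clients to the application tier of a healthy data center."""

    def __init__(self, app_tiers):
        # app_tiers: ordered mapping of DC name -> application endpoint
        self.app_tiers = dict(app_tiers)
        self.healthy = set(self.app_tiers)

    def mark_down(self, dc):
        """Record that a DC failed its health check."""
        self.healthy.discard(dc)

    def mark_up(self, dc):
        """Record that a known DC is healthy again."""
        if dc in self.app_tiers:
            self.healthy.add(dc)

    def route(self, preferred_dc):
        """Use the preferred DC's app tier if healthy, else the next healthy one."""
        if preferred_dc in self.healthy:
            return self.app_tiers[preferred_dc]
        for dc, endpoint in self.app_tiers.items():
            if dc in self.healthy:
                return endpoint
        raise RuntimeError("no healthy data center")
```

Each application tier keeps talking only to its own DC's Cassandra nodes, so the failover decision lives entirely in front of the application layer.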

131

answered Oct 05 '22 04:10

Alex Ott

Load balancing is built into the drivers/connections. For example, the Java driver's round-robin behavior is explained in the documentation here:

https://docs.datastax.com/en/developer/java-driver-dse/1.6/manual/load_balancing/

Also explained here:

https://docs.datastax.com/en/developer/java-driver/3.1/manual/load_balancing/
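As a rough illustration of round-robin behavior, here is a toy model in Python rather than the Java driver's actual implementation. The `RoundRobinPlanner` class and host names are hypothetical. It assumes each query gets a plan that starts at the next host in turn and lists the remaining hosts as fallbacks.

```python
import itertools


class RoundRobinPlanner:
    """Toy model of a round-robin load balancing policy."""

    def __init__(self, hosts):
        self.hosts = list(hosts)
        self._counter = itertools.count()

    def query_plan(self):
        """Hosts to try for one query: rotated start, remaining hosts as fallbacks."""
        start = next(self._counter) % len(self.hosts)
        return self.hosts[start:] + self.hosts[:start]
```

Successive calls to `query_plan()` begin with a different host each time, which spreads coordinator work evenly without any external load balancer.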

answered Oct 05 '22 03:10

spencer7593

Tags: cassandra, datastax-enterprise

user2345093
