Cassandra adding disks / increase storage volume without adding new nodes

I need to increase the storage capacity of a Cassandra cluster, although performance and throughput are still more than sufficient. My first thought was to simply add drives. Is it possible to increase storage capacity without adding new nodes?

With JBOD, is it possible to add new drives live to a running node?

Or is the only way to take the node offline, add the disks, and bring it back online afterwards?

Any best practices?

Thanks, I really appreciate your help.

Manuel asked Apr 16 '14 12:04



1 Answer

You can modify cassandra.yaml to list multiple storage directories under data_file_directories. The node must be restarted to pick up the new configuration; after the restart, Cassandra will start using the new directory as well, and data will even out between the two over time.
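
As a rough illustration, the relevant part of cassandra.yaml might look like the fragment below. Both paths are assumptions made for the example: the first stands in for whatever data directory the node already uses, and the second is a hypothetical mount point for the newly added disk. The new directory would need to exist and be writable by the user Cassandra runs as before the node is restarted.

```yaml
# cassandra.yaml (fragment)
# Each entry is a directory where Cassandra stores SSTables.
data_file_directories:
    # existing data directory (assumed path, keep whatever is already configured)
    - /var/lib/cassandra/data
    # directory on the newly added disk (hypothetical mount point)
    - /mnt/disk2/cassandra/data
```

With more than one node in the cluster, restarting nodes one at a time after the change should keep the cluster available while each node picks up its new directory.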

Tupshin Harper answered Oct 14 '22 10:10