What options are there to speed up a full repair in Cassandra?

Tags: cassandra, cassandra-2.0

I have a Cassandra datacenter which I'd like to run a full repair on. The datacenter is used for analytics/batch processing, and I'm willing to sacrifice latency to speed up a full repair (nodetool repair). Writes to the datacenter are moderate.

What are my options to make the full repair faster? Some ideas:

Increase streamthroughput?
I guess I could disable autocompaction and decrease compactionthroughput temporarily. Not sure I'd want to do that, though...

Additional information:

I'm running on SSDs but haven't spent any time tuning cassandra.yaml for them.

204

asked Mar 19 '15 13:03

Ztyx

1 Answer

Full repairs are run sequentially by default. The state of each node's dataset is summarized in a Merkle tree (a binary hash tree), and the trees are compared to find the differences between replicas. Rebuilding these trees is the main cost here. According to this DataStax blog entry, "Every time a repair is carried out, the tree has to be calculated, each node that is involved in the repair has to construct its merkle tree from all the sstables it stores making the calculation very expensive."
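To illustrate why building these trees is expensive and how they are compared, here is a much simplified sketch in Python. It is not Cassandra's implementation: the bucketing scheme, the hash functions, and the names leaf_hashes, merkle_root and differing_leaves are all assumptions made for this example. The point is only that every row has to be read and hashed to build the leaves, while comparing two replicas afterwards is cheap.

```python
import hashlib


def leaf_hashes(rows, num_leaves):
    """Hash every row into one of num_leaves buckets (num_leaves must be a power of two).

    Each bucket stands in for a token subrange; every row is read once,
    which is the expensive part of building the tree.
    """
    buckets = [hashlib.sha256() for _ in range(num_leaves)]
    for key, value in sorted(rows.items()):
        # Hypothetical bucketing: a stable hash of the key picks the leaf.
        idx = int(hashlib.md5(key.encode()).hexdigest(), 16) % num_leaves
        buckets[idx].update(f"{key}={value};".encode())
    return [b.digest() for b in buckets]


def merkle_root(leaves):
    """Combine leaf hashes pairwise until a single root hash remains."""
    level = list(leaves)
    while len(level) > 1:
        level = [
            hashlib.sha256(level[i] + level[i + 1]).digest()
            for i in range(0, len(level), 2)
        ]
    return level[0]


def differing_leaves(leaves_a, leaves_b):
    """Return the indexes of leaves whose hashes differ between two replicas."""
    return [i for i, (a, b) in enumerate(zip(leaves_a, leaves_b)) if a != b]


# Two hypothetical replicas that disagree on a single row.
replica_a = {"k1": "v1", "k2": "v2", "k3": "v3"}
replica_b = {"k1": "v1", "k2": "stale", "k3": "v3"}

leaves_a = leaf_hashes(replica_a, 8)
leaves_b = leaf_hashes(replica_b, 8)
print("roots match:", merkle_root(leaves_a) == merkle_root(leaves_b))
print("ranges to stream:", differing_leaves(leaves_a, leaves_b))
```

Only the buckets whose hashes differ would need to be streamed between the replicas; the rest can be skipped.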

The only way I see to significantly increase the speed of a full repair is to run it in parallel or repair subrange by subrange. Your tag implies that you run Cassandra 2.0.

1) Parallel full repair

 nodetool repair -par, or --parallel, means carry out a parallel repair.

According to the nodetool documentation for Cassandra 2.0:

"Unlike sequential repair (described above), parallel repair constructs the Merkle tables for all nodes at the same time. Therefore, no snapshots are required (or generated). Use a parallel repair to complete the repair quickly or when you have operational downtime that allows the resources to be completely consumed during the repair."

2) Subrange repair

nodetool accepts start and end token parameters like so:

 nodetool repair -st (start token) -et (end token) $keyspace $columnfamily

For simplicity's sake, check out this Python script that calculates the tokens for you and executes the range repairs: https://github.com/BrianGallew/cassandra_range_repair
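The idea behind subrange repair can be sketched in a few lines of Python. This is a minimal illustration, not the linked script: the function names, the keyspace "analytics", the table "events" and the token values are hypothetical. It assumes that start and end are the bounds of a single, non-wrapping token range the node is a replica for, and it only prints the commands rather than running them.

```python
def split_token_range(start, end, steps):
    """Split the token range (start, end] into `steps` contiguous subranges."""
    if steps < 1 or end <= start:
        raise ValueError("need steps >= 1 and end > start (no wrap-around)")
    width = end - start
    bounds = [start + width * i // steps for i in range(steps)] + [end]
    return list(zip(bounds[:-1], bounds[1:]))


def repair_commands(keyspace, table, start, end, steps):
    """Build one `nodetool repair -st ... -et ...` command per subrange."""
    return [
        ["nodetool", "repair", "-st", str(st), "-et", str(et), keyspace, table]
        for st, et in split_token_range(start, end, steps)
    ]


# Hypothetical range and names, purely for illustration.
for cmd in repair_commands("analytics", "events", 0, 1000, 4):
    print(" ".join(cmd))
```

Each command covers a small slice of the range, so each Merkle tree is built over less data and a failed step can be retried on its own. To actually execute the commands you could pass each list to subprocess.run.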

Let me point out two alternative options:

A) Jeff Jirsa pointed to incremental repairs.

These are available starting with Cassandra 2.1. You will need to perform certain migration steps before you can use nodetool like this:

nodetool repair -inc, or --incremental means do an incremental repair.

B) OpsCenter Repair Service

For the couple of clusters at my company, itembase.com, we use the Repair Service in DataStax OpsCenter, which executes and manages small range repairs as a service.

141

answered Sep 24 '22 03:09

John

