I have a 5-node Cassandra 2.0.7 cluster, and each node has 4 HDDs. Recently one of the HDDs on node3 failed and was replaced with a shiny new empty drive. After the replacement, Cassandra on this node was unable to start, failing with this exception:
INFO [main] 2014-06-02 12:45:17,232 ColumnFamilyStore.java (line 254) Initializing system.paxos
INFO [main] 2014-06-02 12:45:17,236 ColumnFamilyStore.java (line 254) Initializing system.schema_columns
INFO [SSTableBatchOpen:1] 2014-06-02 12:45:17,237 SSTableReader.java (line 223) Opening /mnt/disk2/cassandra/system/schema_columns/system-schema_columns-jb-310 (25418 bytes)
INFO [main] 2014-06-02 12:45:17,241 ColumnFamilyStore.java (line 254) Initializing system.IndexInfo
INFO [main] 2014-06-02 12:45:17,245 ColumnFamilyStore.java (line 254) Initializing system.peers
INFO [SSTableBatchOpen:1] 2014-06-02 12:45:17,246 SSTableReader.java (line 223) Opening /mnt/disk3/cassandra/system/peers/system-peers-jb-25 (20411 bytes)
INFO [main] 2014-06-02 12:45:17,253 ColumnFamilyStore.java (line 254) Initializing system.local
INFO [SSTableBatchOpen:1] 2014-06-02 12:45:17,254 SSTableReader.java (line 223) Opening /mnt/disk3/cassandra/system/local/system-local-jb-35 (80 bytes)
INFO [SSTableBatchOpen:2] 2014-06-02 12:45:17,254 SSTableReader.java (line 223) Opening /mnt/disk3/cassandra/system/local/system-local-jb-34 (80 bytes)
ERROR [main] 2014-06-02 12:45:17,361 CassandraDaemon.java (line 237) Fatal exception during initialization
org.apache.cassandra.exceptions.ConfigurationException: Found system keyspace files, but they couldn't be loaded!
at org.apache.cassandra.db.SystemKeyspace.checkHealth(SystemKeyspace.java:532)
at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:233)
at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:462)
at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:552)
Because the Cassandra node is unable to start, I cannot use nodetool repair.
The only way I see to recover the node is to remove all of its data and bootstrap it again from nearly bare metal. Is there a shorter way to recover in a typical HDD-failure scenario?
If a node is down or unavailable during a write request, Cassandra handles this with hinted handoff: the coordinator node managing the write stores hints (the write mutations) and replays them to the replica when it comes back online.
The server will be down for more than 4 hours, and it is important to note that by default each node stores hints for only up to 3 hours. So Cassandra will not take care by itself of replicating the data that was created, updated, or deleted during those 4 hours. If you could limit the outage window to less than 3 hours, hinted handoff would cover the missed writes; otherwise the node will need a repair afterwards.
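As a quick check, the hint behaviour is controlled in cassandra.yaml; the snippet below is a minimal sketch that assumes the common config path /etc/cassandra/cassandra.yaml, and the commented values are the Cassandra 2.0 defaults:

# Inspect the hint-related settings on the node (config path assumed; adjust to your install)
grep -E 'hinted_handoff_enabled|max_hint_window_in_ms' /etc/cassandra/cassandra.yaml
# Typical 2.0 defaults:
#   hinted_handoff_enabled: true
#   max_hint_window_in_ms: 10800000   # 3 hours in milliseconds; raise this if the node will be down longer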
A node in Cassandra holds the actual data plus information about itself, such as its location and data center. A node contains data such as keyspaces, tables, and their schema, and you can perform operations such as reading, writing, and deleting data on a node.
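For illustration only, here is a minimal cqlsh sketch of those per-node operations; the keyspace demo, table users, and host name node3 are hypothetical stand-ins, not objects from this cluster:

# Write, read, and delete through a single node (any reachable node can coordinate the request)
echo "INSERT INTO demo.users (id, name) VALUES (42, 'alice');" | cqlsh node3
echo "SELECT * FROM demo.users WHERE id = 42;" | cqlsh node3
echo "DELETE FROM demo.users WHERE id = 42;" | cqlsh node3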
Fixed the issue with these steps (a command-level sketch follows below):
1. Physically removed the files belonging to the system keyspace: Cassandra was then able to start and recreated the system keyspace, but without any metadata about the other keyspaces.
2. Ran nodetool resetlocalschema, which pulled the keyspace schema back in from the other nodes.
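The following is a minimal sketch of those two steps under stated assumptions, not an exact transcript: the data directories /mnt/disk1 through /mnt/disk4, the backup naming, and the service name cassandra are guesses based on the paths in the log above, so adjust them to your layout.

# Stop Cassandra on the affected node first (service name assumed).
sudo service cassandra stop

# Move the system keyspace out of every data directory rather than deleting it,
# so it can be restored if something goes wrong
# (directories assumed; check data_file_directories in cassandra.yaml):
for d in /mnt/disk1 /mnt/disk2 /mnt/disk3 /mnt/disk4; do
    mv "$d/cassandra/system" "$d/cassandra/system.broken.$(date +%s)"
done

# Start Cassandra; it recreates an empty system keyspace on startup.
sudo service cassandra start

# Once the node is up, pull the schema back from the other nodes:
nodetool resetlocalschema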