Cassandra commit log clarification

Tags:

I have read over several documents regarding the Cassandra commit log and, to me, there is conflicting information regarding this "structure(s)". The diagram shows that when a write occurs, Cassandra writes to the memtable and commit log. The confusing part is where this commit log resides.

The diagram that I've seen over-and-over shows the commit log on disk. However, if you do some more reading, they also talk about a commit log buffer in memory - and that piece of memory is flushed to disk every 10 seconds.

DataStax Documentation states: "When a write occurs, Cassandra stores the data in a memory structure called memtable, and to provide configurable durability, it also appends writes to the commit log buffer in memory. This buffer is flushed to disk every 10 seconds".

Nowhere in their diagram do they show a memory structure called a commit log buffer. They only show the commit log residing on disk.

It also states: "When a write occurs, Cassandra stores the data in a structure in memory, the memtable, and also appends writes to the commit log on disk."

So I'm confused by the above. Is it written to the commit log memory buffer, which is eventually flushed to disk (which I would assume is also called the "commit log"), or is it written to the memtable and commit log on disk?

Apache's documentation states this: "Instead, like other modern systems, Cassandra provides durability by appending writes to a commitlog first. This means that only the commitlog needs to be fsync'd, which, if the commitlog is on its own volume, obviates the need for seeking since the commitlog is append-only. Implementation details are in ArchitectureCommitLog.

Cassandra's default configuration sets the commitlog_sync mode to periodic, causing the commitlog to be synced every commitlog_sync_period_in_ms milliseconds, so you can potentially lose up to that much data if all replicas crash within that window of time."

What I have inferred from the Apache statement is that ONLY because of the asynchronous nature of writes (acknowledgement of a cache write) could you lose data (it even states you can lose data if all replicas crash before it is flushed/sync'd).

I'm not sure what I can infer from the DataStax documentation and diagram as they've mentioned two different statements regarding the commit log - one in memory, one on disk.

Can anyone clarify, what I consider, a poorly worded and conflicting set of documentation?

I'll assume there is a commit log buffer, as they both reference it (yet DataStax doesn't show it in the diagram). How and when this is managed, I think, is a key to understand.

971

asked Jul 21 '16 14:07

Jim Wartnick

1 Answers

Generally when explaining the write path, the commit log is characterized as a file - and it's true the commit log is the on-disk storage mechanism that provides durability. The confusion is introduced when going deeper and the part about buffer cache and having to issue fsyncs is introduced. The reference to "commit log buffer in memory" is talking about OS buffer cache, not a memory structure in Cassandra. You can see in the code that there's not a separate in-memory structure for the commit log, but rather the mutation is serialized and written to a file-backed buffer.

Cassandra comes with two strategies for managing fsync on the commit log.

commitlog_sync 
    (Default: periodic) The method that Cassandra uses to acknowledge writes in milliseconds:
    periodic: (Default: 10000 milliseconds [10 seconds])
    Used with commitlog_sync_period_in_ms to control how often the commit log is synchronized to disk. Periodic syncs are acknowledged immediately.

    batch: (Default: disabled)note
    Used with commitlog_sync_batch_window_in_ms (Default: 2 ms) to control how long Cassandra waits for other writes before performing a sync. When using this method, writes are not acknowledged until fsynced to disk.

The periodic offers better performance at the cost of a small increase in the chance that data can be lost. The batch setting guarantees durability at the cost of latency.

171

answered Sep 24 '22 09:09

Andrew Weaver

Related questions
                            
                                Is there a way to "EXPLAIN" a Cassandra query?
                            
                                Bad Request: unconfigured columnfamily <CF_name> in Cassandra
                            
                                How does Apache Cassandra do aggregate operations?
                            
                                What is the best api/library for Java to use Cassandra? [closed]
                            
                                Cassandra and Secondary-Indexes, how do they work internally?
                            
                                Methods to Verify Cassandra Node Sync
                            
                                How Cassandra stores multicolumn primary key (CQL)
                            
                                How do I run geospatial queries at scale with NoSQL?
                            
                                If Dynamic columns are discouraged in cassandra 1.2/Cql3 , then how is it better than Mysql in functionality?
                            
                                Get a BigInteger attribute from Cassandra ResultSet
                            
                                Is Cassandra suitable to use as a primary data store?
                            
                                Generate UUID for Cassandra in Python
                            
                                Thinking of storing serialized java objects into cassandra as JSON. What is the catch?
                            
                                CQL Cassandra OR operator
                            
                                Optimal JVM settings for Cassandra
                            
                                Using cassandra instead of memcache?
                            
                                How can I trace Queries at Cassandra Server?
                            
                                Why swap needs to be turned off in Datastax Cassandra?
                            
                                Nested query not working in Cassandra
                            
                                IN clause with Spring Data and Cassandra @Query

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cassandra commit log clarification

Tags:

cassandra

datastax

Jim Wartnick

People also ask

1 Answers

Andrew Weaver

Recent Activity

Donate For Us