I would like to use Cassandra to store session related informations. I do not have real HTTP session - it's different protocol, but the same concept. Memcached would be fine, but I would like to additionally persist data. Cassandra setup: <ul> <li>non replicated Key Space</li> <li>single Column Family, where key is session ID and each column within row stores single key/value - (<code>Map<String,Set<String,String>></code>)</li> <li>column TTL = 10 minutes</li> <li>write CL = ONE</li> <li>read CL = ONE</li> <li>2.000 writes/s</li> <li>5.000 reads/s </li> </ul> Data example: <pre class="prettyprint"><code>session1:{ // CF row key {prop1:val1, TTL:10 min}, {prop2:val2, TTL:10 min}, ..... {propXXX:val3, TTL:10 min} }, session2:{ // CF row key {prop1:val1, TTL:10 min}, {prop2:val2, TTL:10 min}, }, ...... sessionXXXX:{ // CF row key {prop1:val1, TTL:10 min}, {prop2:val2, TTL:10 min}, } </code></pre> In this case consistency is not a problem, but the performance could be, especially disk IO. Since data in my session leaves for short time, I would like to avoid storing it on hard drive - except for commit log. I have some questions: <ol> <li>If column expires in Memtable before flushing it to SSTable, will Cassandra anyway store such column in SSTable (flush it to HDD)? </li> <li>Replication is disabled for my Key Space, in this case storing such expired column in SSTable would not be necessary, right?</li> <li>Each CF hat max 10 columns. In such case I would enable row cache and disable key cache. But I am expecting my data to be still available in Memtable, in this case I could disable whole cache, right?</li> <li>Any Cassandra configuration hints for such session-store use case would be really appreciated :)</li> </ol> Thank you, Maciej

Here is what I did - and it works fine: <ol> <li>Set replication_factor to 1 - means disable replication</li> <li>Set <code>gc_grace to 0</code> - means delete columns on first compaction. This is fine, since data is not replicated.</li> <li>Increase memtable size and decrease cache size. We want to read data from memtable and omit cache - flushing data to HDD and reading it again from HDD into cache.</li> <li>Additionally commit log can be disabled - durable_writes=false</li> </ol> In this setup, data will be read from memtable and cache will be not used. Memtable can allocate enough heap to keep my data until it expires or even longer. After flushing data to SSTable, compaction will immediately remove expired rows, since <code>gc_grace=0</code>.

Cassandra as session store under heavy load

Tags:

cassandra

I would like to use Cassandra to store session related informations. I do not have real HTTP session - it's different protocol, but the same concept.

Memcached would be fine, but I would like to additionally persist data.

Cassandra setup:

non replicated Key Space
single Column Family, where key is session ID and each column within row stores single key/value - (Map<String,Set<String,String>>)
column TTL = 10 minutes
write CL = ONE
read CL = ONE
2.000 writes/s
5.000 reads/s

Data example:

Click to copy

session1:{ // CF row key
   {prop1:val1, TTL:10 min},
   {prop2:val2, TTL:10 min},
.....
   {propXXX:val3, TTL:10 min}
},
session2:{ // CF row key
   {prop1:val1, TTL:10 min},
   {prop2:val2, TTL:10 min},
},
......
sessionXXXX:{ // CF row key
   {prop1:val1, TTL:10 min},
   {prop2:val2, TTL:10 min},
}

In this case consistency is not a problem, but the performance could be, especially disk IO.

Since data in my session leaves for short time, I would like to avoid storing it on hard drive - except for commit log.

I have some questions:

If column expires in Memtable before flushing it to SSTable, will Cassandra anyway store such column in SSTable (flush it to HDD)?
Replication is disabled for my Key Space, in this case storing such expired column in SSTable would not be necessary, right?
Each CF hat max 10 columns. In such case I would enable row cache and disable key cache. But I am expecting my data to be still available in Memtable, in this case I could disable whole cache, right?
Any Cassandra configuration hints for such session-store use case would be really appreciated :)

Thank you, Maciej

431

asked Oct 10 '11 08:10

Maciej Miklas

2 Answers

Here is what I did - and it works fine:

Set replication_factor to 1 - means disable replication
Set gc_grace to 0 - means delete columns on first compaction. This is fine, since data is not replicated.
Increase memtable size and decrease cache size. We want to read data from memtable and omit cache - flushing data to HDD and reading it again from HDD into cache.
Additionally commit log can be disabled - durable_writes=false

In this setup, data will be read from memtable and cache will be not used. Memtable can allocate enough heap to keep my data until it expires or even longer.

After flushing data to SSTable, compaction will immediately remove expired rows, since gc_grace=0.

answered Oct 15 '22 07:10

Maciej Miklas

Considering your use case if I'm not wrong you wish to have all your key value[sessionID=>sessionData] pairs in memory and those values will expire every 10min[Means you don't want persistence].

Then why can't you try something like redis which is a in-memory store.

From Doc:

Redis is an open source, advanced key-value store. It is often referred to as a data structure server since keys can contain strings, hashes, lists, sets and sorted sets.

Since u don't need replication redis master slave architecture even might not affect you

Redis supports TTL also

AFAIK cassandra is good for wide fat rows[More columns less rows] rather skinny rows[transpose of previous]. Your use case doesn't seem so.

Regards, Tamil

answered Oct 15 '22 06:10

Tamil

Related questions
                            
                                Extending Cassandra cluster with datacenter in China (CGF)
                            
                                Can you use solr_query in Cassandra to find map field containing some particular value?
                            
                                Cassandra using composite indexes and secondary together
                            
                                Cassandra 1.1 storage engine how does it store composites?
                            
                                Is there a high performance difference in a Key-Value db on a single server with MySQL vs. NoSQL
                            
                                Astyanax's EntityPersister & Collection Updates
                            
                                How to create KEYSPACE in Cassandra using java Class
                            
                                Cassandra nodetool "compactionstats" meaning of displayed values
                            
                                how to load .tsv files into cassandra
                            
                                cassandra-stress "Failed to connect over JMX; not collecting these stats"
                            
                                Installing php datastax driver on ubuntu
                            
                                Cassandra can not delete role or user which in the role or user list
                            
                                Spring-data-cassandra 1.3.4 not compatible with Cassandra 3.x
                            
                                Deploying Cassandra on ECS?
                            
                                The usage of Cassandra's internal keyspace "system"
                            
                                Expanding a Cassandra cluster with one additional node: what ports need to be open?
                            
                                How to translate complex sql into the equivalent Cassandra representation
                            
                                Deploy Cassandra on EC2?
                            
                                Apache Cassandra as a message data store for ActiveMQ
                            
                                Hector (Cassandra) Delete Anomaly

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cassandra as session store under heavy load

Tags:

cassandra

Maciej Miklas

People also ask

2 Answers

Maciej Miklas

Tamil

Recent Activity

Donate For Us