Advantages of using cql over thrift

Tags:

Are there any distinct advantages for using cql over thrift or is it simply a case of developers being too used to SQL? I'm wanting to switch from thrift querying to cql, the only problem is I'm not sure about the downsides of doing so. What are they?

786

asked Mar 29 '13 10:03

Daniel Tomey

2 Answers

Lyuben's answer is a good one, but I believe he may be misinformed on a few points. First, you should be aware that the Thrift API is not going to be getting new features; it's there for backwards compatibility, and not recommended for new projects. There are already some features that can not be used through the Thrift interface.

Another factor is that the quoted benchmarks from Acunu are misleading; they don't measure the performance of CQL with prepared statements. See, for example, the graphs at https://issues.apache.org/jira/browse/CASSANDRA-3634 (probably the same data set on which the Acunu post is based, since Eric Evans wrote both). There have also been some improvements to CQL parsing and execution speed in the last year. It is not likely that you will observe any real speed difference between CQL 3 and Thrift.

Finally, I don't think I even agree that Thrift is more flexible. The CQL 3 datamodel allows using the same data structures that Thrift does for nearly all usages that are not antipatterns; it just allows you to think about the model in a more organized way. For example, Lyuben mentioned rows with differing numbers of columns. A CQL 3 table may still utilize that capability: there is a difference between "storage engine rows" (which is Cassandra's low level storage, and what Thrift uses directly) and "CQL rows" (what you see through the Thrift interface). CQL just does the extra work necessary to visualize wide storage engine rows as structured tables.

It's a little difficult to explain in a quick SO answer, but see this post for a somewhat gentle explanation.

answered Oct 12 '22 01:10

the paul

Querying
In CQL you can query cassandra and get data in a couple of lines (using JDBC driver):

String query = "SELECT * FROM message;";
PreparedStatement statement = con.prepareStatement(query);

While in thrift based API's it's a bit more complicated (example with Astyanax):

OperationResult<ColumnList<String>> result = 
     keyspace.prepareQuery(mail/*specify columnfamily structure*/)
             .getKey("lyuben@1363115059").execute();
ColumnList<String> columns = result.getResult();

Performance
Based on the benchmarks carried out by Acunu, Thrift (RPC) is slightly ahead of CQL when it comes to query performance, but you need to be in a situation where high throughput is key for this performance advantage to have a significant benefit.

Some excellent articles to lookup are:

A thrift to CQL3 upgrade guide.
CQL vs RPC - Acunu benchmarks
CQL3 for cassandra experts

EDIT

The above benchmarks are outdated, the paul provided newer benchmarks on prepared statements.

answered Oct 12 '22 02:10

Lyuben Todorov

Related questions
                            
                                Cassandra Delete by Secondary Index or By Allowing Filtering
                            
                                Serializing Java objects to Cassandra 1.2 via ByteBuffer & CQL 3
                            
                                Cassandra - unique constraint on row key
                            
                                Best practice cassandra setup on ec2 with large amount of data
                            
                                Is there any ultimate good documentation about Apache Cassandra? [closed]
                            
                                How to read data from Cassandra with R?
                            
                                What are the pros or cons of storing json as text vs blob in cassandra?
                            
                                Cassandra data model for simple messaging app
                            
                                Cassandra Java Driver- QueryBuilder API vs PreparedStatements
                            
                                Cassandra error - Order By only supported when partition key is restricted by EQ or IN
                            
                                Cassandra, mongodb or couchdb for Ruby on Rails [closed]
                            
                                SELECT Specific Value from map
                            
                                how to rapidly increment counters in Cassandra w/o staleness
                            
                                cassandra, select via a non primary key
                            
                                Server-side warning: Aggregation query used without partition key
                            
                                How to pass along username and password to cassandra in python
                            
                                What is virtual nodes. and how it is helping during partitioning in Cassandra
                            
                                Unable to start Cassandra: "node already exists"
                            
                                Create Cassandra table using cql3 with default TTL
                            
                                Cassandra CQLSH TEXT field limit on COPY FROM CSV (field larger than field limit (131072))

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Advantages of using cql over thrift

Tags:

cassandra

cql

thrift

Daniel Tomey

People also ask

2 Answers

the paul

Lyuben Todorov

Recent Activity

Donate For Us