How to do join queries with 2 or more tables in Cassandra CQL

Tags:

cassandra

cql

I am new to Cassandra. I have two tables, EVENTS and TOWER, and I need to join them for some queries, but I am not able to do it.

Structure of EVENTS table:

eid int PRIMARY KEY,
a_end_tow_id text,
a_home_circle text,
a_home_operator text,
a_imei text,
a_imsi text,

Structure of TOWER table:

 tid int PRIMARY KEY,
 tower_address_1 text,
 tower_address_2 text,
 tower_azimuth text,
 tower_cgi text,
 tower_circle text,
 tower_id_no text,
 tower_lat_d text,
 tower_long_d text,
 tower_name text,

Now I want to join these tables on EID and TID so that I can fetch data from both tables.

asked Jun 22 '13 07:06

BlueShark

2 Answers

Cassandra = No Joins. Your model is 100% relational. You need to rethink it for Cassandra. I would advise you to take a look at these slides; they dig deep into how to model data for Cassandra. Also, here is a webinar covering the topic. But stop thinking in terms of foreign keys and joining tables, because if you need relations, Cassandra isn't the tool for the job.

But Why?
Because to support joins, Cassandra would have to check consistency and do many of the other things that relational databases do, and you would lose the performance and scalability that Cassandra offers.

What can I do?
DENORMALIZE! Put all the data a query needs in one table. But won't the table have too many columns?
So? Cassandra can handle a very large number of columns in a table.
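
To make the denormalization idea concrete, here is a minimal CQL sketch. The table name `events_with_tower` and the choice of which tower columns to copy are my own assumptions for illustration; the column names themselves come from the question.

```sql
-- Hypothetical denormalized table: each event row carries its own copy
-- of the tower columns the query needs, so a single read returns both.
CREATE TABLE events_with_tower (
    eid int PRIMARY KEY,
    a_end_tow_id text,
    a_home_circle text,
    a_home_operator text,
    a_imei text,
    a_imsi text,
    tower_name text,
    tower_circle text,
    tower_lat_d text,
    tower_long_d text
);

-- One lookup by primary key, no join required (42 is an example value).
SELECT * FROM events_with_tower WHERE eid = 42;
```

The trade-off is that the tower data is duplicated: the application has to fill in the tower columns when it writes an event, and a change to a tower has to be written to every event row that holds a copy.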

The other thing you can do is simulate the join in your client application: match the two datasets in your code. This will be very slow, though, because you'll have to iterate over all your data.

Another way is to carry out multiple queries: select the event you want, then select the matching tower.
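
As a rough sketch of the multiple-query approach, assuming the tables are named `events` and `tower` as in the question and that the matching tower has the same key value as the event (the EID = TID condition the question describes):

```sql
-- Step 1: fetch the event by its primary key (42 is an example value).
SELECT eid, a_end_tow_id, a_imei, a_imsi
FROM events
WHERE eid = 42;

-- Step 2: fetch the matching tower by its primary key. The application
-- supplies the key it took from step 1; here that is the same value.
SELECT tid, tower_name, tower_lat_d, tower_long_d
FROM tower
WHERE tid = 42;
```

Both statements are lookups by primary key, so each one reads a single partition. The application combines the two result rows itself.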

answered Oct 20 '22 12:10

Lyuben Todorov

There are a couple of ways to join tables in Cassandra and query them, but of course you still have to rethink the data model.

1. Use Apache Spark’s SparkSQL™ with Cassandra (either open source or in DataStax Enterprise – DSE).
2. Use DataStax-provided ODBC connectors with Cassandra and DSE.
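
For the Spark option, the join is written in Spark SQL, not CQL, and it is Spark that executes it. The sketch below assumes a Spark session in which the Spark Cassandra Connector's catalog support has been configured; the catalog name `cass` and the keyspace name `telecom` are hypothetical placeholders, not something from the question.

```sql
-- Spark SQL (not CQL). Assumes a Cassandra catalog registered as "cass"
-- and a keyspace called "telecom"; both names are made up for this sketch.
SELECT e.eid, e.a_imei, e.a_imsi, t.tower_name, t.tower_lat_d, t.tower_long_d
FROM cass.telecom.events AS e
JOIN cass.telecom.tower AS t
  ON e.eid = t.tid;
```

Spark reads both tables and performs the join on its own workers, so this suits analytical or batch queries more than low-latency lookups.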

answered Oct 20 '22 13:10

Mayank Raghav

