Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Using Cassandra for OLAP

We are using Cassandra for OLTP DB, storing DB transactions, and evaluating requirements for reporting solution.

We are evaluating using Cassandra for reporting database with flattened schema.

What are the advantages/ pitfalls for using Cassandra as reporting DB?

like image 750
Amit Kaneria Avatar asked Jan 08 '18 19:01

Amit Kaneria


People also ask

Can Cassandra be used for OLAP?

The OLAP system requires data from Cassandra on a periodic basis. Requirements pertaining to this scenario are: The frequency of the data copy needs to be reduced drastically. Data has to be consistent.

When should you not use Cassandra?

When you want many-to-many mappings or join tables. Cassandra doesn't support a relational schema with foreign keys and join tables. So if you want to write a lot of complex join queries, then Cassandra might not be the right database for you.

Can Cassandra be used as data warehouse?

Cassandra can be used both as a data warehouse(raw data storage) and as a database (for final data storage). It depends more on the cases you want to do with the data. You even may need to have both Hadoop and Cassandra for different purposes.

Is Cassandra a OLTP database?

So, what is Apache Cassandra? A distributed OLTP database built for high availability and linear scalability.


1 Answers

It's recommended to consider using Spark in conjunction to Cassandra for OLAP.

Here is a related post on stackoverflow:

Is Cassandra for OLAP or OLTP or both?

Here is a presentation for similar use case: https://www.slideshare.net/EvanChan2/breakthrough-olap-performance-with-cassandra-and-spark

like image 64
Alex M981 Avatar answered Sep 19 '22 16:09

Alex M981