
Appropriate database for web analytics?

I am working on a web analytics system similar to Google Analytics, and I am currently using MongoDB.

Problem using MongoDB

Writes and reads happen at the same time, and beyond a certain point writes become slow. I am therefore looking for a database that meets the following requirements:

Requirements

  1. It should be open source
  2. It should be scalable and able to handle large volumes of data
  3. It should perform well when aggregating data
  4. It should support time-based aggregation
  5. It should support real-time data processing
  6. It should have driver (client) support for Node.js and Java

The database should scale when accessed from both Node.js and Java.

I save the data using Java and fetch it via Node.js.
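
To make requirements 4 and 5 concrete, here is a minimal sketch of time-based aggregation: raw page-view events are rolled up into fixed-width time windows so that reads hit small pre-aggregated counts instead of scanning raw events. It is written in Python only for brevity. The event shape `(page, timestamp)` and the names `bucket_start` and `rollup` are assumptions for illustration, not taken from the system described above.

```python
from collections import Counter
from datetime import datetime, timezone


def bucket_start(ts: datetime, width_seconds: int) -> datetime:
    """Round a timezone-aware timestamp down to the start of its window."""
    epoch = int(ts.timestamp())
    return datetime.fromtimestamp(epoch - epoch % width_seconds, tz=timezone.utc)


def rollup(events, width_seconds=60):
    """Count page views per (page, window start)."""
    counts = Counter()
    for page, ts in events:
        counts[(page, bucket_start(ts, width_seconds))] += 1
    return counts


# Hypothetical sample events: (page, time of the view in UTC)
events = [
    ("/home", datetime(2013, 11, 12, 8, 11, 5, tzinfo=timezone.utc)),
    ("/home", datetime(2013, 11, 12, 8, 11, 40, tzinfo=timezone.utc)),
    ("/home", datetime(2013, 11, 12, 8, 12, 1, tzinfo=timezone.utc)),
    ("/pricing", datetime(2013, 11, 12, 8, 11, 59, tzinfo=timezone.utc)),
]

per_minute = rollup(events)            # one counter per page per minute
per_hour = rollup(events, 3600)        # same events, coarser windows
```

Calling `rollup` with a different `width_seconds` gives per-hour or per-day totals from the same events, which is the kind of query an analytics dashboard typically issues.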

karthick asked Nov 12 '13 08:11


1 Answer

I think you are looking for databases designed for analytics, that is, databases that can sustain a high volume of reads and writes at the same time. You may want to look at the following options.

Some open source data stores and processing engines used for real-time analytics:

  1. Druid
  2. Spark (a processing engine rather than a store)
  3. Cassandra
  4. HBase
  5. Hypertable
  6. Accumulo

Note: I think Cassandra has drivers for the languages you need and would be a good fit for a workload with heavy reads and writes. Cassandra is also used in Java/Storm-based projects aimed at real-time processing.
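
As a rough illustration of why a wide-column store such as Cassandra can cope with heavy concurrent reads and writes on time-ordered data, the sketch below models a time-bucketed partition key, a common data-modeling pattern for time series. This is an assumption about how one might lay out the data, not something prescribed above. It uses a plain in-memory dictionary in place of a real table and no database driver; `partition_key`, `partitions_for_range`, `record_view`, `views_between`, and the site IDs are all hypothetical names.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone


def partition_key(site_id: str, ts: datetime) -> tuple:
    """One partition per site per UTC day, so writes spread across partitions."""
    return (site_id, ts.astimezone(timezone.utc).date().isoformat())


def partitions_for_range(site_id: str, start: datetime, end: datetime) -> list:
    """List the partitions a reader must visit for a time range."""
    day = start.astimezone(timezone.utc).date()
    last = end.astimezone(timezone.utc).date()
    keys = []
    while day <= last:
        keys.append((site_id, day.isoformat()))
        day += timedelta(days=1)
    return keys


# In-memory stand-in for the table: partition key -> list of (timestamp, page)
table = defaultdict(list)


def record_view(site_id: str, ts: datetime, page: str) -> None:
    """Writer side: append the event to the partition for its day."""
    table[partition_key(site_id, ts)].append((ts, page))


def views_between(site_id: str, start: datetime, end: datetime) -> list:
    """Reader side: scan only the partitions that overlap the range."""
    rows = []
    for key in partitions_for_range(site_id, start, end):
        rows.extend(r for r in table.get(key, []) if start <= r[0] <= end)
    return sorted(rows)


utc = timezone.utc
record_view("site-1", datetime(2013, 11, 12, 23, 59, tzinfo=utc), "/home")
record_view("site-1", datetime(2013, 11, 13, 0, 1, tzinfo=utc), "/pricing")
record_view("site-2", datetime(2013, 11, 12, 23, 59, tzinfo=utc), "/home")

recent = views_between(
    "site-1",
    datetime(2013, 11, 12, 23, 0, tzinfo=utc),
    datetime(2013, 11, 13, 1, 0, tzinfo=utc),
)
```

The point of the design is that the writer and the reader each touch a small, predictable set of partitions, so a long-running read over old data does not compete with writes to the current day's partition. The bucket width (a day here) is a tuning choice that depends on traffic.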

References:

  1. http://kkovacs.eu/cassandra-vs-mongodb-vs-couchdb-vs-redis
  2. http://relistan.com/cassandra-vs-mongo/
  3. http://hypertable.com/why_hypertable/hypertable_vs_hbase_2/

Mithun Satheesh answered Oct 20 '22 21:10