ElasticSearch for Time Series Data [closed]

Tags:

elasticsearch

I am evaluating a number of different NoSQL databases to store time series JSON data. ElasticSearch has been very interesting due to the query engine, I just don't know how well it is suited to storing time series data.

The data is composed of various metrics and stats collected at various intervals from devices. Each piece of data is a JSON object. I expect to collect around 12GB/day, but only need to keep the data in ES for 180 days.

Would ElasticSearch be a good fit for this data vs MongoDB or Hbase?

468

asked Jul 22 '14 14:07

Patrick

1 Answers

You can read up on ElasticSearch time-series use-case example here.

But I think columnar databases are a better fit for your requirements.

My understanding is that ElasticSearch works best when your queries return a small subset of results, and it caches such parameters to be used later. If same parameters are used in queries again, it can use these cached results together in union, hence returning results really fast. But in time series data, you generally need to aggregate data, which means you will be traversing a lot of rows and columns together. Such behavior is quite structured and is easy to model, in which case there does not seem to be a reason why ElasticSearch should perform better than columnar databases. On the other hand, it may provide ease of use, less tuning, etc all of which may make it more preferable.

Columnar databases generally provide a more efficient data structure for time series data. If your query structures are known well in advance, then you can use Cassandra. Beware that if your queries request without using the primary key, Cassandra will not be performant. You may need to create different tables with the same data for different queries, as its read speed is dependent on the way it writes to disk. You need to learn its intricacies, a time-series example is here.

Another columnar database that you can try is the columnar extension provided for Postgresql. Considering that your max db size will be about 180 * 12 = 2.16 TB, this method should work perfectly, and may actually be your best option. You can also expect some significant size compression of about 3x. You can learn more about it here.

129

answered Nov 15 '22 15:11

SerkanSerttop

Related questions
                            
                                org.elasticsearch.common.xcontent.DeprecationHandler Exception while using Elasticsearch REST High level client
                            
                                Understanding Elastic Search
                            
                                Why do people ship logs to Logstash with NXLog and not Logstash itself?
                            
                                AWS Elasticsearch Kibana with Cognito - Missing role
                            
                                Elastic Search vs Sunspot comparison on features
                            
                                Elastic search alphabetical sorting based on first character
                            
                                Aggregation with 0 count Elastic Search
                            
                                How to find fields with mapping conflicts
                            
                                Renaming fields in elasticsearch
                            
                                Preserving order of terms in ElasticSearch query
                            
                                How to run rake in ruby-on-rails application in production?
                            
                                How do I list all stored scripts on an Elasticsearch cluster?
                            
                                Elasticsearch More Like this no result
                            
                                ElasticSearch query_string fails to parse query with some characters
                            
                                master_not_discovered_exception ElasticSearch single node
                            
                                How do I set the path.repo in Docker compose 3?
                            
                                Elasticsearch: HOW-TO delete a (cluster) setting
                            
                                Elastic NEST using Term filter on text field with inner keyword field
                            
                                Error: The 'elasticsearch' backend requires the installation of 'requests'. How do I fix it?
                            
                                How to make our customised dashboard as default dashboard on kibana

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With