Amazon elasticsearch interpretation of FreeStorageSpace metrics

2 Answers

I was also confused by this. Minimum means size on single data node - one which has least free space. And Sum means size of entire cluster (summation of free space on all data nodes). Got this info from following link

http://docs.aws.amazon.com/elasticsearch-service/latest/developerguide/es-managedomains.html

answered Sep 28 '22 01:09

jigar

We ran into the same confusion. Avg, Min, Max spreads the calculation across all nodes and Sum combines the Free/Used space for the whole cluster.

We had assumed that Average FreeStorageSpace means average free storage space of the whole cluster and set an alarm keeping the following calculation in mind:

Per day index = 1 TB
Max days to keep indices = 10

Hence we had an average utilization of 10 TB at any point of time. Assuming, we will go 2x - i.e. 20 TB our actual storage need as per https://docs.aws.amazon.com/elasticsearch-service/latest/developerguide/sizing-domains.html#aes-bp-storage was with replication factor of 2 is:

(20 * 2 * 1.1 / 0.95 / 0.8) = 57.89 =~ 60 TB

So we provisioned 18 X 3.8 TB instances =~ 68 TB to accomodated 2x = 60 TB

So we had set an alarm that if we go below 8 TB free storage - it means we have hit our 2x limit and should scale up. Hence we set the alarm

FreeStorageSpace <= 8388608.00 for 4 datapoints within 5 minutes + Statistic=Average + Duration=1minute

FreeStorageSpace is in MB hence - 8 TB = 8388608 MB.

But we immediately got alerted because our average utilization per node was below 8 TB.

After realizing that to get accurate storage you need to do FreeStorageSpace sum for 1 min - we set the alarm as

FreeStorageSpace <= 8388608.00 for 4 datapoints within 5 minutes + Statistic=Sum + Duration=1minute

The above calculation checked out and we were able to set the right alarms.

The same applies for ClusterUsedSpace calculation.

You should also track the actual free space percent using Cloudwatch Math:

enter image description here

answered Sep 28 '22 02:09

Saurabh Hirani

Related questions
                            
                                Amazon S3 - 405 Method Not allowed using POST (Although I allowed POST on the bucket)
                            
                                Route53 route subdomain to AWS Lambda?
                            
                                DynamoDB primary key and indexes table design
                            
                                How do I connect to aws ec2 server from chromebook using the secure shell extension?
                            
                                Speeding up Jenkins build
                            
                                AccessDenied: User is not authorized to perform: cloudfront:CreateInvalidation
                            
                                Hadoop in the AWS free tier?
                            
                                AWS API Gateway Method to Serve static content from S3 Bucket
                            
                                Creating a user authentication system for iOS (previously with Parse hopefully AWS) [closed]
                            
                                What is the difference between an S3 Object and an ObjectSummary?
                            
                                CodeDeploy deployment fails: bad interpreter /bin/sh^M
                            
                                Copy to Redshift from another accounts S3 bucket
                            
                                How do I scale a Java app with a REST API and a Database?
                            
                                Will Cognito User Pools support internationalization?
                            
                                Avoiding INSUFFICIENT_DATA CloudWatch alarms on SNS notifications
                            
                                Is it possible to loop through Amazon S3 bucket and count the number of lines in its file/key using Python?
                            
                                How to set an environment variable with a '$' in the value in Elastic Beanstalk?
                            
                                Having "key is a directory name" exception
                            
                                Is it possible to choose what should be the field to be return in DynamoDB?
                            
                                Cloudformation security group set group name

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Amazon elasticsearch interpretation of FreeStorageSpace metrics

Tags:

amazon-web-services

elasticsearch

amazon-elasticsearch

logstash

elastic-stack

Karup

People also ask

2 Answers

jigar

Saurabh Hirani

Recent Activity

Donate For Us