AWS Network load balancer - What is client reset count (and why is it high)

Tags:

The documentation for the various client/target/elb reset count metrics (TCP_Client_Reset_Count, TCP_Target_Reset_Count, TCP_ELB_Reset_Count) just says they count RST packets. I tried to understand what a RST packet is, and it seems to have to do with broken TCP connections. My load balancer has a single, long-term, seemingly successful client connection. Why do I see on the order of 100 client resets per hour? I also see about 10 load balancer resets per hour, and 0 target resets.

EDIT: I just observed that increasing the size of the server instance (I'm using Farscape--increased 0.25 vCPU to 0.5) led to a 10-fold reduction in client resets per hour. The number of load balancer resets did not change.

665

asked Mar 28 '18 17:03

Aleksandr Dubinsky

2 Answers

My hunch is that this is related to a bug in the Network Load Balancer that causes it to send 100x as many health checks as it should. See: NLB Target Group health checks are out of control My theory is that a bug causes the health check connection to be broken in an unclean way if the target instance is not quick enough. These broken health check connections get reported as "client resets" even though they should be reported as "ELB resets" or not reported at all.

189

answered Oct 31 '22 02:10

Aleksandr Dubinsky

There are many reasons for an TCP RST to be sent. Some are not normal, meaning errors, and some are normal connection cleanups that the TCP/IP stack or application performs.

An example of a normal TCP RST would be a long lived connection that exceeds some time limit imposed by one side or the other. Once the time limit is exceeded the connection can be "forceably" closed which will generate the RST.

An example of a not normal TCP RST would be an application that abruptly disconnected due to an internal error.

A poorly written application can also cause TCP RST when it does not perform graceful shutdowns on the TCP socket before closing the connection.

I will guess that the behavior you are seeing is not a problem. However, to really know, you will need to do a wire trace and protocol analysis on each connection to determine exactly what is happening.

answered Oct 31 '22 04:10

John Hanley

Related questions
                            
                                Alternative to AWS Lambda + NAT gateway
                            
                                MongoDB hosting options now that Heroku mLab add-on is being removed
                            
                                How to send HTML mails using Amazon SNS?
                            
                                What does the default trust policy in an AWS IAM role mean?
                            
                                Is it possible to embed AWS Cloudwatch dashboards in a webpage for internal company use?
                            
                                How to run Spark Scala code on Amazon EMR
                            
                                Running RabbitMQ+Celery in the same server as production environment
                            
                                How to deploy to a specific object key inside an S3 Bucket with the Serverless framework?
                            
                                boto EMR add step and auto terminate
                            
                                Amazon aurora postgres serverless: Database returned more than the allowed response size limit
                            
                                AWS S3 Glacier upload-archive taking a long time to finish execution - ways to check status or speed upload?
                            
                                Can I test locally when developing for Amazon Elastic Beanstalk?
                            
                                Amazon AWS S3 file naming strategy for performance
                            
                                SSH Agent forward specific keys rather than all registered ssh keys
                            
                                Download MySql Backup/Snapshot from Amazon RDS
                            
                                AWS Elastic Beanstalk CLI does not prompt to create new keypair
                            
                                How to make TensorFlow use more available CPU
                            
                                AWS EC2 get system log [closed]
                            
                                DynamoDB global tables using CloudFormation
                            
                                Impersonate user in AWS Cognito

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

AWS Network load balancer - What is client reset count (and why is it high)

Tags:

amazon-web-services

amazon-elb

Aleksandr Dubinsky

People also ask

2 Answers

Aleksandr Dubinsky

John Hanley

Recent Activity

Donate For Us