How does pod replica scaling down work in Kubernetes Horizontal Pod Autoscaler?

Tags:

My understanding is that in Kubernetes, when using the Horizontal Pod Autoscaler, if the targetCPUUtilizationPercentage field is set to 50%, and the average CPU utilization across all the pod's replicas is above that value, the HPA will create more replicas. Once the average CPU drops below 50% for some time, it will lower the number of replicas.

Here is the part that I am not sure about:
What if the CPU utilization on a pod is 10%, not 0%?Will HPA still terminate the replica?
10% CPU isn't much, but since it's not 0%, some task is currently running on that pod. If it's a long lasting task (several seconds) and HPA decides to terminate the pod, that task will not be finished.

Does the HPA terminate pods only if the CPU utilization on them is 0% or does it terminate them whenever it sees that the value is below targetCPUUtilizationPercentage?

How does HPA decide which pods to remove?
Thank you!

243

asked Apr 28 '18 19:04

pkout

1 Answers

So you have two questions in there and let me address one by one. The first part - if a pod in a replica set is consuming let's say 10% then will Kubernetes kill that pod? The answer is Yes. Kubernetes is not looking at the individual pods but at an average of that metric across all pods in that replica set. Also the scaling down is gradual as explained here

The second part of the question - how does your application behave gracefully when a pod is about to be killed and it is still serving some requests? This can be handled by the grace period of the pod termination and even better if you implement a PreStop hook - which will allow you to do something like stop taking incoming requests but process existing requests. The implementation of this will vary based on the language runtime you are using, so I won't go in the details here.

Lastly - one scenario you should consider is what if VM on which pod was running goes down abruptly - you have no chance to execute PreStop hook! I think the application needs to be robust enough to handle failures.

160

answered Nov 13 '22 19:11

Vishal Biyani

Related questions
                            
                                Use Visual Studio debugger with ASP.NET Core web app running in Kubernetes?
                            
                                how to add a node to my kops cluster? (node in here is my external instance)
                            
                                How do I access a private Docker registry with a self signed certificate using Kubernetes?
                            
                                Why do I get "unbound immediate PersistentVolumeClaims" on Minikube?
                            
                                Change node machine type on GKE cluster
                            
                                How can you publish a Kubernetes Service without using the type LoadBalancer (on GCP)
                            
                                Integration of Kubernetes with Apache Airflow
                            
                                pod will not start due to "No nodes are available that match all of the following predicates:: Insufficient cpu"
                            
                                How to configure a Persistent Volume Claim using AWS EFS and ReadWriteMany?
                            
                                Kubernetes docker volume mounting option
                            
                                How to verify that a Kubernetes deployment update has been successful?
                            
                                Is this necessary to have multiple processes / threads in a Kubernetes pod?
                            
                                How to fix weave-net CrashLoopBackOff for the second node?
                            
                                What is the difference between the core os projects kube-prometheus and prometheus operator?
                            
                                How to mount entire directory in Kubernetes using configmap?
                            
                                Determine what resource was not found from "Error from server (NotFound): the server could not find the requested resource"
                            
                                My kubernetes cluster IP address changed and now kubectl will no longer connect
                            
                                Dynamic proxy_pass in nginx to another pod in Kubernetes
                            
                                Kubernetes - container communication within a pod using names instead of 'localhost'?
                            
                                Sharing a persistent volume between pods in Kubernetes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does pod replica scaling down work in Kubernetes Horizontal Pod Autoscaler?

Tags:

kubernetes

autoscaling

kubernetes-hpa

pkout

People also ask

1 Answers

Vishal Biyani

Recent Activity

Donate For Us