Auto-scaling is taking more time to bring up new pod and giving connection error in google container engine

Tags:

I have used following command for autoscaling.

kubectl autoscale deployment catch-node --cpu-percent=50 --min=1 --max=10

The status of autoscaling in my case on load test is as like below .

27th minute

NAME         REFERENCE                     TARGET    CURRENT   MINPODS   MAXPODS   AGE
catch-node   Deployment/catch-node/scale   50%       20%      1         10        27m

NAME         DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
catch-node   1         1         1            1           27m

29th minute

NAME         REFERENCE                     TARGET    CURRENT   MINPODS   MAXPODS   AGE
catch-node   Deployment/catch-node/scale   50%       35%      1         10        29m

NAME         DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
catch-node   1         1         1            1           29m

31st minute

NAME         REFERENCE                     TARGET    CURRENT   MINPODS   MAXPODS   AGE
catch-node   Deployment/catch-node/scale   50%       55%      1         10        31m

NAME         DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
catch-node   1         1         1            1           31m

34th minute

NAME         REFERENCE                     TARGET    CURRENT   MINPODS   MAXPODS   AGE
catch-node   Deployment/catch-node/scale   50%       190%      1         10        34m

NAME         DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
catch-node   4         4         4            4           34m

Here i am getting connection refusing error in the time between transition of 1 pod to 4pods on autoscaling. Please let me know how much time it will take to bring up new pods once it exceed the CPU % limit given during autoscale .Also please let me know is there any method to reduce this time .once all new pods comes up, the issue is not there . Thanks in advance

996

asked May 06 '16 11:05

Priyesh Karatha

1 Answers

As documented in this doc, there are two factors affect the reaction time of the autoscaler:

--horizontal-pod-autoscaler-sync-period, which defines how often the autoscaler checks the status of the controlled resources. The default value is 30s. It can be changed via the flag of the controller-manager.
upscaleForbiddenWindow, which defines how often the autoscaler can scale up the resource. The default value is 3 mins. Currently it's not adjustable.

According to the log you pasted, if the load is stable, the autoscaler should reacted in 30s after CPU usage reaches 55%, is that the case?

171

answered Sep 21 '22 21:09

caesarxuchao

Related questions
                            
                                Kube ingress with hostname (how to know IP to forward domain name?)
                            
                                How to get filebeat to ignore certain container logs
                            
                                openshift 3.11 install fails - Unable to update cni config: No networks found in /etc/cni/net.d",
                            
                                The PersistentVolume is invalid: spec: Required value: must specify a volume type
                            
                                Kubernetes Deployment/Pod/Container statuses
                            
                                Pass variable that includes double quotes (") in its value to a container from K8s deployment
                            
                                Conditional Cloud Builds with Many Packages in a Monorepo
                            
                                Kubernetes ingress "an error on the server ("") has prevented the request from succeeding"
                            
                                Kubernetes 1.16 Nginx Ingress (0.26.1) TCP Mariadb/MySQL service not working
                            
                                Spark/k8s: How to run spark submit on Kubernetes with client mode
                            
                                Postgres / K8S : PANIC could not locate a valid checkpoint record / CrashLoopBackOff
                            
                                How can we see cached images in kubernetes?
                            
                                I don't fully understand how containerisation doesn't lead to over provisioning instances from the start
                            
                                Airflow- dag_id could not be found issue when using kubernetes executor
                            
                                Get the list of Pods stuck in terminating state for more than 10 mins and remove them in Ansible
                            
                                java.lang.ClassNotFoundException: com.fasterxml.jackson.databind.ser.FilterProvider when flink boot up
                            
                                How to have asp.net core running locally with HTTPS in Kubernetes
                            
                                Kubernetes Storage on bare-metal/private cloud
                            
                                Deploying local Docker image (DockerFIle) as local Kubernetes pod
                            
                                Configuring kubectl against remote clusters

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Auto-scaling is taking more time to bring up new pod and giving connection error in google container engine

Tags:

kubernetes

google-kubernetes-engine

google-container-registry

Priyesh Karatha

People also ask

1 Answers

caesarxuchao

Recent Activity

Donate For Us