Say we have the following deployment: <pre class="prettyprint"><code>apiVersion: apps/v1 kind: Deployment metadata: ... spec: replicas: 2 template: spec: containers: - image: ... ... resources: requests: cpu: 100m memory: 50Mi limits: cpu: 500m memory: 300Mi </code></pre> And we also create a <code>HorizontalPodAutoscaler</code> object which automatically scales up/down the number of pods based on CPU average utilization. I know that the HPA will compute the number of pods based on the resource requests, but what if I want the containers to be able to request more resources before scaling horizontally? I have two questions: 1) Are resource limits even used by K8s when a HPA is defined? 2) Can I tell the HPA to scale based on resource limits rather than requests? Or as a means of implementing such a control, can I set the <code>targetUtilization</code> value to be more than 100%?

Hi in deployment we have resources requests and limits. As per documentation here those parameters acts before HPA gets main role as autoscaler: <blockquote> <ol> <li> When you create a Pod, the Kubernetes scheduler selects a node for the Pod to run on. Each node has a maximum capacity for each of the resource types: the amount of CPU and memory it can provide for Pods. </li> <li>Then the kubelet starts a Container of a Pod, it passes the CPU and memory limits to the container runtime.</li> <li>If a Container exceeds its memory limit, it might be terminated. If it is restartable, the kubelet will restart it, as with any other type of runtime failure.</li> </ol> </blockquote> If a Container exceeds its memory request, it is likely that its Pod will be evicted whenever the node runs out of memory. On the other hand: <blockquote> The Horizontal Pod Autoscaler is implemented as a control loop, with a period controlled by the controller manager’s (with default value of 15 seconds). The controller manager queries the resource utilization against the metrics specified in each HorizontalPodAutoscaler definition. </blockquote> Note: Please note that if some of the pod’s containers do not have the relevant resource request set, CPU utilization for the pod will not be defined and the autoscaler will not take any action for that metric. Hope this help

Using Horizontal Pod Autoscaling along with resource requests and limits

Tags:

kubernetes

horizontal-scaling

Say we have the following deployment:

apiVersion: apps/v1
kind: Deployment
metadata:
  ...
spec:
  replicas: 2
  template:
    spec:
      containers:
        - image: ...
          ...
          resources:
            requests:
              cpu: 100m
              memory: 50Mi
            limits:
              cpu: 500m
              memory: 300Mi

And we also create a HorizontalPodAutoscaler object which automatically scales up/down the number of pods based on CPU average utilization. I know that the HPA will compute the number of pods based on the resource requests, but what if I want the containers to be able to request more resources before scaling horizontally?

I have two questions:

1) Are resource limits even used by K8s when a HPA is defined?

2) Can I tell the HPA to scale based on resource limits rather than requests? Or as a means of implementing such a control, can I set the targetUtilization value to be more than 100%?

409

asked May 18 '19 12:05

mittelmania

2 Answers

No, HPA is not looking at limits at all. You can specify target utilization to any value even higher than 100%.

104

answered Oct 18 '22 01:10

Vasili Angapov

Hi in deployment we have resources requests and limits. As per documentation here those parameters acts before HPA gets main role as autoscaler:

When you create a Pod, the Kubernetes scheduler selects a node for the Pod to run on. Each node has a maximum capacity for each of the resource types: the amount of CPU and memory it can provide for Pods.

Then the kubelet starts a Container of a Pod, it passes the CPU and memory limits to the container runtime.

If a Container exceeds its memory limit, it might be terminated. If it is restartable, the kubelet will restart it, as with any other type of runtime failure.

If a Container exceeds its memory request, it is likely that its Pod will be evicted whenever the node runs out of memory.

On the other hand:

The Horizontal Pod Autoscaler is implemented as a control loop, with a period controlled by the controller manager’s (with default value of 15 seconds). The controller manager queries the resource utilization against the metrics specified in each HorizontalPodAutoscaler definition.

Note: Please note that if some of the pod’s containers do not have the relevant resource request set, CPU utilization for the pod will not be defined and the autoscaler will not take any action for that metric.

Hope this help

answered Oct 18 '22 01:10

Mark

Related questions
                            
                                Multiple apps running in one kubernetes cluster or a clusters for each app
                            
                                Create kubernetes secret from .json file
                            
                                How to get number of pods (available or terminating) in kubernetes?
                            
                                Multiple Env Variables in Helm Charts
                            
                                Kubernetes kube-dns pod is pending
                            
                                Golang template (helm) iterating over a list of maps
                            
                                Run a pod without scheduler in Kubernetes
                            
                                Restarting a specific pod inside a Deployment
                            
                                Kube-Flannel cant get CIDR although PodCIDR available on node
                            
                                Kubernetes CronJob - Skip job if previous is still running AND wait for the next schedule time
                            
                                calico/node is not ready: BIRD is not ready: BGP not established
                            
                                Printing not being logged by Kubernetes
                            
                                Does a completed kubernetes pod still reserves the required resources?
                            
                                adding single quotes to helm value
                            
                                All Kubernetes proxy targets down - Prometheus Operator
                            
                                How to debug Kubernetes CreateContainerConfigError
                            
                                Multiple K8S containers connecting to Google Cloud SQL through proxy
                            
                                Get Deployment annotation from a Kubernetes Pod
                            
                                Cannot deploy MySQL pod. --initialize specified but the data directory has files in it
                            
                                kubelet.service: Main process exited, code=exited, status=255/n/a

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With