Coming from numerous years of running node/rails apps on bare metal; i was used to be able to run as many apps as i wanted on a single machine (let's say, a 2Go at digital ocean could easily handle 10 apps without worrying, based on correct optimizations or fairly low amount of traffic) Thing is, using kubernetes, the game sounds quite different. I've setup a "getting started" cluster with 2 standard vm (3.75Go). Assigned a limit on a deployment with the following : <pre class="prettyprint"><code> resources: requests: cpu: "64m" memory: "128Mi" limits: cpu: "128m" memory: "256Mi" </code></pre> Then witnessing the following : <pre class="prettyprint"><code>Namespace Name CPU Requests CPU Limits Memory Requests Memory Limits --------- ---- ------------ ---------- --------------- ------------- default api 64m (6%) 128m (12%) 128Mi (3%) 256Mi (6%) </code></pre> What does this 6% refers to ? Tried to lower the CPU limit, to like, 20Mi… the app does to start (obviously, not enough resources). The docs says it is percentage of CPU. So, 20% of 3.75Go machine ? Then where this 6% comes from ? Then increased the size of the node-pool to the n1-standard-2, the same pod effectively span 3% of node. That sounds logical, but what does it actually refers to ? Still wonder what is the metrics to be taken in account for this part. The app seems to need a large amount of memory on startup, but then it use only a minimal fraction of this 6%. I then feel like I misunderstanding something, or misusing it all Thanks for any experienced tips/advices to have a better understanding Best

According to the docs, CPU requests (and limits) are always fractions of available CPU cores on the node that the pod is scheduled on (with a <code>resources.requests.cpu</code> of <code>"1"</code> meaning reserving one CPU core exclusively for one pod). Fractions are allowed, so a CPU request of <code>"0.5"</code> will reserve half a CPU for one pod. For convenience, Kubernetes allows you to specify CPU resource requests/limits in millicores: <blockquote> The expression <code>0.1</code> is equivalent to the expression <code>100m</code>, which can be read as “one hundred millicpu” (some may say “one hundred millicores”, and this is understood to mean the same thing when talking about Kubernetes). A request with a decimal point, like <code>0.1</code> is converted to <code>100m</code> by the API, and precision finer than <code>1m</code> is not allowed. </blockquote> As already mentioned in the other answer, resource requests are guaranteed. This means that Kubernetes will schedule pods in a way that the sum of all requests will not exceed the amount of resources actually available on a node. So, by requesting <code>64m</code> of CPU time in your deployment, you are requesting actually 64/1000 = 0,064 = 6,4% of one of the node's CPU cores time. So that's where your 6% come from. When upgrading to a VM with more CPU cores, the amount of available CPU resources increases, so on a machine with two available CPU cores, a request for 6,4% of one CPU's time will allocate 3,2% of the CPU time of two CPUs.

kubernetes / understanding CPU resources limits

Tags:

google-cloud-platform

kubernetes

Coming from numerous years of running node/rails apps on bare metal; i was used to be able to run as many apps as i wanted on a single machine (let's say, a 2Go at digital ocean could easily handle 10 apps without worrying, based on correct optimizations or fairly low amount of traffic)

Thing is, using kubernetes, the game sounds quite different. I've setup a "getting started" cluster with 2 standard vm (3.75Go).

Assigned a limit on a deployment with the following :

        resources:
          requests:
            cpu: "64m"
            memory: "128Mi"
          limits:
            cpu: "128m"
            memory: "256Mi"

Then witnessing the following :

Namespace       Name            CPU Requests    CPU Limits  Memory Requests Memory Limits
---------       ----            ------------    ----------  --------------- -------------
default         api             64m (6%)        128m (12%)  128Mi (3%)      256Mi (6%)

What does this 6% refers to ?

Tried to lower the CPU limit, to like, 20Mi… the app does to start (obviously, not enough resources). The docs says it is percentage of CPU. So, 20% of 3.75Go machine ? Then where this 6% comes from ?

Then increased the size of the node-pool to the n1-standard-2, the same pod effectively span 3% of node. That sounds logical, but what does it actually refers to ?

Still wonder what is the metrics to be taken in account for this part.

The app seems to need a large amount of memory on startup, but then it use only a minimal fraction of this 6%. I then feel like I misunderstanding something, or misusing it all

Thanks for any experienced tips/advices to have a better understanding Best

819

asked Feb 19 '17 11:02

Ben

1 Answers

According to the docs, CPU requests (and limits) are always fractions of available CPU cores on the node that the pod is scheduled on (with a resources.requests.cpu of "1" meaning reserving one CPU core exclusively for one pod). Fractions are allowed, so a CPU request of "0.5" will reserve half a CPU for one pod.

For convenience, Kubernetes allows you to specify CPU resource requests/limits in millicores:

The expression 0.1 is equivalent to the expression 100m, which can be read as “one hundred millicpu” (some may say “one hundred millicores”, and this is understood to mean the same thing when talking about Kubernetes). A request with a decimal point, like 0.1 is converted to 100m by the API, and precision finer than 1m is not allowed.

As already mentioned in the other answer, resource requests are guaranteed. This means that Kubernetes will schedule pods in a way that the sum of all requests will not exceed the amount of resources actually available on a node.

So, by requesting 64m of CPU time in your deployment, you are requesting actually 64/1000 = 0,064 = 6,4% of one of the node's CPU cores time. So that's where your 6% come from. When upgrading to a VM with more CPU cores, the amount of available CPU resources increases, so on a machine with two available CPU cores, a request for 6,4% of one CPU's time will allocate 3,2% of the CPU time of two CPUs.

137

answered Oct 15 '22 12:10

helmbert

Related questions
                            
                                Why do we need a port/containerPort in a Kuberntes deployment/container definition?
                            
                                How does one use Apache in a Docker Container and write nothing to disk (all logs to STDIO / STDERR)?
                            
                                Unable to connect to kubernetes python api - .kube/config file not found
                            
                                How to clone a private git repository into a kubernetes pod using ssh keys in secrets?
                            
                                Bind different Persistent Volume for each replica in a Kubernetes Deployment
                            
                                WaitForFirstConsumer PersistentVolumeClaim waiting for first consumer to be created before binding
                            
                                Ingress controller vs api gateway
                            
                                (Kubernetes + Minikube) can't get docker image from local registry
                            
                                Can kubectl describe show timestamp of pod events?
                            
                                Setting up AWS EKS - Don't know username and password for config
                            
                                field is immutable k8s
                            
                                How to stop Docker (and Kubernetes) using Docker desktop?
                            
                                How does the GKE metadata server work in Workload Identity
                            
                                How to get ssl on a kubernetes application?
                            
                                How to fix "Forbidden!Configured service account doesn't have access" with Spark on Kubernetes?
                            
                                Run Multiple Services on Port 80 in same Kubernetes Cluster on Google Container Engine
                            
                                Minikube volumes
                            
                                Kubernetes Cluster on AWS with Kops - NodePort Service Unavailable
                            
                                How to pass entire JSON string to Helm chart value?
                            
                                GKE Cluster can't pull (ErrImagePull) from GCR Registry in same project (GitLab Kubernetes Integration): Why?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With