I allocated resources to only one pod, with a 650 MB memory limit (about 30% of the node; together with the other built-in pods, the total memory limit comes to only 69%).
However, while the pod is handling a task, its own usage stays within 650 MB, but the overall usage of the node reaches 94%.
Why does this happen, when the upper limit is supposed to be 69%? Is it because the other built-in pods have no limit set? How can I prevent this, since my pod sometimes errors out when memory usage goes above 100%?
My allocation settings (from kubectl describe nodes):
Memory usage of the Kubernetes node and pods when idle (from kubectl top nodes and kubectl top pods):
Memory usage of the Kubernetes node and pods when running a task (from kubectl top nodes and kubectl top pods):
Further tested behaviour:
1. Prepare a deployment, pods, and a service under the namespace test-ns.
2. Since only kube-system and test-ns have pods, assign a default limit of 1000Mi to each of them (visible in kubectl describe nodes), aiming to stay below 2 GB in total.
3. Assuming the memory used in kube-system and test-ns stays below 2 GB, which is less than 100%, why can the memory usage reach 106%? (See the sketch after the .yaml file below for verifying the per-namespace totals.)
In the .yaml file:
apiVersion: v1
kind: LimitRange
metadata:
  name: default-mem-limit
  namespace: test-ns
spec:
  limits:
  - default:
      memory: 1000Mi
    type: Container
---
apiVersion: v1
kind: LimitRange
metadata:
  name: default-mem-limit
  namespace: kube-system
spec:
  limits:
  - default:
      memory: 1000Mi
    type: Container
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: devops-deployment
  namespace: test-ns
  labels:
    app: devops-pdf
spec:
  selector:
    matchLabels:
      app: devops-pdf
  replicas: 2
  template:
    metadata:
      labels:
        app: devops-pdf
    spec:
      containers:
      - name: devops-pdf
        image: dev.azurecr.io/devops-pdf:latest
        imagePullPolicy: Always
        ports:
        - containerPort: 3000
        resources:
          requests:
            cpu: 600m
            memory: 500Mi
          limits:
            cpu: 600m
            memory: 500Mi
      imagePullSecrets:
      - name: regcred
---
apiVersion: v1
kind: Service
metadata:
  name: devops-pdf
  namespace: test-ns
spec:
  type: LoadBalancer
  ports:
  - port: 8007
  selector:
    app: devops-pdf
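To verify the per-namespace totals behind step 3, one option is to sum the kubectl top output; a minimal sketch, assuming the metrics server is installed and memory is reported with an Mi suffix:
# Sum the reported memory usage (third column, e.g. "123Mi") of all pods in a namespace
kubectl top pods -n kube-system --no-headers | awk '{sum += $3} END {print sum " Mi"}'
kubectl top pods -n test-ns --no-headers | awk '{sum += $3} END {print sum " Mi"}'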
If "Allocatable" is used to a node total memory size, under high memory pressure or pre-reserved memory value is bigger, the "MEMORY%" can be bigger than 100%. For suppressing the confusing, add a option to show node real memory usage based on "Capacity". * Reference: kubernetes#86499
On the other hand, if the memory use was sudden or unexpected, it may indicate a memory leak and you should start debugging immediately. Remember that Kubernetes killing a pod like that is a good thing: it protects the other pods running on the same node.
Kubernetes uses memory requests to determine on which node to schedule the pod. For example, on a node with 8 GB of free RAM, Kubernetes will schedule ten pods with 800 MB memory requests, five pods with 1600 MB requests, one pod with an 8 GB request, and so on.
In Kubernetes, CPU is not assigned in percentages, but in thousandths of a core (also called millicores or millicpu). One CPU is equal to 1000 millicores. If you wish to assign a third of a CPU, you should assign 333m (millicores) to your container. Memory is a bit more straightforward: it is measured in bytes.
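As a small illustration of these units (the pod name and image below are placeholders, not taken from the question):
apiVersion: v1
kind: Pod
metadata:
  name: resource-units-demo    # hypothetical name
spec:
  containers:
  - name: demo
    image: nginx               # placeholder image
    resources:
      requests:
        cpu: 333m              # roughly one third of a CPU core
        memory: 800Mi          # memory is given in bytes; Ki/Mi/Gi suffixes are accepted
      limits:
        cpu: 333m
        memory: 800Mi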
This effect is most likely caused by the 4 pods that run on that node without a memory limit specified, shown as 0 (0%). Of course, 0 doesn't mean they can't use even a single byte of memory, since no program can start without using memory; rather, it means there is no limit and they can use as much as is available. Also, programs running outside of pods (ssh, cron, ...) are included in the total used figure, but are not limited by Kubernetes (by cgroups).
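One way to spot which pods have no memory limit (they show up as <none>); a sketch, using kube-system as the namespace:
# List each kube-system pod together with the memory limits of its containers
kubectl get pods -n kube-system \
  -o custom-columns=NAME:.metadata.name,MEMORY_LIMIT:.spec.containers[*].resources.limits.memory

# The "Allocated resources" table here shows the same 0 (0%) entries per node
kubectl describe node <node-name>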
Now Kubernetes sets up the kernel OOM adjustment values in a tricky way to favour containers that are under their memory request, making it more likely to kill processes in containers that are between their memory request and limit, and most likely to kill processes in containers with no memory limits. However, this only works out fairly in the long run, and sometimes the kernel can kill your favourite process in your favourite container even when it is behaving well (using less than its memory request). See https://kubernetes.io/docs/tasks/administer-cluster/out-of-resource/#node-oom-behavior
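The OOM scoring follows the pod's QoS class (Guaranteed, Burstable, or BestEffort), which you can inspect directly; the pod name below is a placeholder:
# Guaranteed pods (requests == limits for every container) are the last to be OOM-killed;
# BestEffort pods (no requests or limits at all) are the first.
kubectl get pod <pod-name> -n test-ns -o jsonpath='{.status.qosClass}{"\n"}'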
The pods without a memory limit in this particular case come from the AKS system itself, so setting their memory limit in the pod templates is not an option, as there is a reconciler that will restore it (eventually). To remedy the situation, I suggest that you create a LimitRange object in the kube-system namespace that will assign a memory limit to all pods created without one:
apiVersion: v1
kind: LimitRange
metadata:
  name: default-mem-limit
  namespace: kube-system
spec:
  limits:
  - default:
      memory: 150Mi
    type: Container
(You will need to delete the already existing pods without a memory limit for this to take effect; they will be recreated.)
This will not completely eliminate the problem, as you might still end up with an overcommitted node; however, the memory usage will make sense and the OOM events will be more predictable.
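A sketch of how this might be applied and checked (the manifest file name is a placeholder; deleted kube-system pods are recreated by their controllers):
# Create the LimitRange in kube-system
kubectl apply -f limitrange-kube-system.yaml

# Confirm the default memory limit is registered
kubectl describe limitrange default-mem-limit -n kube-system

# Delete an existing pod that has no memory limit so its replacement picks up the default
kubectl delete pod <pod-name> -n kube-system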