Kubernetes OOM pod killed because kernel memory grows to much

Tags:

I am working on a java service that basically creates files in a network file system to store data. It runs in a k8s cluster in a Ubuntu 18.04 LTS. When we began to limit the memory in kubernetes (limits: memory: 3Gi), the pods began to be OOMKilled by kubernetes.

At the beginning we thought it was a leak of memory in the java process, but analyzing more deeply we noticed that the problem is the memory of the kernel. We validated that looking at the file /sys/fs/cgroup/memory/memory.kmem.usage_in_bytes

We isolated the case to only create files (without java) with the DD command like this:

for i in {1..50000}; do dd if=/dev/urandom bs=4096 count=1 of=file$i; done

And with the dd command we saw that the same thing happened ( the kernel memory grew until OOM). After k8s restarted the pod, I got doing a describe pod:

Last State:Terminated
Reason: OOMKilled
Exit Code: 143

Creating files cause the kernel memory grows, deleting those files cause the memory decreases . But our services store data , so it creates a lot of files continuously, until the pod is killed and restarted because OOMKilled.

We tested limiting the kernel memory using a stand alone docker with the --kernel-memory parameter and it worked as expected. The kernel memory grew to the limit and did not rise anymore. But we did not find any way to do that in a kubernetes cluster. Is there a way to limit the kernel memory in a K8S environment ? Why the creation of files causes the kernel memory grows and it is not released ?

446

asked Dec 12 '18 22:12

Pablo Hadziatanasiu

1 Answers

Thanks for all this info, it was very useful!

On my app, I solved this by creating a new side container that runs a cron job, every 5 minutes with the following command:

echo 3 > /proc/sys/vm/drop_caches

(note that you need the side container to run in privileged mode)

It works nicely and has the advantage of being predictable: every 5 minutes, your memory cache will be cleared.

answered Sep 24 '22 14:09

Cyrille99

Related questions
                            
                                Elasticsearch docker image with data persistence
                            
                                Docker: How to extract the Docker image into local system
                            
                                'exec user process caused: exec format error' in AWS Fargate Service
                            
                                pull access denied for container-registry.oracle.com/database/enterprise
                            
                                ERROR: readlink /var/lib/docker/overlay2: invalid argument
                            
                                Docker has the same error regardless of what I try to build (windows 10)
                            
                                React Typescript: add location state to react router component
                            
                                on building docker image level=error msg="Can't close tar writer: io: read/write on closed pipe"
                            
                                Permission denied to Docker daemon socket at unix:///var/run/docker.sock
                            
                                How to run a windows docker container on linux host?
                            
                                Dockerfile - Defining an ENV variable with a dynamic value
                            
                                Unable to resolve domain names inside docker container
                            
                                from .cv2 import * ImportError: libgthread-2.0.so.0: cannot open shared object file: No such file or directory [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Kubernetes OOM pod killed because kernel memory grows to much

Tags:

docker

linux-kernel

out-of-memory

kubernetes

Pablo Hadziatanasiu

People also ask

1 Answers

Cyrille99

Recent Activity

Donate For Us