We are running a Spark Streaming application on a Kubernetes cluster with Spark 2.4.5. The application receives massive amounts of data through a Kafka topic (one message every 3 ms). 4 executors and 4 Kafka partitions are being used.
While running, the memory of the driver pod keeps increasing until it is killed by K8s with an 'OOMKilled' status. The executors' memory is not facing any issues.
When checking the driver pod resources using this command:
kubectl top pod podName
We can see that the memory increases until it reaches 1.4 GB, and then the pod is killed.
However, when checking the storage memory of the driver in the Spark UI, we can see that the storage memory is not fully used (50.3 KB / 434 MB). Is there any difference between the storage memory of the driver and the memory of the pod containing the driver?
Has anyone had experience with a similar issue before?
Any help would be appreciated.
Here are a few more details about the app:
The 'OOMKilled: Limit Overcommit' error can occur when the sum of pod memory limits is greater than the memory available on the node.
Determine the memory resources available for the Spark application: multiply the cluster RAM size by the utilization percentage of the resource manager (YARN in the original guideline; the same arithmetic applies on Kubernetes). For example, this might provide 5 GB of RAM for the driver and 50 GB of RAM for the worker nodes.
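As a rough illustration of that sizing arithmetic, here is a minimal sketch; the 64 GB node size and 85% utilization figure are assumptions for the example, not values taken from the question:

```scala
// Rough sizing sketch; node size and utilization are assumed example values.
object ClusterSizing {
  def main(args: Array[String]): Unit = {
    val nodeRamGb   = 64.0                     // assumed total RAM on the node
    val utilization = 0.85                     // assumed fraction usable by Spark
    val usableGb    = nodeRamGb * utilization  // ~54 GB usable
    val driverGb    = 5.0                      // reserved for the driver pod
    val executorsGb = usableGb - driverGb      // ~49 GB left for executor pods
    println(f"Usable: $usableGb%.0f GB, driver: $driverGb%.0f GB, executors: $executorsGb%.0f GB")
  }
}
```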
In brief, the Spark memory consists of three parts:

- Reserved memory (300 MB)
- User memory ((heap - 300 MB) * (1 - spark.memory.fraction)), used for user data structures and internal metadata
- Spark memory ((heap - 300 MB) * spark.memory.fraction), used for cache and shuffle in Spark

Besides this, there is also max(executor memory * 0.1, 384 MB) (0.1 is spark.kubernetes.memoryOverheadFactor) of extra memory used by non-JVM allocations in K8s.
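To make the breakdown concrete, here is a minimal sketch that plugs in hypothetical numbers; the 1 GB heap and the default fraction/overhead values are assumptions, not figures from the question:

```scala
// Minimal sketch of the memory breakdown above; heap size and the default
// fraction/overhead values are assumptions for illustration.
object SparkMemoryBreakdown {
  def main(args: Array[String]): Unit = {
    val heapMb         = 1024L // e.g. spark.driver.memory=1g (assumed)
    val reservedMb     = 300L  // fixed reserved memory
    val memoryFraction = 0.6   // default spark.memory.fraction
    val overheadFactor = 0.1   // default spark.kubernetes.memoryOverheadFactor

    val usableMb   = heapMb - reservedMb
    val sparkMemMb = (usableMb * memoryFraction).toLong               // cache + shuffle
    val userMemMb  = usableMb - sparkMemMb                            // user data structures
    val overheadMb = math.max((heapMb * overheadFactor).toLong, 384L) // non-JVM overhead

    println(s"Spark memory : $sparkMemMb MB")
    println(s"User memory  : $userMemMb MB")
    println(s"Overhead     : $overheadMb MB")
    println(s"Pod memory   : about ${heapMb + overheadMb} MB (what kubectl top reports against)")
  }
}
```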
Raising the memory limit in K8s to the executor (or driver) memory plus the memory overhead should fix the OOM.
You can also decrease spark.memory.fraction to allocate more RAM to user memory.
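A hedged sketch of both suggestions follows; the property names are standard Spark 2.4 settings, but the values are placeholders to tune for your workload, and driver-side memory settings only take effect when supplied at spark-submit time (e.g. via --conf), not from inside an already-running driver:

```scala
// Example tuning sketch; the property names are real Spark 2.4 settings,
// the values are placeholders to adjust for your workload.
import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

object TunedApp {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      // Non-JVM overhead added on top of the heap when sizing the pods.
      // In practice pass these with --conf on spark-submit so the driver pod
      // itself is sized before its JVM starts.
      .set("spark.kubernetes.memoryOverheadFactor", "0.2")
      .set("spark.driver.memoryOverhead", "1g")
      // Shrink the unified (cache + shuffle) region to leave more user memory.
      .set("spark.memory.fraction", "0.5")

    val spark = SparkSession.builder().config(conf).getOrCreate()
    // ... streaming logic ...
    spark.stop()
  }
}
```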