 

Apache Spark Executors Dead - is this the expected behaviour?

I am running a pipeline to process my data on Spark. My executors seem to die every now and then as they get close to the Storage Memory limit. The job continues and eventually finishes, but is this normal behaviour? Is there something I should be doing to prevent it? Every time this happens the job hangs for a while until (and I am guessing here) YARN provides some new executors for the job to continue.

[Screenshot: Spark UI, Executors tab]

Asked Apr 04 '19 by Augusto


People also ask

How does an Apache Spark executor work?

Executors provide in-memory storage for Spark RDDs, which user programs cache through the block manager. They run for the complete lifespan of an application, which is why executor allocation is described as static. An executor is started for an application and launches tasks whenever it receives a corresponding event.
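To make the storage role concrete, here is a minimal PySpark sketch (the app name and data are placeholders, not from the question) that caches an RDD so its partitions are held in executor storage memory:

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("cache-demo").getOrCreate()

rdd = spark.sparkContext.parallelize(range(1_000_000))
rdd.cache()          # asks each executor's block manager to keep the partitions
print(rdd.count())   # first action computes and stores the partitions
print(rdd.count())   # second action is served from executor storage memory
spark.stop()

The cached partitions are what show up under "Storage Memory" in the Executors tab of the Spark UI.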

What causes a FetchFailedException in Spark?

Troubleshooting hundreds of Spark jobs has shown that a FetchFailedException mainly comes from the following causes:

1. Out of heap memory on executors
2. Low memory overhead on executors
3. A shuffle block greater than 2 GB
4. Network timeout
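The first two causes are usually addressed by giving executors more heap and more off-heap headroom. A hedged sketch, assuming the settings are applied when the session is first created (the sizes are placeholders, not recommendations):

from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("fetch-failed-tuning")                 # hypothetical app name
    .config("spark.executor.memory", "8g")          # executor heap (cause 1)
    .config("spark.executor.memoryOverhead", "2g")  # off-heap headroom (cause 2)
    .config("spark.network.timeout", "600s")        # relax the network timeout (cause 4)
    .getOrCreate()
)

The same keys can equally be passed as --conf options to spark-submit.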

Is it possible to have multiple Spark executors in a cluster?

It is possible to have as many Spark executors as there are data nodes, and each executor can use as many cores as the cluster manager can provide. An executor is described by its id, hostname, environment (as SparkEnv), and classpath.
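As an illustration, a PySpark sketch that requests a specific executor count and cores per executor on YARN (the numbers are hypothetical):

from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("executor-sizing")                  # hypothetical app name
    .config("spark.executor.instances", "10")    # e.g. one executor per data node
    .config("spark.executor.cores", "4")         # cores per executor
    .getOrCreate()
)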

When is an executor created in Spark?

An executor is created under a few conditions:

- When CoarseGrainedExecutorBackend receives a RegisteredExecutor message (Spark Standalone and YARN only)
- When Mesos's MesosExecutorBackend is registered with Spark
- When a LocalEndpoint is created for local mode

The local-mode case from the list above is the easiest to observe from user code; a minimal sketch (app name made up) follows.
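from pyspark.sql import SparkSession

# master("local[2]") makes Spark create its single local executor via LocalEndpoint.
spark = SparkSession.builder.master("local[2]").appName("local-executor").getOrCreate()
print(spark.sparkContext.master)   # -> local[2]
spark.stop()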


1 Answer

I think this turned out to be related to a YARN bug. It doesn't happen anymore after I set the following YARN options, as suggested in section 4 of this blog post:

Best practice 5: Always set the virtual and physical memory check flag to false.

"yarn.nodemanager.vmem-check-enabled":"false",

"yarn.nodemanager.pmem-check-enabled":"false"

Answered Oct 10 '22 by Augusto