
What happens when an executor is lost?

Tags: apache-spark

I get these messages:

16/05/22 13:33:53 ERROR YarnScheduler: Lost executor 61 on <host>: Executor heartbeat timed out after 134828 ms
16/05/22 13:33:53 WARN TaskSetManager: Lost task 25.0 in stage 12.0 (TID 2214, <host>): ExecutorLostFailure (executor 61 lost)

Will a replacement executor be spawned?

sds asked May 22 '16 17:05



1 Answer

Will a replacement executor be spawned?

Yes, it will. Spark's DAGScheduler and its lower-level cluster manager implementation (Standalone, YARN or Mesos) will notice that a task failed and will take care of rescheduling that task as part of the overall execution of the stage.
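To make the rescheduling idea concrete, here is a minimal sketch in plain Python. It is a toy model, not Spark's scheduler code: the function `run_stage`, its round-robin assignment, and the `MAX_TASK_FAILURES` cap (loosely modeled on the `spark.task.maxFailures` setting) are all assumptions made for illustration.

```python
from collections import deque

# Assumption: a retry cap loosely modeled on spark.task.maxFailures.
MAX_TASK_FAILURES = 4


def run_stage(task_ids, executors, lost_executors):
    """Toy model: assign tasks round-robin and requeue tasks from lost executors.

    task_ids: task ids that make up the stage
    executors: executor ids available when the stage starts
    lost_executors: executor ids that die when they are handed a task
    Returns a dict mapping task id -> (executor that finished it, attempt number).
    """
    alive = list(executors)
    pending = deque((task_id, 1) for task_id in task_ids)
    finished = {}
    slot = 0
    while pending:
        if not alive:
            raise RuntimeError("no executors left to run pending tasks")
        task_id, attempt = pending.popleft()
        executor = alive[slot % len(alive)]
        slot += 1
        if executor in lost_executors:
            # The executor is gone: forget it and put the task back in the queue.
            alive.remove(executor)
            if attempt >= MAX_TASK_FAILURES:
                raise RuntimeError(f"task {task_id} failed {attempt} times")
            pending.append((task_id, attempt + 1))
        else:
            finished[task_id] = (executor, attempt)
    return finished


if __name__ == "__main__":
    # Executor "e2" is lost, so its task is retried on "e1".
    print(run_stage([0, 1, 2, 3], ["e1", "e2"], {"e2"}))
```

In this sketch the task that was handed to the lost executor finishes on a surviving executor on its second attempt, which is the behaviour the `Lost task ... ExecutorLostFailure` warning in the question leads to: the task is lost, not the job.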

DAGScheduler

DAGScheduler does three things in Spark (thorough explanations follow):

  • Computes an execution DAG, i.e. DAG of stages, for a job.
  • Determines the preferred locations to run each task on.
  • Handles failures due to shuffle output files being lost.
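The first and third responsibilities above can be illustrated with a small, self-contained Python sketch. It is a deliberate simplification rather than Spark's implementation: the helper names `split_into_stages` and `stages_to_rerun`, the linear lineage, and the rule "every shuffle operation starts a new stage" are assumptions chosen for illustration.

```python
def split_into_stages(lineage):
    """Group a linear chain of (name, needs_shuffle) operations into stages.

    Simplification: a new stage starts at every operation that needs a shuffle.
    """
    stages = []
    current = []
    for name, needs_shuffle in lineage:
        if needs_shuffle and current:
            # Shuffle boundary: close the current stage and start a new one.
            stages.append(current)
            current = []
        current.append(name)
    if current:
        stages.append(current)
    return stages


def stages_to_rerun(lost_output_stage, failed_stage):
    """Indices of stages to resubmit when a stage cannot fetch shuffle output.

    The stage whose output was lost must run again before the failed stage
    (and anything in between) can be retried.
    """
    return list(range(lost_output_stage, failed_stage + 1))


if __name__ == "__main__":
    lineage = [
        ("textFile", False),
        ("map", False),
        ("reduceByKey", True),
        ("filter", False),
        ("sortByKey", True),
    ]
    print(split_into_stages(lineage))
    print(stages_to_rerun(lost_output_stage=1, failed_stage=2))
```

If the executor holding the shuffle files of stage 1 is lost while stage 2 is reading them, this model resubmits stage 1 and then stage 2, instead of failing the whole job.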

You can find more on this in the Advanced Spark Tutorial and in Mastering Apache Spark.

Yuval Itzchakov answered Oct 12 '22 12:10