Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Spark UI Output Op Duration vs Job Duration: What's the difference?

On Spark UI page, what is the difference of column: "Output Op Duration" and "Job Duration"?

enter image description here

like image 650
Mekal Avatar asked Oct 18 '22 04:10

Mekal


1 Answers

From Sparks mailing list:

"It means the total time to run a batch, including the Spark job duration + time spent on the driver. E.g.,

foreachRDD { rdd =>
   rdd.count() // say this takes 1 second.
   Thread.sleep(10000) // sleep 10 seconds
}

In the above example, the Spark job duration is 1 seconds and the output op duration is 11 seconds."

like image 192
its_a_paddo Avatar answered Oct 21 '22 22:10

its_a_paddo