
Peak Execution Memory in Spark

In the Apache Spark UI, one of the stage-level metrics is "Peak Execution Memory". What does this metric indicate?

mandar asked Sep 15 '16 05:09



1 Answer

Peak Execution Memory refers to the memory used by internal data structures created during shuffles, aggregations, and joins. The value of this accumulator should be approximately the sum of the peak sizes across all such data structures created in a task. For SQL jobs, it tracks only unsafe operators, broadcast joins, and external sort.
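To make the "sum of the peak sizes" wording concrete, here is a minimal sketch in plain Python. It does not call Spark at all: the function `task_peak_execution_memory`, the structure names, and the byte counts are all hypothetical, chosen only to illustrate how such a value could be computed for a single task.

```python
from typing import Dict, List, Tuple


def task_peak_execution_memory(samples: List[Tuple[str, int]]) -> int:
    """Sum the peak size of each data structure observed in one task.

    `samples` is a time-ordered list of (structure_name, bytes_in_use)
    observations. This is a toy model, not Spark's implementation.
    """
    peaks: Dict[str, int] = {}
    for name, size in samples:
        # Remember the largest size ever seen for this structure.
        peaks[name] = max(peaks.get(name, 0), size)
    # The metric is modeled as the sum of those per-structure peaks.
    return sum(peaks.values())


# Hypothetical observations for one task (names and sizes are made up).
samples = [
    ("shuffle_sorter", 10_000_000),
    ("aggregation_hash_map", 4_000_000),
    ("shuffle_sorter", 32_000_000),
    ("aggregation_hash_map", 16_000_000),
    ("shuffle_sorter", 8_000_000),
]

print(task_peak_execution_memory(samples))  # 48000000
```

Because each structure's peak is taken independently, the sum can be larger than the memory actually held at any single instant, which is one way to read the word "approximately" in the description above.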

For more detail, see "Tracking Spark's memory usage".

Hokam answered Sep 28 '22 03:09
