I'm getting confused about spill to disk and shuffle write. Using the default sort shuffle manager, we use an AppendOnlyMap for aggregating and combining partition records, right? Then when execution memory fills up, we start sorting the map, spilling it to disk and then cleaning up the map for the next spill (if one occurs). My questions are:
What is the difference between spill to disk and shuffle write? Both basically consist of creating files on the local file system and writing records.
Admittedly they are different: spill records are sorted because they pass through the map, whereas shuffle write records are not, because they don't pass through the map.
Thanks.
Giorgio
Spark gathers the required data from each partition and combines it into a new partition. During a shuffle, data is written to disk and transferred across the network.
Shuffling means the reallocation of data between multiple Spark stages. "Shuffle Write" is the sum of all written serialized data on all executors before transmitting (normally at the end of a stage) and "Shuffle Read" means the sum of read serialized data on all executors at the beginning of a stage.
Spark's operators spill data to disk if it does not fit in memory, allowing it to run well on any sized data. Likewise, cached datasets that do not fit in memory are either spilled to disk or recomputed on the fly when needed, as determined by the RDD's storage level.
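To make the "reallocation of data between stages" concrete, here is a minimal sketch (plain Python, not Spark's actual code) of how a shuffle decides which target partition each record goes to; Spark's HashPartitioner does essentially the same key-hash-modulo-partitions computation:

```python
# Sketch: hash-partitioning records, the way a shuffle routes each key
# to a target partition (and thus to a target executor).
def partition_for(key, num_partitions):
    # same idea as Spark's HashPartitioner: hash(key) mod numPartitions
    return hash(key) % num_partitions

records = [("a", 1), ("b", 2), ("a", 3), ("c", 4)]
num_partitions = 2

shuffled = {p: [] for p in range(num_partitions)}
for key, value in records:
    shuffled[partition_for(key, num_partitions)].append((key, value))

# All records with the same key land in the same target partition,
# which is what lets groupByKey / reduceByKey see them together.
```

The point of the sketch: the shuffle is about *where* records end up, independent of whether any one machine later runs out of memory.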
spill to disk and shuffle write are two different things.
spill to disk - data moves from host RAM to host disk. It is used when there is not enough RAM on your machine, and part of the RAM contents is placed on disk.
http://spark.apache.org/faq.html
Does my data need to fit in memory to use Spark?
No. Spark's operators spill data to disk if it does not fit in memory, allowing it to run well on any sized data. Likewise, cached datasets that do not fit in memory are either spilled to disk or recomputed on the fly when needed, as determined by the RDD's storage level.
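The spill mechanism the FAQ describes can be sketched like this (an illustrative Python model, not Spark's internals; the threshold and file format are made up for the example): aggregate in memory, and when the buffer grows past a limit, sort it, write it out as a spill file, and clear it for the next batch.

```python
# Sketch of "spill to disk": in-memory aggregation with a size cap.
import os
import tempfile

SPILL_THRESHOLD = 3          # hypothetical: max distinct keys before spilling
buffer = {}                  # in-memory aggregation map
spill_files = []

def spill():
    # sort by key so spill files can later be merged efficiently
    fd, path = tempfile.mkstemp(suffix=".spill")
    with os.fdopen(fd, "w") as f:
        for key in sorted(buffer):
            f.write(f"{key}\t{buffer[key]}\n")
    spill_files.append(path)
    buffer.clear()           # free memory for the next batch

def insert(key, value):
    buffer[key] = buffer.get(key, 0) + value
    if len(buffer) >= SPILL_THRESHOLD:
        spill()

for k, v in [("a", 1), ("b", 2), ("c", 3), ("a", 4)]:
    insert(k, v)
# one sorted spill file was written; ("a", 4) is still in memory
```

Note that the spill file is sorted on the way out, which matches the asker's observation that spilled records are sorted while plain shuffle-write records need not be.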
shuffle write - data moves from executor(s) to other executor(s). It is used when data needs to move between executors (e.g. due to a JOIN, groupBy, etc.).
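As an analogy (not Spark's actual file format), "shuffle write" on one executor means serializing its output records, grouped by target partition, to local disk so other executors can fetch them over the network; the byte total is what the UI reports as "Shuffle Write" for that task:

```python
# Sketch: one map task's shuffle write as per-partition serialized blocks.
import pickle

def shuffle_write(records, num_partitions):
    """Group records by target partition, serialize each group, and
    report the total bytes written (the 'Shuffle Write' metric)."""
    blocks = {p: [] for p in range(num_partitions)}
    for key, value in records:
        blocks[hash(key) % num_partitions].append((key, value))
    serialized = {p: pickle.dumps(rows) for p, rows in blocks.items()}
    shuffle_write_bytes = sum(len(b) for b in serialized.values())
    return serialized, shuffle_write_bytes

serialized, written = shuffle_write([("x", 1), ("y", 2)], num_partitions=2)
```

Unlike the spill sketch above, nothing here depends on memory pressure: shuffle write happens because data must cross executor boundaries, full stop.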
More details can be found here:
An edge case example which might help clear this issue up:
Assuming that the data holds one key, performing groupByKey will bring all the data into one partition. The shuffle size will be 9*128MB (9 executors will transfer their data to the last executor), and there won't be any spill to disk, as the executor has 100GB of RAM and only ~1GB of data.
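A quick back-of-envelope check of the numbers in that example (assuming 10 executors, each holding 128MB of records with the same key):

```python
# Back-of-envelope: shuffle size vs. spill in the single-key example.
MB = 1024 * 1024
executors = 10
data_per_executor = 128 * MB
executor_ram = 100 * 1024 * MB   # 100 GB

# 9 executors ship their partitions to the one that owns the key
shuffle_size = (executors - 1) * data_per_executor   # 9 * 128 MB
data_on_target = executors * data_per_executor       # ~1.25 GB total

spills = data_on_target > executor_ram   # all of it fits in RAM
print(shuffle_size // MB, "MB shuffled, spill needed:", spills)
# → 1152 MB shuffled, spill needed: False
```

So a large shuffle write and zero spill can happen in the same job, which is exactly the distinction the question is about.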
Regarding AppendOnlyMap:
As written in the AppendOnlyMap code - this class is a low-level implementation of a simple open hash table optimized for the append-only use case, where keys are never removed, but the value for each key may be changed.
The fact that two different modules use the same low-level class doesn't mean that those modules are related at a high level.
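The idea behind AppendOnlyMap can be sketched as follows (a simplified Python model, not the Scala implementation; the class name and fixed capacity are made up for illustration): an open-addressing hash table where keys are never removed, so probing needs no tombstones, and values are updated in place via a merge function.

```python
# Sketch: an append-only open hash table with in-place value updates.
class AppendOnlyMapSketch:
    def __init__(self, capacity=8):
        self.keys = [None] * capacity
        self.values = [None] * capacity
        self.size = 0

    def _slot(self, key):
        # linear probing: scan from the hash slot until we find the key
        # or an empty slot (safe because keys are never removed)
        i = hash(key) % len(self.keys)
        while self.keys[i] is not None and self.keys[i] != key:
            i = (i + 1) % len(self.keys)
        return i

    def change_value(self, key, update):
        """Set the value for key to update(old_value) - this mirrors how
        aggregation merges a new record into the existing combiner."""
        i = self._slot(key)
        if self.keys[i] is None:
            self.keys[i] = key
            self.size += 1
        self.values[i] = update(self.values[i])

m = AppendOnlyMapSketch()
for k, v in [("a", 1), ("b", 2), ("a", 3)]:
    m.change_value(k, lambda old, v=v: (old or 0) + v)
```

The update-function interface is what makes it handy for combining partition records: each incoming record folds into the current per-key value without ever deleting entries.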