I'm having trouble understanding round-robin partitioning in Spark. Consider the following example: I split a Seq of size 3 into 3 partitions:
val df = Seq(0,1,2).toDF().repartition(3)
df.explain
== Physical Plan ==
Exchange RoundRobinPartitioning(3)
+- LocalTableScan [value#42]
Now if I inspect the partitions, I get:
df
.rdd
.mapPartitionsWithIndex{case (i,rows) => Iterator((i,rows.size))}
.toDF("partition_index","number_of_records")
.show
+---------------+-----------------+
|partition_index|number_of_records|
+---------------+-----------------+
| 0| 0|
| 1| 2|
| 2| 1|
+---------------+-----------------+
If I do the same with a Seq of size 8 and split it into 8 partitions, I get even worse skew:
(0 to 7).toDF().repartition(8)
.rdd
.mapPartitionsWithIndex{case (i,rows) => Iterator((i,rows.size))}
.toDF("partition_index","number_of_records")
.show
+---------------+-----------------+
|partition_index|number_of_records|
+---------------+-----------------+
| 0| 0|
| 1| 0|
| 2| 0|
| 3| 0|
| 4| 0|
| 5| 0|
| 6| 4|
| 7| 4|
+---------------+-----------------+
Can somebody explain this behavior? As far as I understand round-robin partitioning, all partitions should be roughly the same size.
(Checked for Spark version 2.1-2.4)
As far as I can see from the ShuffleExchangeExec code, Spark partitions the rows directly from the original partitions (via mapPartitions) without bringing anything to the driver.
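To see how many source partitions feed the exchange (each one gets its own round-robin rotation), you can inspect the DataFrame before the repartition. This is just a quick check; the actual count depends on your master setting, spark.default.parallelism and the number of rows:
Seq(0, 1, 2).toDF().rdd.getNumPartitions   // number of source partitions for the 3-row example
(0 to 7).toDF().rdd.getNumPartitions       // number of source partitions for the 8-row example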
The logic is to start with a randomly picked target partition and then assign target partitions to the rows in a round-robin fashion. Note that the "start" partition is picked separately for each source partition, so there can be collisions.
The final distribution depends on many factors: the number of source/target partitions and the number of rows in your DataFrame.
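To make the collision effect concrete, here is a minimal sketch in plain Scala (not Spark's actual implementation; RoundRobinSketch and simulate are made-up names for illustration): each source partition picks an independent random start and then hands out target partitions in rotation, so with only a few rows several source partitions can land on the same targets.
import scala.util.Random

object RoundRobinSketch {
  // rowsPerSource: number of rows in each source partition
  // numTargets: number of target partitions (the argument of repartition)
  // returns: number of rows that end up in each target partition
  def simulate(rowsPerSource: Seq[Int], numTargets: Int): Map[Int, Int] = {
    val targets = rowsPerSource.flatMap { n =>
      val start = Random.nextInt(numTargets)              // independent start per source partition
      (0 until n).map(i => (start + 1 + i) % numTargets)  // rotate from there
    }
    targets.groupBy(identity).map { case (p, ps) => (p, ps.size) }
  }

  def main(args: Array[String]): Unit = {
    // e.g. 8 source partitions with 1 row each, repartitioned into 8 target partitions:
    println(simulate(Seq.fill(8)(1), numTargets = 8))  // often skewed, since starts may collide
  }
}
Run it a few times and the counts differ from run to run; the skew observed in the question comes from exactly this per-source-partition random start.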