I'm running Spark 2 and am trying to shuffle around 5 terabytes of JSON. I'm running into very long garbage collection pauses during shuffling of a Dataset:
val operations = spark.read.json(inPath).as[MyClass]
operations.repartition(partitions, operations("id")).write.parquet("s3a://foo")
Are there any obvious configuration tweaks to deal with this issue? My configuration is as follows:
spark.driver.maxResultSize 6G
spark.driver.memory 10G
spark.executor.extraJavaOptions -XX:+UseG1GC -XX:MaxPermSize=1G -XX:+HeapDumpOnOutOfMemoryError
spark.executor.memory 32G
spark.hadoop.fs.s3a.buffer.dir /raid0/spark
spark.hadoop.fs.s3n.buffer.dir /raid0/spark
spark.hadoop.fs.s3n.multipart.uploads.enabled true
spark.hadoop.parquet.block.size 2147483648
spark.hadoop.parquet.enable.summary-metadata false
spark.local.dir /raid0/spark
spark.memory.fraction 0.8
spark.mesos.coarse true
spark.mesos.constraints priority:1
spark.mesos.executor.memoryOverhead 16000
spark.network.timeout 600
spark.rpc.message.maxSize 1000
spark.speculation false
spark.sql.parquet.mergeSchema false
spark.sql.planner.externalSort true
spark.submit.deployMode client
spark.task.cpus 1
The shuffle is Spark's mechanism for re-distributing data so that it's grouped differently across partitions. This typically involves copying data across executors and machines, making the shuffle a complex and costly operation. Stages, tasks, and shuffle writes and reads are concrete concepts that can be monitored from the Spark UI.
Parallelising the shuffle effectively is important for good Spark job performance. A shuffle produces a new set of partitions, and the number of partitions after the shuffle can differ from the number of partitions in the original DataFrame.
During a shuffle, data is written to disk and transferred across the network. To limit the cost, reduce the number of shuffle operations or reduce the amount of data being shuffled. By default, Spark's shuffle uses hash partitioning to determine which machine each key-value pair is sent to.
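As a minimal sketch of those two knobs, assuming a hypothetical events Dataset with an id column (the dataset name, paths, and partition count below are placeholders, not from the question):

// Number of partitions Spark SQL creates for joins and aggregations.
spark.conf.set("spark.sql.shuffle.partitions", "2000")

val events = spark.read.json("s3a://bucket/events")       // placeholder input path
// Hash-partition by id so all rows with the same id land in the same partition.
val byId = events.repartition(2000, events("id"))
byId.write.parquet("s3a://bucket/events-by-id")           // placeholder output path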
A high garbage collection rate also increases GC pause times. Optimizing the application to create fewer objects is the most effective strategy for reducing long GC pauses. This can be a time-consuming exercise, but it is well worth doing.
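As one illustrative sketch of "creating fewer objects": staying in the DataFrame/Dataset API keeps rows in Spark's compact binary format instead of allocating a JVM object per record, which reduces GC pressure compared with object-heavy RDD code (this assumes MyClass has an id field, as the repartition in the question suggests):

// Object-heavy: builds per-record tuples and per-key Scala collections on the heap.
// val counts = operations.rdd.map(op => (op.id, 1L)).groupByKey().mapValues(_.size)

// GC-friendlier: the same aggregation expressed on the Dataset, which runs on
// Spark's internal binary rows rather than boxed JVM objects.
val counts = operations.groupBy("id").count()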
Adding the following flags got rid of the GC pauses.
spark.executor.extraJavaOptions -XX:+UseG1GC -XX:InitiatingHeapOccupancyPercent=35 -XX:ConcGCThreads=12
I think it does take a fair amount of tweaking, though. This Databricks post was very helpful.
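For reference, one way to apply such options programmatically rather than in spark-defaults.conf is via SparkConf before the session is created. This is only a sketch; the option values are the ones from the answer above:

import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

// Executor JVM options must be in place before executors launch, so set them
// at submit time or on the SparkConf used to build the session.
val conf = new SparkConf()
  .set("spark.executor.extraJavaOptions",
       "-XX:+UseG1GC -XX:InitiatingHeapOccupancyPercent=35 -XX:ConcGCThreads=12")

val spark = SparkSession.builder().config(conf).getOrCreate()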