spark.storage.memoryFraction setting in Apache Spark

According to the Spark documentation:

spark.storage.memoryFraction: Fraction of Java heap to use for Spark's memory cache. This should not be larger than the "old" generation of objects in the JVM, which by default is given 0.6 of the heap, but you can increase it if you configure your own old generation size.

I found several blogs and articles where it is suggested to set it to zero in YARN mode. Why is that better than setting it to something close to 1? And in general, what is a reasonable value for it?

Bob asked Dec 29 '15


People also ask

Which is the default storage level in Spark?

For RDDs, the default storage level used by cache()/persist() is MEMORY_ONLY. Separately, Spark by default creates one partition for each block of the file (blocks being 128 MB by default in HDFS), but you can also ask for a higher number of partitions by passing a larger value.
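A minimal Scala sketch (the HDFS path and the partition count are just placeholders) showing both points: cache() falls back to the default MEMORY_ONLY level for RDDs, and textFile() accepts a minimum number of partitions if you want more than one per block:

```scala
import org.apache.spark.{SparkConf, SparkContext}

val sc = new SparkContext(new SparkConf().setAppName("partitions-example"))

// Default: one partition per HDFS block (128 MB blocks by default).
// The second argument asks Spark for at least this many partitions.
val lines = sc.textFile("hdfs:///data/input.txt", 100)

// cache() is shorthand for persist(StorageLevel.MEMORY_ONLY),
// the default storage level for RDDs.
lines.cache()

println(lines.getNumPartitions)
```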

How do I set Spark memory?

To enlarge the Spark shuffle service memory, modify SPARK_DAEMON_MEMORY in $SPARK_HOME/conf/spark-env.sh (the default value is 2g), then restart the shuffle service for the change to take effect.
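That setting sizes the shuffle service daemon itself; the memory of your own application's executors is normally set on the SparkConf (or via spark-submit flags). A minimal Scala sketch with purely illustrative values:

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Illustrative value only. Driver memory is usually passed to spark-submit
// (--driver-memory) instead, because the driver JVM is already running by
// the time application code executes.
val conf = new SparkConf()
  .setAppName("memory-example")
  .set("spark.executor.memory", "8g") // heap size of each executor JVM

val sc = new SparkContext(conf)
```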

How do I allocate executors memory in Spark?

A common rule-of-thumb calculation (assuming 10 nodes, 150 usable cores in total, 5 cores per executor, and 64 GB of memory per node):

  1. Number of available executors = total cores / cores per executor = 150 / 5 = 30
  2. Leaving 1 executor for the YARN ApplicationMaster => --num-executors = 29
  3. Number of executors per node = 30 / 10 = 3
  4. Memory per executor = 64 GB / 3 ≈ 21 GB
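Applied through SparkConf, those figures would translate into something like the following sketch (property names are the standard ones for YARN; the numbers come straight from the calculation above):

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Figures from the rule-of-thumb calculation above.
val conf = new SparkConf()
  .setAppName("executor-sizing-example")
  .set("spark.executor.instances", "29") // 30 minus 1 for the ApplicationMaster
  .set("spark.executor.cores", "5")      // cores per executor
  .set("spark.executor.memory", "21g")   // per-executor heap

val sc = new SparkContext(conf)
```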

How do I reduce the GC time on my Spark?

Decrease the InitiatingHeapOccupancyPercent value (the default is 45) so that G1 GC starts the initial concurrent marking earlier, giving a better chance of avoiding full GCs. Increase the ConcGCThreads value to give concurrent marking more threads and so speed up the concurrent marking phase.
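One way to pass those G1 flags to the executors is through spark.executor.extraJavaOptions; a hedged sketch, where the specific numbers are examples to tune against your own GC logs rather than recommendations:

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Example G1 settings; tune the values against your own GC logs.
val conf = new SparkConf()
  .setAppName("gc-tuning-example")
  .set("spark.executor.extraJavaOptions",
       "-XX:+UseG1GC -XX:InitiatingHeapOccupancyPercent=35 -XX:ConcGCThreads=8")

val sc = new SparkContext(conf)
```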


1 Answer

The Spark executor's memory is divided into three regions:

  1. Storage - Memory reserved for caching
  2. Execution - Memory reserved for object creation
  3. Executor overhead.

In Spark 1.5.2 and earlier:

spark.storage.memoryFraction sets how the executor memory is split between regions 1 and 2. The default value is 0.6, so 60% of the allocated executor memory is reserved for caching. In my experience, I've only ever seen this number reduced: typically, when a developer runs into GC issues, the application has a high "churn" of objects, and one of the first places to optimize is to lower the memoryFraction.

If your application does not cache any data, then setting it to 0 is a sensible thing to do. I'm not sure why that would be specific to YARN, though; can you post the articles?
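On Spark 1.5.x and earlier, that would look something like this minimal sketch (setting the fraction to 0 assumes the job never calls cache()/persist(), as described above):

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Spark <= 1.5.x only: hand the whole cache region over to execution,
// because this job never caches anything.
val conf = new SparkConf()
  .setAppName("no-cache-job")
  .set("spark.storage.memoryFraction", "0")

val sc = new SparkContext(conf)
```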

In Spark 1.6.0 and later:

Memory management is now unified: storage and execution share a single region of the heap (sized by spark.memory.fraction), so spark.storage.memoryFraction no longer really applies.
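For 1.6.0 and later, the unified region is sized by spark.memory.fraction, and spark.memory.storageFraction marks the portion of it in which cached blocks are protected from eviction. A minimal sketch with illustrative values (the defaults changed between Spark releases):

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Spark 1.6.0+: storage and execution share one unified memory region.
val conf = new SparkConf()
  .setAppName("unified-memory-example")
  .set("spark.memory.fraction", "0.6")        // share of usable heap given to the unified region
  .set("spark.memory.storageFraction", "0.5") // portion of that region protected from eviction

val sc = new SparkContext(conf)
```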

Joe Widen answered Sep 28 '22