
How to deal with executor memory and driver memory in Spark?

I am confused about dealing with executor memory and driver memory in Spark.

My environment settings are as below:

  • 128 GB memory, 16 CPUs, 9 VMs
  • CentOS
  • Hadoop 2.5.0-cdh5.2.0
  • Spark 1.1.0

Input data information:

  • 3.5 GB data file from HDFS

For simple development, I executed my Python code in standalone cluster mode (8 workers, 20 cores, 45.3 G memory) with spark-submit. Now I would like to set executor memory or driver memory for performance tuning.
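Both settings can be passed straight to spark-submit on the command line. A minimal sketch (the master URL, memory sizes, and script name below are placeholders for your own values):

```shell
# Sketch only: the master URL, memory sizes, and script name are placeholders.
# --executor-memory sets spark.executor.memory (heap per executor process);
# --driver-memory sets spark.driver.memory (heap for the driver process).
spark-submit \
  --master spark://master:7077 \
  --executor-memory 4g \
  --driver-memory 2g \
  my_job.py
```

Equivalently, the same values can be set as `spark.executor.memory` and `spark.driver.memory` in `spark-defaults.conf` or via `--conf` flags.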

From the Spark documentation, the definition for executor memory is

Amount of memory to use per executor process, in the same format as JVM memory strings (e.g. 512m, 2g).

How about driver memory?

asked Nov 28 '14 by wlsherica



1 Answer

The memory you need to assign to the driver depends on the job.

If the job is based purely on transformations and terminates in some distributed output action like rdd.saveAsTextFile, rdd.saveToCassandra, ..., then the memory needs of the driver will be very low. A few hundred MB will do. The driver is also responsible for delivering files and collecting metrics, but is not involved in data processing.

If the job requires the driver to participate in the computation, e.g. some ML algorithm that needs to materialize results and broadcast them for the next iteration, then your job becomes dependent on the amount of data passing through the driver. Operations like .collect, .take and .takeSample deliver data to the driver, and hence the driver needs enough memory to hold that data.

e.g. if you have an RDD of 3 GB in the cluster and call val myresultArray = rdd.collect, then you will need 3 GB of memory in the driver to hold that data, plus some extra room for the functions mentioned in the first paragraph.
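That back-of-the-envelope sizing can be sketched in plain Python. The helper names below (`parse_jvm_memory`, `fits_in_driver`) and the 40% headroom figure are hypothetical, not part of Spark; only the JVM memory-string format ("512m", "2g") comes from the documentation quoted above:

```python
def parse_jvm_memory(s):
    """Parse a JVM memory string like '512m' or '2g' into bytes."""
    units = {"k": 1024, "m": 1024**2, "g": 1024**3, "t": 1024**4}
    s = s.strip().lower()
    if s[-1] in units:
        return int(float(s[:-1]) * units[s[-1]])
    return int(s)

def fits_in_driver(collected_bytes, driver_memory, headroom=0.4):
    """Rough check: can a .collect() of `collected_bytes` fit in the driver heap?

    `headroom` reserves a fraction of the heap for the driver's own work
    (task scheduling, broadcast blocks, metrics) mentioned above; 0.4 is an
    arbitrary illustrative value, not a Spark constant.
    """
    return collected_bytes <= parse_jvm_memory(driver_memory) * (1 - headroom)

# A 3 GB rdd.collect() does not fit in a 2g driver heap with headroom...
print(fits_in_driver(3 * 1024**3, "2g"))  # False
# ...but fits comfortably with --driver-memory 8g.
print(fits_in_driver(3 * 1024**3, "8g"))  # True
```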

answered Sep 18 '22 by maasg