I have two questions around performance tuning in Spark:
I understand that one of the key levers for controlling parallelism in a Spark job is the number of partitions in the RDD being processed, and then the number of executors and cores processing those partitions. Can I assume this to be true?
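For example, I can see the partition count driving the task count in something like this (a minimal sketch; the partition numbers are arbitrary):

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("partition-demo").getOrCreate()
val sc = spark.sparkContext

// A stage over this RDD is split into 100 tasks, one per partition;
// how many run concurrently is capped by the total executor cores.
val rdd = sc.parallelize(1 to 1000000, numSlices = 100)
println(rdd.getNumPartitions) // 100

// Repartitioning changes the unit of parallelism (at the cost of a shuffle).
println(rdd.repartition(200).getNumPartitions) // 200
```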
I understand that a high number of cores per executor can have a negative impact on things like HDFS writes, but here is my second question: purely from a data-processing point of view, what is the difference between the two? For example, on a 10-node cluster, what would be the difference between these two jobs (assuming there is ample memory per node to process everything):
5 executors * 2 executor cores
2 executors * 5 executor cores
Assuming infinite memory and CPU, should we expect these two configurations to perform the same?
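To make the comparison concrete, the two layouts could be expressed like this (a sketch using the standard `spark.executor.*` properties, assuming static allocation; everything else is illustrative):

```scala
import org.apache.spark.SparkConf

// Layout A: 5 executors x 2 cores each = 10 concurrent task slots
val layoutA = new SparkConf()
  .set("spark.executor.instances", "5")
  .set("spark.executor.cores", "2")

// Layout B: 2 executors x 5 cores each = 10 concurrent task slots
val layoutB = new SparkConf()
  .set("spark.executor.instances", "2")
  .set("spark.executor.cores", "5")

// Either conf yields the same total of 10 task slots per stage wave;
// they differ only in how cores and memory are grouped per executor JVM.
```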
Number of available executors = (total cores / num-cores-per-executor) = 150 / 5 = 30 (taking 150 as the total usable cores across the cluster).
The consensus in most Spark tuning guides is that 5 cores per executor is the optimal number for parallel processing.
The cores property controls the number of concurrent tasks an executor can run: --executor-cores 5 means that each executor can run at most five tasks at the same time.
Memory for each executor: 30 executors across 10 nodes means 3 executors per node, so each executor gets 63 GB / 3 = 21 GB (taking ~63 GB as the usable memory per node).
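Putting the worked numbers in one place (a sketch; the 10-node, 150-core, and 63 GB figures are the assumptions from the example above):

```scala
import org.apache.spark.SparkConf

// Worked sizing from the example above (assumed: 10 nodes,
// 150 usable cores total, ~63 GB usable memory per node).
val totalUsableCores = 150
val coresPerExecutor = 5                                    // recommended cap
val numExecutors     = totalUsableCores / coresPerExecutor  // 30
val executorsPerNode = numExecutors / 10                    // 3 per node
val memPerExecutorGb = 63 / executorsPerNode                // 21 GB

val conf = new SparkConf()
  .set("spark.executor.instances", numExecutors.toString)
  .set("spark.executor.cores", coresPerExecutor.toString)
  .set("spark.executor.memory", s"${memPerExecutorGb}g")
```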
Most of the time, using larger executors (more memory, more cores) is better. First: a larger executor with more memory can easily support broadcast joins and do away with shuffles. Second: since tasks are not created equal, statistically larger executors have a better chance of surviving OOM issues. The only problem with large executors is GC pauses; G1GC helps.
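Both points map to concrete settings; here is a hedged sketch (the input paths and join key are illustrative):

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.broadcast

val spark = SparkSession.builder()
  .appName("large-executor-demo")
  // Use G1GC on the executors to keep pauses manageable on large heaps.
  .config("spark.executor.extraJavaOptions", "-XX:+UseG1GC")
  .getOrCreate()

val facts = spark.read.parquet("/data/facts") // illustrative paths
val dims  = spark.read.parquet("/data/dims")

// Explicitly broadcasting the small side ships it to every executor,
// avoiding a shuffle of the large side; the small table must fit in
// each executor's memory, which is easier with larger executors.
val joined = facts.join(broadcast(dims), "key")
```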