No matter how much I tinker with the settings in yarn-site.xml, i.e. using all of the below options:

yarn.scheduler.minimum-allocation-vcores
yarn.nodemanager.resource.memory-mb
yarn.nodemanager.resource.cpu-vcores
yarn.scheduler.maximum-allocation-mb
yarn.scheduler.maximum-allocation-vcores

I still cannot get my application, i.e. Spark, to utilize all the cores on the cluster. The Spark executors seem to be taking up all the available memory correctly, but each executor keeps taking only a single core and no more.
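For reference, this is roughly how those properties are laid out in yarn-site.xml; the values below are placeholders for illustration, not my actual cluster settings:

<!-- Sketch only: values are placeholders, not the real cluster configuration -->
<property>
  <name>yarn.nodemanager.resource.memory-mb</name>
  <value>57344</value>
</property>
<property>
  <name>yarn.nodemanager.resource.cpu-vcores</name>
  <value>16</value>
</property>
<property>
  <name>yarn.scheduler.minimum-allocation-vcores</name>
  <value>1</value>
</property>
<property>
  <name>yarn.scheduler.maximum-allocation-mb</name>
  <value>57344</value>
</property>
<property>
  <name>yarn.scheduler.maximum-allocation-vcores</name>
  <value>16</value>
</property>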
Here are the options configured in spark-defaults.conf
spark.executor.cores                 3
spark.executor.memory                5100m
spark.yarn.executor.memoryOverhead   800
spark.driver.memory                  2g
spark.yarn.driver.memoryOverhead     400
spark.executor.instances             28
spark.reducer.maxMbInFlight          120
spark.shuffle.file.buffer.kb         200
Notice that spark.executor.cores is set to 3, but it doesn't work. How do I fix this?
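For completeness, the same settings expressed as spark-submit flags would look something like the following. This is only a sketch: the application jar is a placeholder, and flag spellings can differ slightly between Spark versions.

spark-submit \
  --master yarn \
  --executor-cores 3 \
  --executor-memory 5100m \
  --num-executors 28 \
  --driver-memory 2g \
  --conf spark.yarn.executor.memoryOverhead=800 \
  --conf spark.yarn.driver.memoryOverhead=400 \
  my-app.jar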
HDFS (storage) and YARN (processing) are the two core components of Apache Hadoop.
YARN is the main component of Hadoop v2.0. YARN opens up Hadoop by allowing data stored in HDFS to be processed not only by batch jobs but also by stream, interactive, and graph processing workloads. In this way, it supports running many types of distributed applications beyond MapReduce.
The ResourceManager (RM) is the master that arbitrates all the available cluster resources and thus helps manage the distributed applications running on the YARN system.
YARN extends the power of Hadoop to new technologies found within the data center so that you can take advantage of cost-effective linear-scale storage and processing. It provides independent software vendors and developers a consistent framework for writing data access applications that run in Hadoop.
The problem lies not with yarn-site.xml or spark-defaults.conf, but with the resource calculator that assigns cores to the executors (or, in the case of MapReduce jobs, to the mappers/reducers).
The default resource calculator, i.e. org.apache.hadoop.yarn.util.resource.DefaultResourceCalculator, uses only memory information when allocating containers; CPU scheduling is not enabled by default. To take both memory and CPU into account, the resource calculator needs to be changed to org.apache.hadoop.yarn.util.resource.DominantResourceCalculator in the capacity-scheduler.xml file.
Here's what needs to change.
<property>
  <name>yarn.scheduler.capacity.resource-calculator</name>
  <value>org.apache.hadoop.yarn.util.resource.DominantResourceCalculator</value>
</property>
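After making that change, the scheduler configuration has to be reloaded before it takes effect. Here is a minimal sketch of the follow-up steps; the exact procedure can vary by Hadoop version and distribution, and some setups require a full ResourceManager restart rather than a refresh:

# Reload the capacity scheduler configuration (or restart the ResourceManager)
yarn rmadmin -refreshQueues

# Resubmit the Spark job, then confirm in the ResourceManager web UI (or the
# Spark executors page) that each container is now allocated more than 1 vcore.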