Hadoop Yarn Container Does Not Allocate Enough Space

Tags:

hadoop

I'm running a Hadoop job, and in my yarn-site.xml file, I have the following configuration:

    <property>
            <name>yarn.scheduler.minimum-allocation-mb</name>
            <value>2048</value>
    </property>
    <property>
            <name>yarn.scheduler.maximum-allocation-mb</name>
            <value>4096</value>
    </property>

However, I still occasionally get the following error:

Container [pid=63375,containerID=container_1388158490598_0001_01_000003] is running beyond physical memory limits. Current usage: 2.0 GB of 2 GB physical memory used; 2.8 GB of 4.2 GB virtual memory used. Killing container.

I've found that by increasing yarn.scheduler.minimum-allocation-mb, the physical memory allocated for the container goes up. However, I don't always want 4GB being allocated for my container, and thought that by explicitly specifying a maximum size, I'd be able to go around this problem. I realize that Hadoop can't figure out how much memory it needs to allocate for the container before the mapper runs, so how should I go about allocating more memory for the container only if it needs that extra memory?

801

asked Dec 27 '13 15:12

Olshansk

1 Answers

You should also properly configure the memory allocations for MapReduce. From this HortonWorks tutorial:

[...]

For our example cluster, we have the minimum RAM for a Container (yarn.scheduler.minimum-allocation-mb) = 2 GB. We’ll thus assign 4 GB for Map task Containers, and 8 GB for Reduce tasks Containers.

In mapred-site.xml:

mapreduce.map.memory.mb: 4096

mapreduce.reduce.memory.mb: 8192

Each Container will run JVMs for the Map and Reduce tasks. The JVM heap size should be set to lower than the Map and Reduce memory defined above, so that they are within the bounds of the Container memory allocated by YARN.

In mapred-site.xml:

mapreduce.map.java.opts: -Xmx3072m

mapreduce.reduce.java.opts: -Xmx6144m

The above settings configure the upper limit of the physical RAM that Map and Reduce tasks will use.

Finally, someone in this thread in the Hadoop mailing list had the same problem and in their case, it turned out they had a memory leak in their code.

119

answered Oct 17 '22 18:10

cabad

Related questions
                            
                                HBase ERROR: hbase-default.xml file seems to be for and old version of HBase (null)
                            
                                Bypassing org.apache.hadoop.mapred.InvalidInputException: Input Pattern s3n://[...] matches 0 files
                            
                                Hive create table with inputs from nested sub-directories
                            
                                In spark join, does table order matter like in pig?
                            
                                HBase: Thrift vs Rest performance
                            
                                Get error "mismatched input 'as' expecting FROM near ')' in from clause" when run sql query Hadoop Java
                            
                                How to decide on the number of partitions required for input data size and cluster resources?
                            
                                When to prefer Hadoop MapReduce over Spark?
                            
                                Run Apache Flink with Amazon S3
                            
                                Retrieve files from remote HDFS
                            
                                Number of reducers in hadoop
                            
                                Hadoop, Hive, Pig, HBase, Cassandra - when to use what? [closed]
                            
                                Hadoop slave files configuration
                            
                                MapReduce shuffle/sort method
                            
                                Loading data with Hive, S3, EMR, and Recover Partitions
                            
                                Is there ln in hadoop HDFS
                            
                                zookeeper client does not provide CLI with "jline support is disabled" message
                            
                                HBase - What's the difference between WAL and MemStore?
                            
                                Configuring Hadoop logging to avoid too many log files
                            
                                Still getting "Unable to load realm info from SCDynamicStore" after bug fix

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With