Out of memory error in Mapreduce shuffle phase

I am getting strange errors while running a wordcount-like MapReduce program. I have a Hadoop cluster with 20 slaves, each with 4 GB of RAM. I configured my map tasks to have a heap of 300 MB and my reduce task slots get 1 GB. I have 2 map slots and 1 reduce slot per node. Everything goes well until the first round of map tasks finishes; then progress stalls at 100%. I suppose the copy phase is taking place then. Each map task generates something like:

Map output bytes    4,164,335,564
Map output materialized bytes   608,800,675

(I am using SnappyCodec for compression)

After stalling for about an hour, the reduce tasks crash with the following exception:

    Error: java.lang.OutOfMemoryError: Java heap space
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.shuffleInMemory(ReduceTask.java:1703)
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getMapOutput(ReduceTask.java:1563)
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.copyOutput(ReduceTask.java:1401)
        at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.run(ReduceTask.java:1333)

I googled around and found this link, but I don't really know what to make of it: hadoop common link

I don't understand why Hadoop would have any problem copying and merging when it is able to run a terasort benchmark. It cannot be a requirement that all map output fits into the RAM of the reducer thread. So what is going on here?

In the link provided above they have a discussion about tuning the following parameters:

mapreduce.reduce.shuffle.input.buffer.percent = 0.7
mapreduce.reduce.shuffle.memory.limit.percent = 0.25
mapreduce.reduce.shuffle.parallelcopies = 5

They claim that a product of these parameters greater than 1 allows for heap-size errors. (EDIT: note that 5 * 0.25 * 0.7 is still < 1, so focus on my second solution post!) Before restarting this intensive simulation I would be very happy to hear someone's opinion on the problem I am facing, since it has been bothering me for almost a week now. I also don't seem to fully understand what is happening in this copy phase; I'd expect an on-disk merge sort not to require much heap.
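To make the numbers concrete, here is my rough reading of how these three settings combine, assuming my 1 GB reduce task heap (the megabyte figures are only my own back-of-the-envelope estimates):

    <!-- Rough estimate for a 1 GB reduce task heap:
         0.70 * 1024 MB ~ 717 MB reserved for in-memory map outputs,
         0.25 *  717 MB ~ 179 MB allowed per single fetched map output,
         5 copiers * 179 MB ~ 896 MB if every copier hits that limit at once. -->
    <property>
      <name>mapreduce.reduce.shuffle.input.buffer.percent</name>
      <value>0.7</value>
    </property>
    <property>
      <name>mapreduce.reduce.shuffle.memory.limit.percent</name>
      <value>0.25</value>
    </property>
    <property>
      <name>mapreduce.reduce.shuffle.parallelcopies</name>
      <value>5</value>
    </property>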

Thanks a lot in advance for any helpful comments and answers!

asked Oct 10 '13 by DDW


People also ask

What is shuffle phase in MapReduce?

Shuffling in MapReduce: the process of transferring data from the mappers to the reducers is known as shuffling, i.e. the process by which the system performs the sort and transfers the map output to the reducer as input.

What is the purpose of the shuffle operation in Hadoop MapReduce?

In Hadoop MapReduce, the process of shuffling is used to transfer data from the mappers to the necessary reducers. It is the process in which the system sorts the unstructured data and transfers the output of the map as an input to the reducer.

What is shuffle stage in Hadoop?

Shuffle phase in Hadoop transfers the map output from Mapper to a Reducer in MapReduce. Sort phase in MapReduce covers the merging and sorting of map outputs. Data from the mapper are grouped by the key, split among reducers, and sorted by the key. Every reducer obtains all values associated with the same key.


3 Answers

I think the clue is that the heap of my reduce task was needed almost entirely for the reduce phase, but the shuffle phase was competing for the same heap space, and that conflict caused my jobs to crash. I think this explains why the job no longer crashes once I lower shuffle.input.buffer.percent.
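For reference, this is roughly how the change looks in the job configuration or mapred-site.xml (0.2 is simply the value that worked for me; on Hadoop 1.x the property is named mapred.job.shuffle.input.buffer.percent):

    <property>
      <name>mapreduce.reduce.shuffle.input.buffer.percent</name>
      <value>0.2</value>
      <description>Fraction of the reduce task heap used to hold map outputs during the shuffle (default 0.70).</description>
    </property>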

answered Sep 22 '22 by DDW


The parameter you cite, mapred.job.shuffle.input.buffer.percent, is apparently a pre-Hadoop 2 parameter. I could find that parameter in mapred-default.xml per the 1.0.4 docs, but its name has changed to mapreduce.reduce.shuffle.input.buffer.percent per the 2.2.0 docs.

Per the docs this parameter's description is:

The percentage of memory to be allocated from the maximum heap size to storing map outputs during the shuffle.

For a complete understanding of sort and shuffle, see chapter 6.4 of Hadoop: The Definitive Guide. That book provides an alternate definition of the parameter mapred.job.shuffle.input.buffer.percent:

The proportion of total heap size to be allocated to the map outputs buffer during the copy phase of the shuffle.

Since you observed that decreasing the value of mapred.job.shuffle.input.buffer.percent from its default of 0.7 to 0.2 solved your problem, it is pretty safe to say that you could also have solved it by increasing the reducer's heap size.
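As a minimal sketch of that alternative, assuming Hadoop 1.x property names (mapreduce.reduce.java.opts on Hadoop 2) and an arbitrary example value that must still fit within the memory available per reduce slot:

    <property>
      <name>mapred.reduce.child.java.opts</name>
      <value>-Xmx1536m</value>
      <description>Example only; the question's reduce slots were effectively -Xmx1024m.</description>
    </property>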

answered Sep 26 '22 by harschware


Even after changing shuffle.input.buffer.percent to 0.2 it didn't work for me, and I got the same error.

After some trial and error on a single-node cluster, I found that there needs to be enough free space under the / directory, because the process spills to disk there.

The spill directory also needs to be changed.
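This is the kind of setting I mean, with purely hypothetical example paths; mapred.local.dir (mapreduce.cluster.local.dir on Hadoop 2) controls where intermediate map output and spill files are written, and since it is read when the TaskTracker starts, it has to be changed in each node's mapred-site.xml rather than per job:

    <property>
      <name>mapred.local.dir</name>
      <value>/data/1/mapred/local,/data/2/mapred/local</value>
    </property>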

answered Sep 23 '22 by Omkant