Map Reduce Slot Definition

Tags:

I am on my way for becoming a cloudera Hadoop administrator. Since my start, I am hearing a lot about computing slots per machine in a Hadoop Cluster like defining number of Map Slots and Reduce slots.

I have searched internet for a log time for getting a Noob definition for a Map Reduce Slot but didn't find any.

I am really pissed off by going through PDF's explaining the configuration of Map Reduce.

Please explain what exactly it means when it comes to a computing slot in a Machine of a cluster.

372

asked Sep 30 '22 01:09

abbasdjinn

1 Answers

In map-reduce v.1 mapreduce.tasktracker.map.tasks.maximum and mapreduce.tasktracker.reduce.tasks.maximum are used to configure number of map slots and reduce slots accordingly in mapred-site.xml.

starting from map-reduce v.2 (YARN), containers is a more generic term is used instead of slots, containers represents the max number of tasks that can run in parallel under the node regardless being Map task, Reduce task or application master task (in YARN).

answered Nov 02 '22 10:11

Hassan Kalaldeh

Related questions
                            
                                What would be a good application for an enhanced version of MapReduce that shares information between Mappers?
                            
                                Updating a hadoop HDFS file
                            
                                what's the best practice for pooling Hive JDBC connections
                            
                                How do I use hadoop fs -getmerge to download .deflate files?
                            
                                Giraph Shortest Paths Example ClassNotFoundException
                            
                                handoop connect error with put/copyFromLocal
                            
                                When it comes to mapreduce how are the Accumulo tablets mapped to an HDFS block
                            
                                Permission denied (publickey) on EC2 while starting Hadoop
                            
                                Getting E0902: Exception occured: [User: oozie is not allowed to impersonate oozie]
                            
                                Hadoop Nodemanager and Resourcemanager not starting
                            
                                Reading files from hdfs vs local directory
                            
                                Download file weekly from FTP to HDFS
                            
                                Hadoop mapReduce How to store only values in HDFS
                            
                                How to parse a JSON string from a column with Pig
                            
                                Hadoop error in shuffle in fetcher: Exceeded MAX_FAILED_UNIQUE_FETCHES
                            
                                Hadoop Namenode Metadata - fsimage and edit logs
                            
                                What is an efficient way of running a logistic regression for large data sets (200 million by 2 variables)?
                            
                                What is the difference between an RDD partition and a slice?
                            
                                How do I correctly remove nodes in Hadoop?
                            
                                Where HDFS stores data

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Map Reduce Slot Definition

Tags:

hadoop

mapreduce

cluster-computing

cloudera-cdh

job-scheduling

abbasdjinn

People also ask

1 Answers

Hassan Kalaldeh

Recent Activity

Donate For Us