Can anyone explain how the RecordReader actually works? How are the methods <code>nextkeyvalue()</code>, <code>getCurrentkey()</code> and <code>getprogress()</code> work after the program starts executing?

(new API): The default Mapper class has a run method which looks like this: <pre class="prettyprint"><code>public void run(Context context) throws IOException, InterruptedException { setup(context); while (context.nextKeyValue()) { map(context.getCurrentKey(), context.getCurrentValue(), context); } cleanup(context); } </code></pre> The <code>Context.nextKeyValue()</code>, <code>Context.getCurrentKey()</code> and <code>Context.getCurrentValue()</code> methods are wrappers for the <code>RecordReader</code> methods. See the source file <code>src/mapred/org/apache/hadoop/mapreduce/MapContext.java</code>. So this loop executes and calls your Mapper implementation's <code>map(K, V, Context)</code> method. Specifically, what else would you like to know?

Working of RecordReader in Hadoop

1 Answers

(new API): The default Mapper class has a run method which looks like this:

public void run(Context context) throws IOException, InterruptedException {
    setup(context);
    while (context.nextKeyValue()) {
        map(context.getCurrentKey(), context.getCurrentValue(), context);
    }
    cleanup(context);
}

The Context.nextKeyValue(), Context.getCurrentKey() and Context.getCurrentValue() methods are wrappers for the RecordReader methods. See the source file src/mapred/org/apache/hadoop/mapreduce/MapContext.java.

So this loop executes and calls your Mapper implementation's map(K, V, Context) method.

Specifically, what else would you like to know?

125

answered Sep 22 '22 08:09

Chris White

Related questions
                            
                                Yarn container understanding and tuning
                            
                                Is it possible to install Beeline to run Hive queries without installing Hive?
                            
                                How to set gradle path after installing using sdkman
                            
                                Spark/Yarn: File does not exist on HDFS
                            
                                BindException in Hadoop on EC2
                            
                                hadoop failed to build from source
                            
                                HBase : get(...) vs scan and in-memory table
                            
                                map reduce word count example
                            
                                PIG: ERROR 1000: Error during parsing
                            
                                What is the difference between the hive jdbc client and the hive metastore java api?
                            
                                Running spark-submit with --master yarn-cluster: issue with spark-assembly
                            
                                Why there are many spark-warehouse folders got created?
                            
                                Getting started with MapReduce/Hadoop [closed]
                            
                                Error in starting hadoop Job Tracker
                            
                                Hadoop / MapReduce - Optimizing "Top N" Word Count MapReduce Job
                            
                                How to use hbase with Spring Boot using Java instead of XML?
                            
                                How to edit and relaunch a terminated cluster on Amazon EMR?
                            
                                Hadoop 2.0 Name Node, Secondary Node and Checkpoint node for High Availability
                            
                                Different ways of configuring the memory to the TaskTracker child process (Mapper and Reduce Tasks)
                            
                                Finding Connected Components using Hadoop/MapReduce

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Working of RecordReader in Hadoop

Tags:

hadoop

mapreduce

Amnesiac

People also ask

1 Answers

Chris White

Recent Activity

Donate For Us