all I have simple map/reduce implementation. Mapper is called and it does its job but reducer is never called. Here is mapper: <pre class="prettyprint"><code>static public class InteractionMap extends Mapper<LongWritable, Text, Text, InteractionWritable> { @Override protected void map(LongWritable offset, Text text, Context context) throws IOException, InterruptedException { System.out.println("mapper"); String[] tokens = text.toString().split(","); for (int idx = 0; idx < tokens.length; idx++) { String sourceUser = tokens[1]; String targetUser = tokens[2]; int points = Integer.parseInt(tokens[4]); context.write(new Text(sourceUser), new InteractionWritable(targetUser, points)); } } } } </code></pre> Here is my reducer: <pre class="prettyprint"><code>static public class InteractionReduce extends Reducer<Text, InteractionWritable, Text, Text> { @Override protected void reduce(Text token, Iterable<InteractionWritable> counts, Context context) throws IOException, InterruptedException { System.out.println("REDUCER"); Iterator<InteractionWritable> i = counts.iterator(); while (i.hasNext()) { InteractionWritable interaction = i.next(); context.write(token, new Text(token.toString() + " " + interaction.getTargetUser().toString() + " " + interaction.getPoints().get())); } } } </code></pre> And, here is configuration part: <pre class="prettyprint"><code>@Override public int run(String[] args) throws Exception { Configuration configuration = getConf(); Job job = new Job(configuration, "Interaction Count"); job.setJarByClass(InteractionMapReduce.class); job.setMapperClass(InteractionMap.class); job.setCombinerClass(InteractionReduce.class); job.setReducerClass(InteractionReduce.class); job.setInputFormatClass(TextInputFormat.class); job.setOutputFormatClass(TextOutputFormat.class); job.setOutputKeyClass(Text.class); job.setOutputValueClass(Text.class); FileInputFormat.addInputPath(job, new Path(args[0])); FileOutputFormat.setOutputPath(job, new Path(args[1])); return job.waitForCompletion(true) ? 0 : -1; } </code></pre> Does anyone have any idea why reducer is not being invoked?

<ol> <li>I hope the <code>text</code> in your <code>Mapper</code> method has some data.</li> <li>Do you really need the <code>Reducer</code> to be the <code>Combiner</code> as well as the <code>Reducer</code>?</li> </ol> I always have one main class <code>InteractionMapReduce</code> and inside it I have the <code>InteractionMap</code> and the <code>InteractionReduce</code> class. So while setting the <code>Mapper</code> and the <code>Reducer</code> class in the job, I set them like <code>InteractionMapReduce.InteractionMap.class</code> and <code>InteractionMapReduce.InteractionReduce.class</code>. I do not know whether this would help you but you could try it.

Hadoop reducer not being called

Tags:

hadoop

mapreduce

all

I have simple map/reduce implementation. Mapper is called and it does its job but reducer is never called.

Here is mapper:

static public class InteractionMap extends Mapper<LongWritable, Text, Text, InteractionWritable> {

    @Override
    protected void map(LongWritable offset, Text text, Context context) throws IOException, InterruptedException {
        System.out.println("mapper");
        String[] tokens = text.toString().split(",");
        for (int idx = 0; idx < tokens.length; idx++) {
            String sourceUser = tokens[1];
            String targetUser = tokens[2];
            int points = Integer.parseInt(tokens[4]);
            context.write(new Text(sourceUser), new InteractionWritable(targetUser, points));
            }
        }
    }
}

Here is my reducer:

static public class InteractionReduce extends Reducer<Text, InteractionWritable, Text, Text> {

    @Override
    protected void reduce(Text token, Iterable<InteractionWritable> counts, Context context) throws IOException, InterruptedException {
        System.out.println("REDUCER");
        Iterator<InteractionWritable> i = counts.iterator();
        while (i.hasNext()) {
            InteractionWritable interaction = i.next();
            context.write(token, new Text(token.toString() + " " + interaction.getTargetUser().toString() + " " + interaction.getPoints().get()));
        }
    }

}

And, here is configuration part:

@Override
public int run(String[] args) throws Exception {
    Configuration configuration = getConf();
    Job job = new Job(configuration, "Interaction Count");
    job.setJarByClass(InteractionMapReduce.class);
    job.setMapperClass(InteractionMap.class);
    job.setCombinerClass(InteractionReduce.class);
    job.setReducerClass(InteractionReduce.class);
    job.setInputFormatClass(TextInputFormat.class);
    job.setOutputFormatClass(TextOutputFormat.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(Text.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    return job.waitForCompletion(true) ? 0 : -1;
}

Does anyone have any idea why reducer is not being invoked?

743

asked Oct 01 '12 15:10

ezamur

2 Answers

Ok, it was my fault, as expected. Job configuration wasn't good. This is how it should look like:

Configuration configuration = getConf();

Job job = new Job(configuration, "Interaction Count");
job.setJarByClass(InteractionMapReduce.class);
job.setMapperClass(InteractionMap.class);
job.setReducerClass(InteractionReduce.class);
job.setMapOutputKeyClass(Text.class);
job.setMapOutputValueClass(InteractionWritable.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(Text.class);

FileInputFormat.addInputPath(job, new Path(args[0]));
FileOutputFormat.setOutputPath(job, new Path(args[1]));

return job.waitForCompletion(true) ? 0 : -1;

The problem occurred because map and reduce phases have different output types. Job failed silently after invoking context.write method. So, what I had to add are these lines:

job.setMapOutputKeyClass(Text.class);
job.setMapOutputValueClass(InteractionWritable.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(Text.class);

148

answered Sep 22 '22 16:09

ezamur

I hope the text in your Mapper method has some data.
Do you really need the Reducer to be the Combiner as well as the Reducer?

I always have one main class InteractionMapReduce and inside it I have the InteractionMap and the InteractionReduce class.

So while setting the Mapper and the Reducer class in the job, I set them like InteractionMapReduce.InteractionMap.class and InteractionMapReduce.InteractionReduce.class.

I do not know whether this would help you but you could try it.

answered Sep 25 '22 16:09

JHS

Related questions
                            
                                Is Cassandra for OLAP or OLTP or both?
                            
                                Cannot load main class from JAR file
                            
                                What does virtual core in YARN vcore mean?
                            
                                Is it possible to read pdf/audio/video files(unstructured data) using Apache Spark?
                            
                                How do we convert a string into Array in hive?
                            
                                Why does all columns get created as string when I use OpenCSVSerde in Hive?
                            
                                How HBase partitions table across regionservers?
                            
                                Hive/HBase Integration - Zookeeper Session Closes Immediately
                            
                                Debugging in PIG UDF
                            
                                How can I force Flume-NG to process the backlog of events after a sink failed?
                            
                                How to remove an ambari service after they have been added
                            
                                What is the difference between classic, local for mapreduce.framework.name in mapred-site.xml?
                            
                                using pyspark, read/write 2D images on hadoop file system
                            
                                How can I merge spark results files without repartition and copyMerge?
                            
                                spark + hadoop data locality
                            
                                How to filter out rows with NaN values in Hive?
                            
                                Can somebody give a high-level, simple explanation to a beginner about how Hadoop works?
                            
                                Chaining multiple mapreduce tasks in Hadoop streaming
                            
                                How do I make Hadoop find imported Python modules when using Python UDFs in Pig?
                            
                                MapReduce - How sort reduce output by value

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Hadoop reducer not being called

Tags:

hadoop

mapreduce

ezamur

People also ask

2 Answers

ezamur

JHS

Recent Activity

Donate For Us