How to write 'map only' hadoop jobs?

Tags:

I'm a novice on hadoop, I'm getting familiar to the style of map-reduce programing but now I faced a problem : Sometimes I need only map for a job and I only need the map result directly as output, which means reduce phase is not needed here, how can I achive that?

306

asked Feb 22 '12 12:02

Breakinen

1 Answers

This turns off the reducer.

job.setNumReduceTasks(0);

http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapreduce/Job.html#setNumReduceTasks(int)

answered Sep 22 '22 08:09

Thomas Jungblut

Related questions
                            
                                How to choose between Cassandra, Membase, Hadoop, MongoDB, RDBMS etc.? [closed]
                            
                                How do I get schema / column names from parquet file?
                            
                                How does Hadoop perform input splits?
                            
                                Why do we need ZooKeeper in the Hadoop stack?
                            
                                Ports are not available: listen tcp 0.0.0.0/50070: bind: An attempt was made to access a socket in a way forbidden by its access permissions
                            
                                SparkSQL vs Hive on Spark - Difference and pros and cons?
                            
                                Why spark-shell fails with NullPointerException?
                            
                                Thrift, Avro, Protocolbuffers - Are they all dead?
                            
                                Setting the number of map tasks and reduce tasks
                            
                                How to get started with Big Data Analysis [closed]
                            
                                Free Large datasets to experiment with Hadoop
                            
                                Datanode process not running in Hadoop
                            
                                Datanode not starts correctly
                            
                                Cascading examples failed to compile?
                            
                                Spark on yarn concept understanding
                            
                                Cleanest way in Gradle to get the path to a jar file in the gradle dependency cache
                            
                                What is best way to start and stop hadoop ecosystem, with command line?
                            
                                How to get the input file name in the mapper in a Hadoop program?
                            
                                Why HBase is a better choice than Cassandra with Hadoop?
                            
                                Schema evolution in parquet format

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to write 'map only' hadoop jobs?

Tags:

hadoop

mapreduce

Breakinen

People also ask

1 Answers

Thomas Jungblut

Recent Activity

Donate For Us