Where does job.setOutputKeyClass and job.setOutputReduceClass refers to?

1 Answers

Calling job.setOutputKeyClass( NullWritable.class ); will set the types expected as output from both the map and reduce phases.

If your Mapper emits different types than the Reducer, you can set the types emitted by the mapper with the JobConf's setMapOutputKeyClass() and setMapOutputValueClass() methods. These implicitly set the input types expected by the Reducer.

(source: Yahoo Developer Tutorial)

Regarding your second question, the default InputFormat is the TextInputFormat. This treats each line of each input file as a separate record, and performs no parsing. You can call these methods if you need to process your input in a different format, here are some examples:

InputFormat             | Description                                      | Key                                      | Value
--------------------------------------------------------------------------------------------------------------------------------------------------------
TextInputFormat         | Default format; reads lines of text files        | The byte offset of the line              | The line contents
KeyValueInputFormat     | Parses lines into key, val pairs                 | Everything up to the first tab character | The remainder of the line
SequenceFileInputFormat | A Hadoop-specific high-performance binary format | user-defined                             | user-defined

The default instance of OutputFormat is TextOutputFormat, which writes (key, value) pairs on individual lines of a text file. Some examples below:

OutputFormat             | Description
---------------------------------------------------------------------------------------------------------
TextOutputFormat         | Default; writes lines in "key \t value" form
SequenceFileOutputFormat | Writes binary files suitable for reading into subsequent MapReduce jobs
NullOutputFormat         | Disregards its inputs

(source: Other Yahoo Developer Tutorial)

103

answered Sep 19 '22 09:09

Charles Menguy

Related questions
                            
                                HttpURLConnection implementation
                            
                                Java: Generics syntax
                            
                                How to sanitize HTML code to prevent XSS attacks in Java or JSP?
                            
                                How to execute the JAXB compiler from ANT
                            
                                How to create a gwt composite component with children using uibinder?
                            
                                Java execute a command with a space in the pathname
                            
                                how do i create a custom cursor adapter for a listview for use with images and text?
                            
                                What's the difference between Printwriter and OutputStream [duplicate]
                            
                                how to copy data from file to PostgreSQL using JDBC?
                            
                                How do I debug Segfaults occurring in the JVM when it runs my code?
                            
                                Google Interview Question [closed]
                            
                                how to convert hex to base64
                            
                                Is there a Coffeescript for Java? In other words X gets compiled to Java [closed]
                            
                                Spring security3 "You cannot use a spring-security-2.0.xsd schema"
                            
                                Hibernate: Cascade Type
                            
                                Character encoding issue with Tomcat
                            
                                Android Countdown Timer to Date
                            
                                Difference between cursor.count() and cursor.size() in MongoDB
                            
                                Dynamic vs XML layout in Android?
                            
                                java String.format: numbers with localization

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Where does job.setOutputKeyClass and job.setOutputReduceClass refers to?

Tags:

java

hadoop

mapreduce

nik686

People also ask

1 Answers

Charles Menguy

Recent Activity

Donate For Us