
IdentityReducer in the new Hadoop API

I spent almost a day trying but couldn't figure out how to use IdentityReducer in the new Hadoop API. All the references and classes I can find are for the old API, and mixing the old API's IdentityReducer class into a codebase written against the new API obviously doesn't work. Any help will be appreciated.

kee asked Mar 17 '12 03:03



1 Answer

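In the new API, Mapper and Reducer are classes instead of interfaces, and their default map() and reduce() implementations pass each key/value pair through unchanged. That makes the base classes themselves the replacement for IdentityMapper and IdentityReducer.

So you just have to use Mapper.class and Reducer.class to get the identity behavior.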
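A minimal driver sketch of that setup, assuming Hadoop 2.x or later on the classpath (on older releases `new Job(conf, name)` takes the place of `Job.getInstance`) and the default TextInputFormat, which produces LongWritable offsets as keys and Text lines as values. The class name IdentityJob and the use of args[0] and args[1] as input and output paths are placeholders for illustration, not part of the original question.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// Hypothetical driver class; the name is illustrative only.
public class IdentityJob {
    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "identity job");
        job.setJarByClass(IdentityJob.class);

        // The new-API base classes act as the identity mapper and reducer:
        // their default map() and reduce() write out what they receive.
        job.setMapperClass(Mapper.class);
        job.setReducerClass(Reducer.class);

        // Assumes the default TextInputFormat: key = byte offset, value = line.
        job.setOutputKeyClass(LongWritable.class);
        job.setOutputValueClass(Text.class);

        // Placeholder paths taken from the command line.
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Both setter calls could also be left out, since Mapper and Reducer are what a new-API job falls back to when no mapper or reducer class is set; naming them explicitly just makes the intent visible.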

Thomas Jungblut answered Oct 11 '22 16:10