I spent almost a day but couldn't figure out how to use IdentityReducer in the new Hadoop API. All references or classes I can find are with the old API. And obviously mixing up old API idetntitreducer class in the new API codebase doesn't go well. Any help will be appreciated.
Identity Mapper is the default mapper class which is provided by Hadoop. Identity Mapper class is a generic class and it can be used with any key-value pairs data types.
Identity mapper and reducer are default mapper and reducer which are picked up by the map-reduce framework when no mapper or reducer class is defined in driver class. They do not do any type of processing in the data and write the value to the output which it gets from the input.
This is a specification of the Hadoop FileSystem APIs, which models the contents of a filesystem as a set of paths that are either directories, symbolic links, or files.
Mainly because in the new API Mapper
and Reducer
are classes instead of interfaces, they are the new replacement of IdentityMapper/Reducer.
So you just have to use Mapper.class
and Reducer.class
to get the identity.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With