Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

apache hadoop versions 2.0 vs. 0.23

Tags:

There are so many Hadoop versions and different distributions which make me confused. I have a few questions.

  1. Apache Hadoop 1.x is from 0.20.205?
  2. Apache Hadoop 2.0 is from 0.22 or 0.23?
like image 474
kee Avatar asked Sep 01 '12 07:09

kee


People also ask

Which Hadoop version should I use?

Hadoop 3. x we will be using Timeline service version v. 2. This version of Timeline service provides for more scalability, reliability and enhanced usability by introducing flows and aggregation. This version of the Timeline is more scalable than its previous version.

How do I check my Hadoop version?

Using HDFS command line is one of the best way to get the detailed version. Using HDP Select command on the host where you want to check the version. Using Ambari API also we can get some idea about the hdfs client version shipped and installed as part of the HDP.

Is Hadoop still relevant?

Or, is it dead altogether? In reality, Apache Hadoop is not dead, and many organizations are still using it as a robust data analytics solution. One key indicator is that all major cloud providers are actively supporting Apache Hadoop clusters in their respective platforms.


2 Answers

According to this blogpost from Cloudera:

There is next to no functional difference between 0.20.205 and 1.0. This is just a renumbering.

Hadoop's Yarn site states:

MapReduce has undergone a complete overhaul in hadoop-0.23 and we now have, what we call, MapReduce 2.0 (MRv2) or YARN

It's also worth to have a look at this diagram too. It shows the tree of different Hadoop versions as well as the 3rd party distributions on top of them.

like image 106
Lorand Bendig Avatar answered Sep 21 '22 13:09

Lorand Bendig


updated answer http://elephantscale.com/hadoop2_handbook/Hadoop_Versions.html

(disclaimer : I am a co-author of this online book)

like image 27
Sujee Maniyam Avatar answered Sep 22 '22 13:09

Sujee Maniyam