Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Good tutorial on how install Hadoop 2.2.0 (Yarn) as single node cluster on MacOS [closed]

Tags:

hadoop

I'm trying to install Apache Hadoop 2.2.0 on my MacOS as single-node cluster and unable to find any documentation which helps to complete setup without errors. All guidelines I found so far on Hadoop, Cloudera and other web sites are either lack details or contain outdated information. Can anyone refer to consistent and clean step-by-step instruction which really works for single-node Apache Hadoop 2.2.0 setup on MacOS ?

like image 604
Vladimir Kroz Avatar asked Nov 07 '13 17:11

Vladimir Kroz


People also ask

What is single node cluster in Hadoop?

Single Node Cluster – It Has one DataNode running and setting up all the NameNode, DataNode, Resource Manager, and NodeManager on a single machine. This is used for studying and testing purposes. Multi-Node Cluster – Has more than one DataNode running and each DataNode is running on different machines.

What is multi node cluster?

A Multi Node Cluster in Hadoop contains two or more DataNodes in a distributed Hadoop environment. This is practically used in organizations to store and analyze their Petabytes and Exabytes of data. Learning to set up a multi node cluster gears you closer to your much needed Hadoop certification.


1 Answers

I recently wrote a guide of my own as I also found the official documentation lacking and because all other guides seem to be stuck on the pre-YARN hadoop mentality and are full of no longer necessary steps/environment values. I also included a Fabric script for easy cluster deployment. I would love your feedback!

like image 175
AlexJF Avatar answered Nov 09 '22 18:11

AlexJF