Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Hadoop namenode disk size

Tags:

hadoop

Are there any suggestions about size of HDD on namenode physical machine? Sure, it does not store any data from HDFS like datanode but what should I depend on while creating cluster?

like image 597
Valery Yesypenko Avatar asked Aug 08 '13 13:08

Valery Yesypenko


2 Answers

Physical disk space on the NameNode does not really matter unless you run a Datanode on the same node. However, it is very important to have good memory (RAM) space allocated to the NameNode. This is because the NameNode stores all the metadata of the HDFS (block allocations, block locations etc.), in memory. If sufficient memory is not allocated, the NameNode might run out of memory and fail.

like image 140
Chaos Avatar answered Oct 24 '22 08:10

Chaos


You might need some space to actually store the the NameNode's FSImage, edit file and other relevant files.

It's actually recommended to configure NameNode to use multiple directories (one local and other NFS mount), so that multiple copies of File System metadata will be stored. That way, as long as the directories are on separate disks, a single disk failure will not corrupt the meta-data.

Please see this link for more details.

like image 4
SSaikia_JtheRocker Avatar answered Oct 24 '22 09:10

SSaikia_JtheRocker