
Managing HDFS in pseudo-distributed Hadoop mode

Tags:

hadoop

hdfs

I want to do some computation with Hadoop and Mahout on my quad-core machine, so I am using Hadoop in pseudo-distributed mode.

The problem is that space on my root drive is limited. How can I configure Hadoop to use the space available on an external hard drive instead?

Pankaj asked Feb 07 '11 13:02



1 Answer

You can configure where HDFS stores its data. Add the following to your conf/hdfs-site.xml:

<property>
    <name>dfs.data.dir</name>
    <value>__path_to_where_you_want_to_store_your_data/hdfs/data/</value>
</property>
<property>
    <name>dfs.name.dir</name>
    <value>__path_to_where_you_want_to_store_your_data/hdfs/name/</value>
</property>

After these changes you will have to format your namenode (note that formatting wipes any existing HDFS metadata):

hadoop namenode -format
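
Before formatting, it may help to check that the target directories exist on the external drive and are writable by the user that runs Hadoop. The following is only a minimal sketch: the function name `prepare_hdfs_dirs` and the example mount point in the usage comment are hypothetical, and the `hdfs/data` and `hdfs/name` subdirectories simply mirror the values used in the configuration above.

```shell
#!/bin/sh
# Sketch: create and check the directories referenced by
# dfs.data.dir and dfs.name.dir under a base path you choose.
# prepare_hdfs_dirs is a hypothetical helper, not part of Hadoop.
prepare_hdfs_dirs() {
    base="$1"

    # Create both storage directories (no error if they already exist).
    mkdir -p "$base/hdfs/data" "$base/hdfs/name" || return 1

    # Fail early if the current user cannot write to them.
    [ -w "$base/hdfs/data" ] && [ -w "$base/hdfs/name" ] || return 1

    # Show the free space on the filesystem that holds the base path.
    df -h "$base"
}

# Example usage (the mount point below is an assumption; use your own):
#   prepare_hdfs_dirs /mnt/external/hadoop
#   hadoop namenode -format
```

Run it as the same user that starts the Hadoop daemons, so the writability check reflects what the namenode and datanode will actually see.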

Helmut Zechmann answered Nov 13 '22 18:11
