How can I increase the configured capacity of my hadoop DFS from the default 50GB to 100GB?
My present setup is Hadoop 1.2.1 running on a CentOS 6 machine with 120GB of 450GB used. I have set up Hadoop in pseudo-distributed mode with the /conf suggested by "Hadoop: The Definitive Guide", 3rd edition. hdfs-site.xml had only one configured property:
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
</configuration>
The following line gave no error feedback; it simply returns to the prompt:
hadoop dfsadmin -setSpaceQuota 100g /tmp/hadoop-myUserID
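(For what it's worth, my understanding is that a space quota only limits usage under that directory; it does not change the configured capacity that dfsadmin reports. Assuming the directory exists in HDFS, the quota can be checked with
hadoop fs -count -q /tmp/hadoop-myUserID
which lists the quota and remaining space quota for the path.)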
If I am in a regen loop (having executed
rm -rf /tmp/hadoop-myUserId
in an attempt to "start from scratch"), this seeming success of setSpaceQuota occurs if and only if I have executed
start-all.sh
hadoop namenode -format
The failure of my DFS capacity configuration is shown by
hadoop dfsadmin -report
which still shows the same 50GB of configured capacity.
I would be willing to switch over to Hadoop 2.2 (now a stable release) if that is currently the best way to get 100GB of HDFS configured capacity. It seems like there should be a configuration property for hdfs-site.xml that would allow me to use more of my free partition.
Set the location of HDFS to a partition with more free space. For Hadoop 1.2.1 this can be done by setting hadoop.tmp.dir in hadoop-1.2.1/conf/core-site.xml:
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!-- Put site-specific property overrides in this file. -->
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://localhost:9000</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/home/myUserID/hdfs</value>
    <description>base location for other hdfs directories.</description>
  </property>
</configuration>
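As an alternative I did not end up using, Hadoop 1.x also appears to let you point the name and data directories at the larger partition directly in hdfs-site.xml via dfs.name.dir and dfs.data.dir, instead of relocating everything under hadoop.tmp.dir. A minimal sketch, with placeholder paths under my /home partition:
<configuration>
  <!-- placeholder paths; adjust to a directory on the larger partition -->
  <property>
    <name>dfs.name.dir</name>
    <value>/home/myUserID/hdfs/name</value>
  </property>
  <property>
    <name>dfs.data.dir</name>
    <value>/home/myUserID/hdfs/data</value>
  </property>
</configuration>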
Running
df
showed that my /home partition was my hard disk, minus 50GB for my / (root) partition. The default location for HDFS is
/tmp/hadoop-myUserId
which is in the / partition. This is where my initial 50GB HDFS size came from.
Creation of a directory for HDFS, and confirmation of which partition it is located on, was accomplished by
mkdir ~/hdfs
df -P ~/hdfs | tail -1 | cut -d' ' -f 1
Successful implementation was accomplished by running
stop-all.sh
start-dfs.sh
hadoop namenode -format
start-all.sh
hadoop dfsadmin -report
which reports the size of the HDFS as the size of my /home partition.
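As a quick check (assuming the Hadoop 1.2.1 report format), the relevant line can be pulled out with
hadoop dfsadmin -report | grep 'Configured Capacity'
to confirm the new configured capacity.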
Thank you jtravaglini for the comment/clue.