Hadoop has configuration parameter <code>hadoop.tmp.dir</code> which, as per documentation, is `"A base for other temporary directories." I presume, this path refers to local file system. I set this value to <code>/mnt/hadoop-tmp/hadoop-${user.name}</code>. After formatting the namenode and starting all services, I see exactly same path created on HDFS. Does this mean, <code>hadoop.tmp.dir</code> refers to temporary location on HDFS?

It's confusing, but <code>hadoop.tmp.dir</code> is used as the base for temporary directories locally, and also in HDFS. The document isn't great, but <code>mapred.system.dir</code> is set by default to <code>"${hadoop.tmp.dir}/mapred/system"</code>, and this defines the Path on the HDFS where where the Map/Reduce framework stores system files. If you want these to not be tied together, you can edit your <code>mapred-site.xml</code> such that the definition of mapred.system.dir is something that's not tied to <code>${hadoop.tmp.dir}</code>

What should be hadoop.tmp.dir ?

2 Answers

It's confusing, but hadoop.tmp.dir is used as the base for temporary directories locally, and also in HDFS. The document isn't great, but mapred.system.dir is set by default to "${hadoop.tmp.dir}/mapred/system", and this defines the Path on the HDFS where where the Map/Reduce framework stores system files.

If you want these to not be tied together, you can edit your mapred-site.xml such that the definition of mapred.system.dir is something that's not tied to ${hadoop.tmp.dir}

196

answered Sep 26 '22 09:09

kkrugler

Let me add a bit more to kkrugler's answer:

There're three HDFS properties which contain hadoop.tmp.dir in their values

dfs.name.dir: directory where namenode stores its metadata, with default value ${hadoop.tmp.dir}/dfs/name.
dfs.data.dir: directory where HDFS data blocks are stored, with default value ${hadoop.tmp.dir}/dfs/data.
fs.checkpoint.dir: directory where secondary namenode store its checkpoints, default value is ${hadoop.tmp.dir}/dfs/namesecondary.

This is why you saw the /mnt/hadoop-tmp/hadoop-${user.name} in your HDFS after formatting namenode.

answered Sep 24 '22 09:09

darcyy

Related questions
                            
                                How does Hadoop Namenode failover process works?
                            
                                How to change date format in hive?
                            
                                Iterate twice on values (MapReduce)
                            
                                Does Hive have something equivalent to DUAL?
                            
                                Hadoop input split size vs block size
                            
                                How to unzip .gz files in a new directory in hadoop?
                            
                                What is sequence file in hadoop?
                            
                                Books to start learning big data [closed]
                            
                                Unable to start cygwin sshd service
                            
                                How to check if Hadoop daemons are running?
                            
                                hadoop fs -put command
                            
                                What does msck stands for in Msck repair command
                            
                                How to copy data from one HDFS to another HDFS?
                            
                                How does Spark running on YARN account for Python memory usage?
                            
                                What is the advantage of storing schema in avro?
                            
                                Parquet without Hadoop?
                            
                                Writing to HDFS could only be replicated to 0 nodes instead of minReplication (=1)
                            
                                How to rename a hive table without changing location?
                            
                                Best splittable compression for Hadoop input = bz2?
                            
                                How do I copy files from S3 to Amazon EMR HDFS?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What should be hadoop.tmp.dir ?

Tags:

config

hadoop

hdfs

Shashikant Kore

People also ask

2 Answers

kkrugler

darcyy

Recent Activity

Donate For Us