I have Hadoop installed in this location <blockquote> /usr/local/hadoop$ </blockquote> Now I want to list the files inside the dfs. The command I used is : <blockquote> hduser@ubuntu:/usr/local/hadoop$ bin/hadoop dfs -ls </blockquote> This gave me the files in the dfs <pre class="prettyprint"><code>Found 3 items drwxr-xr-x - hduser supergroup 0 2014-03-20 03:53 /user/hduser/gutenberg drwxr-xr-x - hduser supergroup 0 2014-03-24 22:34 /user/hduser/mytext-output -rw-r--r-- 1 hduser supergroup 126 2014-03-24 22:30 /user/hduser/text.txt </code></pre> Next time, I tried the same command in a different manner <blockquote> hduser@ubuntu:/usr/local/hadoop$ hadoop dfs -ls </blockquote> It also gave me the same result. Could some one please explain why both are working despite of executing the ls command from different folders. I hope you guys understood my question.Just explain me difference between these two : <pre class="prettyprint"><code>hduser@ubuntu:/usr/local/hadoop$ bin/hadoop dfs -ls hduser@ubuntu:/usr/local/hadoop$ hadoop dfs -ls </code></pre>

In unix an executable file can be executed in two ways, either by giving the absolute/relative path or commands in system executables path(path should be specified in PATH variable) When you execute <code>bin/hadoop dfs -ls</code> should be inside the directory /usr/local/hadoop. Or <code>/usr/local/hadoop/bin/hadoop dfs -ls</code> will also work There is one environment variable PATH in unix which keeps in the list of executable location by default it keeps the following path <code>/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin:</code> . Whenever we execute any command like ls, mkdir etc it is taking from the one location in PATH variable. When you give the command hadoop(it will be taken from the path /usr/local/hadoop/bin/). Since you have specified the path /usr/local/hadoop/bin/ in PATH variable. Use the following command to check the value of your PATH variable <pre class="prettyprint"><code>echo $PATH </code></pre>

You set a hadoop global path <code>HADOOP_HOME</code> in your <code>~/.bashrc</code> file so that Hadoop commands will works in anywhere in Terminal.

Hadoop commands

Tags:

hadoop

hdfs

I have Hadoop installed in this location

/usr/local/hadoop$

Now I want to list the files inside the dfs. The command I used is :

hduser@ubuntu:/usr/local/hadoop$ bin/hadoop dfs -ls

This gave me the files in the dfs

Click to copy

Found 3 items
drwxr-xr-x   - hduser supergroup          0 2014-03-20 03:53 /user/hduser/gutenberg
drwxr-xr-x   - hduser supergroup          0 2014-03-24 22:34 /user/hduser/mytext-output
-rw-r--r--   1 hduser supergroup        126 2014-03-24 22:30 /user/hduser/text.txt

Next time, I tried the same command in a different manner

hduser@ubuntu:/usr/local/hadoop$ hadoop dfs -ls

It also gave me the same result.

Could some one please explain why both are working despite of executing the ls command from different folders. I hope you guys understood my question.Just explain me difference between these two :

Click to copy

hduser@ubuntu:/usr/local/hadoop$ bin/hadoop dfs -ls
hduser@ubuntu:/usr/local/hadoop$ hadoop dfs -ls

405

asked Mar 26 '14 07:03

Francis S

2 Answers

In unix an executable file can be executed in two ways, either by giving the absolute/relative path or commands in system executables path(path should be specified in PATH variable)

When you execute bin/hadoop dfs -ls should be inside the directory /usr/local/hadoop. Or /usr/local/hadoop/bin/hadoop dfs -ls will also work

There is one environment variable PATH in unix which keeps in the list of executable location by default it keeps the following path /usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin: . Whenever we execute any command like ls, mkdir etc it is taking from the one location in PATH variable. When you give the command hadoop(it will be taken from the path /usr/local/hadoop/bin/). Since you have specified the path /usr/local/hadoop/bin/ in PATH variable. Use the following command to check the value of your PATH variable

Click to copy

echo $PATH

111

answered Sep 26 '22 04:09

SachinJ

You set a hadoop global path HADOOP_HOME in your ~/.bashrc file so that Hadoop commands will works in anywhere in Terminal.

answered Sep 24 '22 04:09

venkat4143

Related questions
                            
                                Differences between existing MapReduce and YARN (MRv2)
                            
                                spark on yarn; how to send metrics to graphite sink?
                            
                                Hadoop 2.x -- how to configure secondary namenode?
                            
                                query hive partitioned table over date/time range
                            
                                Kafka Memory requirement
                            
                                How to know the exact block size of a file on a Hadoop node?
                            
                                Hadoop HDFS - Difference between Missing replica and Under replicated blocks
                            
                                hdfs copy multiple files to same target directory
                            
                                Hadoop streaming job failure: Task process exit with nonzero status of 137
                            
                                finding mean using pig or hadoop
                            
                                Merging multiple sequence files into one sequencefile within Hadoop
                            
                                Hadoop and Amazon Web Services [closed]
                            
                                Map Reduce output to CSV or do I need Key Values?
                            
                                What kind of JBOD in hadoop? and COW with hadoop?
                            
                                How to set the VCORES in hadoop mapreduce/yarn?
                            
                                HIVE Insert overwrite into a partitioned Table
                            
                                How can I check the settings in hive CLI?
                            
                                Why declaring Mapper and Reducer classes as static?
                            
                                AWS EMR performance HDFS vs S3
                            
                                Usecases for mapred.job.queue.name

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Hadoop commands

Tags:

hadoop

hdfs

Francis S

People also ask

2 Answers

SachinJ

venkat4143

Recent Activity

Donate For Us