
How to find the Hadoop HDFS directory on my system?

How do I find the Hadoop HDFS directory on my system? I need it to run the following command -

hadoop dfs -copyFromLocal <local-dir> <hdfs-dir>

In this command I don't know my hdfs-dir.

Not sure if it's helpful or not, but I ran the following command and got this output -

 hdfs dfs -ls
-rw-r--r--   3 popeye hdfs  127162942 2016-04-01 19:47 .

In hdfs-site.xml, I found the following entry -

<property>
      <name>dfs.datanode.data.dir</name>
      <value>/hadoop/hdfs/data</value>
      <final>true</final>
</property>

I tried to run the following command, but it gives an error -

[root@sandbox try]# hdfs dfs -copyFromLocal 1987.csv /hadoop/hdfs/data
copyFromLocal: `/hadoop/hdfs/data': No such file or directory

FYI - I am doing all this on a Hortonworks sandbox on an Azure server.

asked Apr 02 '16 by N..

People also ask

How do I view a directory in hadoop?

To browse the HDFS file system in the HDFS NameNode UI, select Utilities > Browse the file system. The Browse Directory page is populated. Enter the directory path and click Go!

Where is my HDFS path URL?

The Hadoop configuration file is located by default at /etc/hadoop/hdfs-site.xml. There you can find the dfs.namenode properties.
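If you'd rather not dig through the XML, you can also query a single property from the command line. A minimal sketch (the dfs.datanode.data.dir key is the one from the question's hdfs-site.xml):

# Print one configuration value from the active Hadoop configuration
hdfs getconf -confKey dfs.datanode.data.dir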

Where is HDFS in hadoop?

HDFS has a primary NameNode, which keeps track of where file data is kept in the cluster. HDFS also has multiple DataNodes on a commodity hardware cluster -- typically one per node in a cluster. The DataNodes are generally organized within the same rack in the data center.
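One way to see this layout on a running cluster (a minimal sketch; on most setups the full report requires HDFS superuser privileges):

# Ask the NameNode for a cluster summary, including each live DataNode
hdfs dfsadmin -report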

How do I list a directory in HDFS?

The following arguments are available with the hadoop ls command:

Usage: hadoop fs -ls [-d] [-h] [-R] [-t] [-S] [-r] [-u] <args>

Options:
-d: Directories are listed as plain files.
-h: Format file sizes in a human-readable fashion (e.g. 64.0m instead of 67108864).
-R: Recursively list subdirectories encountered.
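For example, a minimal sketch (assuming /user exists, as it does on most clusters):

# Recursively list /user with human-readable file sizes
hadoop fs -ls -R -h /user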


3 Answers

Your approach is wrong, or maybe your understanding is.

dfs.datanode.data.dir is the local filesystem path where the DataNode stores its data blocks; it is not an HDFS path you can copy into.

If you type hdfs dfs -ls / you will get the list of directories in HDFS. Then you can transfer files from local to HDFS with -copyFromLocal or -put into a particular directory, or create a new directory with -mkdir, as sketched below.
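A minimal sketch of that flow (assuming the asker's username popeye and the 1987.csv file from the question):

# See what already exists at the HDFS root
hdfs dfs -ls /

# Create a home directory for the user (-p creates missing parents)
hdfs dfs -mkdir -p /user/popeye

# Copy the local file into it
hdfs dfs -copyFromLocal 1987.csv /user/popeye/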

Refer to the link below for more information -

http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSCommands.html

answered by BruceWayne

If you run:

hdfs dfs -copyFromLocal foo.txt bar.txt

then the local file foo.txt will be copied into your own HDFS home directory as /user/popeye/bar.txt (where popeye is your username). As a result, the following achieves the same:

hdfs dfs -copyFromLocal foo.txt /user/popeye/bar.txt

Before copying any file into HDFS, just be certain to create the parent directory first, as sketched below. You don't have to put files in this "home" directory, but (1) it is better not to clutter "/" with all sorts of files, and (2) following this convention helps prevent conflicts with other users.
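A minimal sketch (assuming the same popeye username; foo.txt is the hypothetical local file from this answer):

# Create the parent ("home") directory once
hdfs dfs -mkdir -p /user/popeye

# Now the relative-path copy works
hdfs dfs -copyFromLocal foo.txt bar.txt

# Verify the result
hdfs dfs -ls /user/popeye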

answered by michael


Elaborating on the first answer, in more detail for Hadoop 1.x -

Suppose you are running this on a pseudo-distributed cluster; you will probably see one or two user directories listed.

On a fully distributed cluster, you need administrator rights to perform these steps, and there will be N user directories listed.

So now to the point -

First go to your Hadoop home directory and from there run this command -

bin/hadoop fs -ls /

The result will look like this -

drwxr-xr-x   - xuiob78126arif supergroup          0 2017-11-30 11:20 /user

Here xuiob78126arif is my user, and that user's HDFS home directory is -

/user/xuiob78126arif/

Now you can open this address in your browser -

http://xuiob78126arif:50070

and from there you can see the Cluster Summary, NameNode Storage, etc.

Note: the command will only return results if at least one file or directory already exists in HDFS; otherwise you will get -

ls: Cannot access .: No such file or directory.

So, in that case, first put a file with bin/hadoop fs -put <source file full path>

and thereafter run bin/hadoop fs -ls / again, as sketched below.
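A minimal sketch of that recovery (sample.txt is a hypothetical local file; xuiob78126arif is the user from the listing above):

# Hadoop 1.x style: put a local file into the user's HDFS home directory
bin/hadoop fs -put sample.txt /user/xuiob78126arif/

# Now the listing succeeds
bin/hadoop fs -ls /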

I hope this gives you some traction on your issue. Thanks.

answered by ArifMustafa