I ran the following command on my test hadoop instance :
hadoop fs -du /test/data/
51179082 153537246 /test/data/9875/2016/02/03
46949272 140847816 /test/data/9875/2016/02/04
I understand du
gives the size of files and directories, but for hadoop fs -du
it is giving two sizes. Can some one help me understand what are these?
Yes, there's a difference between hadoop fs and hdfs dfs. hadoop fs is used to communicate with any file system. hdfs dfs is used to communicate particularly with hadoop distributed file system.
The Hadoop fs shell command put is similar to the copyFromLocal, which copies files or directory from the local filesystem to the destination in the Hadoop filesystem.
hdfs dfsadmin -report outputs a brief report on the overall HDFS filesystem. It's a useful command to quickly view how much disk is available, how many DataNodes are running, corrupted blocks etc. Note: This article explains the disk space calculations as seen by the HDFS.
One column is size of the file and the other column is size of the file after replication.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With