Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

hadoop fs -du gives two data columns

I ran the following command on my test hadoop instance :

hadoop fs -du /test/data/

51179082   153537246   /test/data/9875/2016/02/03
46949272   140847816   /test/data/9875/2016/02/04

I understand du gives the size of files and directories, but for hadoop fs -du it is giving two sizes. Can some one help me understand what are these?

like image 750
Pradeep Avatar asked Feb 24 '16 22:02

Pradeep


People also ask

What is the difference between hadoop fs and HDFS DFS?

Yes, there's a difference between hadoop fs and hdfs dfs. hadoop fs is used to communicate with any file system. hdfs dfs is used to communicate particularly with hadoop distributed file system.

What does hadoop fs command does?

The Hadoop fs shell command put is similar to the copyFromLocal, which copies files or directory from the local filesystem to the destination in the Hadoop filesystem.

What is the output of HDFS DFS?

hdfs dfsadmin -report outputs a brief report on the overall HDFS filesystem. It's a useful command to quickly view how much disk is available, how many DataNodes are running, corrupted blocks etc. Note: This article explains the disk space calculations as seen by the HDFS.


1 Answers

One column is size of the file and the other column is size of the file after replication.

like image 81
Pradeep Avatar answered Sep 19 '22 18:09

Pradeep