I have a file (testfile) on HDFS and I want to know how many lines it contains.
In Linux, I can do:
wc -l <filename>
Can I do something similar with the "hadoop fs" command? I can print the file contents with:
hadoop fs -text /user/mklein/testfile
How do I find out how many lines it has? I want to avoid copying the file to the local filesystem and then running the wc command.
Note: my file is compressed with snappy, which is why I have to use -text instead of -cat.
hdfs dfs -count counts the number of directories, files, and bytes under the paths that match the specified file pattern. For example:
hdfs dfs -count hdfs://nn1.example.com/file1 hdfs://nn2.example.com/file2
hdfs dfs -count -q hdfs://nn1.example.com/file1
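The default -count output has one line per path with the columns DIR_COUNT, FILE_COUNT, CONTENT_SIZE, and PATHNAME. As an illustrative sketch, using the question's own directory rather than the example hosts above:
hdfs dfs -count /user/mklein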
We can use the Hadoop file system check command (fsck) to find the blocks for a specific file.
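For example (a sketch using the file path from the question), the following prints the file's size and its block list:
hdfs fsck /user/mklein/testfile -files -blocks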
Total number of files: hadoop fs -ls /path/to/hdfs/* | wc -l
Total number of lines: hadoop fs -cat /path/to/hdfs/* | wc -l
Total number of lines for a given file: hadoop fs -cat /path/to/hdfs/filename | wc -l
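Note that -cat only works for plain text; because the file in the question is snappy-compressed, pipe -text (which decompresses the data) into wc instead:
hadoop fs -text /user/mklein/testfile | wc -l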