Probably a noob question, but is there a way to read the contents of a file in HDFS other than copying it to the local filesystem and reading it through Unix tools?
So right now what I am doing is:
bin/hadoop dfs -copyToLocal hdfs/path local/path
nano local/path
I am wondering if I can open a file directly on HDFS rather than copying it locally and then opening it.
You can use the Hadoop filesystem shell to read any file; it provides a cat command for printing a file's contents.
Usage: hadoop fs -ls [-d] [-h] [-R] [-t] [-S] [-r] [-u] <args>
Options:
  -d  Directories are listed as plain files.
  -h  Format file sizes in a human-readable fashion (e.g. 64.0m instead of 67108864).
  -R  Recursively list subdirectories encountered.
  -t  Sort output by modification time (most recent first).
I believe hadoop fs -cat <file>
should do the job.
If the file is huge (which is usually the case with HDFS), running 'cat' will flood your terminal with the entire contents of the file. Instead, pipe the output and read only a few lines.
To get the first 10 lines of the file, hadoop fs -cat 'file path' | head -10
To get the last 5 lines of the file, hadoop fs -cat 'file path' | tail -5
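The same piping pattern works with any command that writes to stdout. Here is a minimal sketch using a plain local file in place of `hadoop fs -cat`, so it runs without a cluster; the file name `sample.txt` is just an illustration:

```shell
# Create a small sample file standing in for HDFS content.
printf 'line %d\n' 1 2 3 4 5 6 7 8 9 10 11 12 > sample.txt

# First 10 lines (on a cluster, swap "cat sample.txt"
# for "hadoop fs -cat /hdfs/path").
cat sample.txt | head -10

# Last 5 lines.
cat sample.txt | tail -5
```

Note that Hadoop also ships a built-in `hadoop fs -tail <file>`, which displays the last kilobyte of a file and avoids streaming the whole file through the pipe.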