Is there the equivalent for a `find` command in `hadoop`?

Tags:

I know that from the terminal, one can do a find command to find files such as :

find . -type d -name "*something*" -maxdepth 4

But, when I am in the hadoop file system, I have not found a way to do this.

hadoop fs -find ....

throws an error.

How do people traverse files in hadoop? I'm using hadoop 2.6.0-cdh5.4.1.

334

asked Oct 01 '15 20:10

makansij

1 Answers

hadoop fs -find was introduced in Apache Hadoop 2.7.0. Most likely you're using an older version hence you don't have it yet. see: HADOOP-8989 for more information.

In the meantime you can use

hdfs dfs -ls -R <pattern>

e.g,: hdfs dfs -ls -R /demo/order*.*

but that's not as powerful as 'find' of course and lacks some basics. From what I understand people have been writing scripts around it to get over this problem.

116

answered Sep 19 '22 01:09

Legato

Related questions
                            
                                How to extract selected values from json string in Hive
                            
                                hadoop aws versions compatibility
                            
                                Max/Min for whole sets of records in PIG
                            
                                Storing results of UNION in PIG in a single file
                            
                                Difference between PIG local and mapreduce mode
                            
                                YarnException: Unauthorized request to start container
                            
                                Which nodejs library should I use to write into HDFS?
                            
                                wiping out the Zookeeper data directory
                            
                                Can I cluster by/bucket a table created via "CREATE TABLE AS SELECT....." in Hive?
                            
                                YARN UNHEALTHY nodes
                            
                                Search for a particular text in a string - Hive
                            
                                Insert timestamp into Hive
                            
                                Hadoop : start-dfs.sh Connection refused
                            
                                Hadoop Streaming - Unable to find file error
                            
                                Hadoop job taking input files from multiple directories
                            
                                http request to webhdfs, but empty reply from server
                            
                                sqoop import multiple tables
                            
                                Hadoop : Provide directory as input to MapReduce job
                            
                                java.net.ConnectException: Connection refused error when running Hive
                            
                                "Wrong FS... expected: file:///" when trying to read file from HDFS in Java

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there the equivalent for a `find` command in `hadoop`?

Tags:

terminal

hadoop

hadoop2

hdfs

makansij

People also ask

1 Answers

Legato

Recent Activity

Donate For Us