I just downloaded the Hortonworks sandbox VM; it ships with Hadoop 2.7.1. I am adding some files using the
hadoop fs -put /hw1/* /hw1
...command. After that I delete the added files with the
hadoop fs -rm /hw1/*
...command, and then empty the trash with the
hadoop fs -expunge
...command. But the DFS Remaining space does not change after the trash is emptied, even though I can see that the data really was removed from /hw1/ and from the trash. I have the fs.trash.interval parameter set to 1.
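For reference, the trash contents and the effective trash interval can be checked like this (assuming the sandbox's default root user, so the trash lives under /user/root/.Trash):
# List whatever is still sitting in the current user's trash
hdfs dfs -ls -R /user/root/.Trash
# Show the configured trash interval, in minutes
hdfs getconf -confKey fs.trash.interval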
In fact I can still find all my data, split into chunks, in the /hadoop/hdfs/data/current/BP-2048114545-10.0.2.15-1445949559569/current/finalized/subdir0/subdir2
folder, which really surprises me, because I expected it to be deleted.
So my question is: how do I delete the data in such a way that it is really deleted? After a few rounds of adding and deleting files I have exhausted the free space.
Any file stored in HDFS is split into blocks (chunks of data), and each block is replicated 3 times by default. When you delete a file, you only remove the metadata on the NameNode that points to those blocks. The blocks themselves are deleted by the DataNodes once no reference to them remains in the NameNode metadata; this happens asynchronously as the DataNodes process the deletion commands, so the space is not reclaimed instantly.
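A quick way to see which blocks a path still references, and on which DataNodes the replicas live (using the /hw1 directory from the question):
# Show files, their blocks, and the DataNodes holding each replica
hdfs fsck /hw1 -files -blocks -locations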
Use the hdfs dfs -ls command to list the files in a Hadoop archive by specifying the archive's location. Note that the modified parent argument causes the files to be archived relative to /user/.
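For example (the archive path below is only an illustration, not something from the question):
# List the contents of a Hadoop archive through the har:// filesystem
hdfs dfs -ls har:///user/zoo/foo.har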
Try hadoop fs -rm -R URI
The -R option deletes the directory and any content under it recursively.
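For example, using the /hw1 directory from the question; the -skipTrash variant bypasses the trash so the blocks can be freed without waiting for the trash interval:
# Recursively delete /hw1, moving it to the trash first
hadoop fs -rm -R /hw1
# Recursively delete /hw1 and bypass the trash entirely
hadoop fs -rm -R -skipTrash /hw1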
You can use
hdfs dfs -rm -R /path/to/HDFS/file
since hadoop dfs has been deprecated.
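If the files were already deleted into the trash, force-empty it and then confirm that the space has actually been reclaimed:
# Empty the trash immediately instead of waiting for fs.trash.interval
hdfs dfs -expunge
# Check the cluster-wide DFS Remaining figure
hdfs dfsadmin -report | grep "DFS Remaining"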