Is it possible to overwrite the HDFS directory automatically during a Sqoop import, instead of having to delete it manually every time? (Do we have an option like "--overwrite", as we have "--hive-overwrite" for Hive imports?)
By default, Sqoop will fail such executions because it does not allow overwriting an existing directory in HDFS. There is a way to overcome this and replace the existing data: use the Sqoop directive --delete-target-dir together with the --target-dir parameter.
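A minimal sketch of such a command (the database, user, table, and path names here are hypothetical, not taken from the question):
$ sqoop import --connect jdbc:mysql://localhost/mydb --username myuser -P --table employees --delete-target-dir --target-dir /user/data/employees -m 1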
By default, if the Hive table already exists, the data will be appended to it. We can overwrite the data by specifying --hive-overwrite when performing a Sqoop import with --hive-import. We can then re-run the command and validate the result.
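For example, a sketch of a Hive-overwrite import (connection details, table, and Hive table name are assumed for illustration):
$ sqoop import --connect jdbc:mysql://localhost/mydb --username myuser -P --table employees --hive-import --hive-overwrite --hive-table default.employees -m 1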
By default, imports go to a new target location. If the destination directory already exists in HDFS, Sqoop will refuse to import and overwrite that directory's contents.
Use --delete-target-dir. It will delete the <HDFS-target-dir> provided in the command before writing data to that directory.
Use this: --delete-target-dir
This will overwrite the HDFS directory, using the following Sqoop syntax:
$ sqoop import --connect jdbc:mysql://localhost/dbname --username username -P --table tablename --delete-target-dir --target-dir '/targetdirectorypath' -m 1
E.g.:
$ sqoop import --connect jdbc:mysql://localhost/abc --username root -P --table empsqooptargetdel --delete-target-dir --target-dir '/tmp/sqooptargetdirdelete' -m 1
Every time it is run, this command will refresh the corresponding HDFS directory (or Hive table) with fresh, updated data.