Can anyone tell the difference between the create-hive-table and hive-import methods? Both will create a Hive table, but what is the significance of each?
Sqoop is a tool that enables you to bulk import and export data from a database. You can use Sqoop to import data into HDFS or directly into Hive. However, Sqoop can only import data into Hive as a text file or as a SequenceFile.
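As a minimal sketch, the file format can be stated explicitly with --as-textfile (text is also the default for Hive imports); this reuses the employees table from the example further below:
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --hive-import --as-textfile -m 1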
Apache Sqoop is designed to efficiently transfer enormous volumes of data between Apache Hadoop and structured datastores such as relational databases. It helps to offload certain tasks, such as ETL processing, from an enterprise data warehouse to Hadoop, for efficient execution at a much lower cost.
Sqoop does not support creating Hive external tables. Instead, you might use the Sqoop codegen command to generate the SQL for creating the Hive internal table that matches your remote RDBMS table (see http://sqoop.apache.org/docs/1.4.2/SqoopUserGuide.html#_literal_sqoop_codegen_literal).
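A common workaround, sketched here under assumptions (the HDFS path /data/employees and the column names empid, empname, deptid are illustrative, not from the original), is to import into a plain HDFS directory and then declare an external table over it by hand:
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --target-dir /data/employees --fields-terminated-by ',' -m 1
hive> create external table employees_ext (empid int, empname string, deptid int) row format delimited fields terminated by ',' location '/data/employees';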
Sqoop has two main functions: importing and exporting. Importing transfers structured data into HDFS; exporting moves this data from Hadoop to external databases in the cloud or on-premises. Importing involves Sqoop assessing the external database's metadata before mapping it to Hadoop.
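For the export direction, a minimal sketch might look like this (the HDFS path is an assumption; --export-dir points at the data to push back to the RDBMS):
sqoop export --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --export-dir /user/hive/warehouse/employees --fields-terminated-by ',' -m 1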
hive-import command:
The hive-import command automatically populates the metadata for the imported table in the Hive metastore. If the table does not exist in Hive yet, Sqoop will simply create it based on the metadata fetched for your table or query. If the table already exists, Sqoop will import data into the existing table. If you're creating a new Hive table, Sqoop will convert the data types of each column from your source table to a type compatible with Hive.
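If the default type conversion is not what you want, it can be overridden per column with --map-column-hive; the column-to-type mapping below is only an assumed illustration:
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --hive-import --map-column-hive empid=INT,deptid=INT -m 1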
create-hive-table command:
Sqoop can generate a Hive table (using the create-hive-table command) based on a table in an existing relational data source. When the corresponding --create-hive-table option is set on an import, the job will fail if the target Hive table already exists; by default this property is false.
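For instance (a hedged sketch), adding --create-hive-table to a hive-import makes the job abort when the table is already present instead of appending to it:
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --hive-import --create-hive-table -m 1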
Using the create-hive-table command involves three steps: importing data into HDFS, creating the Hive table, and then loading the HDFS data into Hive. This can be shortened to one step by using hive-import.
During a hive-import, Sqoop first does a normal HDFS import to a temporary location. After a successful import, Sqoop generates two queries: one for creating the table and another for loading the data from the temporary location. You can specify the temporary location using either the --target-dir or --warehouse-dir parameter.
An example for the above description is added below.
Using the create-hive-table command:
This involves three steps:
1. Importing data from RDBMS to HDFS
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --split-by empid -m 1;
2. Creating the Hive table using the create-hive-table command
sqoop create-hive-table --connect jdbc:mysql://localhost:3306/hadoopexample --table employees --fields-terminated-by ',';
3. Loading the data into Hive
hive> load data inpath "employees" into table employees;
Loading data to table default.employees
Table default.employees stats: [numFiles=1, totalSize=70]
OK
Time taken: 2.269 seconds
hive> select * from employees;
OK
1001 emp1 101
1002 emp2 102
1003 emp3 101
1004 emp4 101
1005 emp5 103
Time taken: 0.334 seconds, Fetched: 5 row(s)
Using the hive-import command:
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table departments --split-by deptid -m 1 --hive-import;
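Two related options may also be useful here (a hedged sketch; the target table name dept_copy is an assumption): --hive-table picks the Hive table to load into, and --hive-overwrite replaces existing data instead of appending:
sqoop import --connect jdbc:mysql://localhost:3306/hadoopexample --table departments --split-by deptid -m 1 --hive-import --hive-table dept_copy --hive-overwrite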