How to Append new data to already existing hive table

Tags:

How to append the records to existing partitioned Hive table? For example I have existing external Table called "ip_country" and dataset is testdata1. If dataset grows say like my dataset in next day is testdata1 and testdata2 then how to append new data i.e.., "testdata2" to "ip_country" hive table.

550

asked May 13 '15 10:05

marjun

1 Answers

It can be achieved in couple of ways (Purely depends on your requirement)

If you don't bother about overwriting the existing records in the partition, (I mean you don't have a big history data, say 10 yrs data), then Insert Overwrite might fit.

INSERT OVERWRITE TABLE tablename1 [PARTITION (partcol1=val1, partcol2=val2 ...) [IF NOT EXISTS]] select_statement1 FROM from_statement;

If you don't bother about duplicates in the partition, then Insert Into might fit (Honestly I wudn't prefer to have duplicate records).

INSERT INTO TABLE tablename1 [PARTITION (partcol1=val1, partcol2=val2 ...)] select_statement1 FROM from_statement;

If you have history data plus Incremental data, then History data can be inserted once and the incremental data(based on the frequency that you choose daily/weekly/fortnightly basis) can be inserted using a Insert Overwrite

166

answered Nov 15 '22 06:11

Partha Kaushik

Related questions
                            
                                storing images in HBASE for processing and quick access
                            
                                Pig java.lang.NoSuchFieldException: jobsInProgress exception
                            
                                How to read a record that is split into multiple lines and also how to handle broken records during input split
                            
                                $HIVE_HOME/bin/hive --service hiveserver
                            
                                Hadoop component is not starting
                            
                                how does hadoop read input file?
                            
                                Stream decoding of Base64 data
                            
                                Hadoop on Local FileSystem
                            
                                Jar file for MapReduce new API Job.getInstance(Configuration, String)
                            
                                is is possible to count the number of partitions?
                            
                                How do I search for an item in an array in Hive?
                            
                                Apache Spark with custom InputFormat for HadoopRDD
                            
                                Spring support for WebHDFS
                            
                                Accessing read-only Google Storage buckets from Hadoop
                            
                                How build hadoop sources under windows?
                            
                                How to configure Hive warehouse path?
                            
                                NoSuchMethodError Sets.newConcurrentHashSet() while running jar using hadoop
                            
                                What is the "t" permission on HDFS directories?
                            
                                Difference between combiner and in-mapper combiner in mapreduce?
                            
                                concatenate a string to a field in pig

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to Append new data to already existing hive table

Tags:

hadoop

hive

marjun

People also ask

1 Answers

Partha Kaushik

Recent Activity

Donate For Us