I would like to know what will happen if a hive SELECT and INSERT OVERWRITE is running at the same time. Please help me to understand what will hive query return in the below scenarios. Run the query first, while the query is running, INSERT OVERWRITE the same table. Run the INSERT OVERWRITE first, while overwriting, pull the data from the same table with SELECT. Are we going to get the old data, new data, mixed data, nothing, or unpredictable data? I am using MapR 4.0.1, Hive 0.13. Best regards, Ryan

Read Hive Locking: <blockquote> For a non-partitioned table, the lock modes are pretty intuitive. When the table is being read, a S lock is acquired, whereas an X lock is acquired for all other operations (insert into the table, alter table of any kind etc.) </blockquote> So SELECT and INSERT acquire incompatible locks so they can never run in parallel. One will acquire the lock first and the other will wait. For partitioned tables things are a bit more complex as the locks acquire are hierarchical (S on table, S/X on partition). Read the link.

What will happen if a hive(0.13) SELECT and INSERT OVERWRITE are running at the same time

1 Answers

Read Hive Locking:

For a non-partitioned table, the lock modes are pretty intuitive. When the table is being read, a S lock is acquired, whereas an X lock is acquired for all other operations (insert into the table, alter table of any kind etc.)

So SELECT and INSERT acquire incompatible locks so they can never run in parallel. One will acquire the lock first and the other will wait.

For partitioned tables things are a bit more complex as the locks acquire are hierarchical (S on table, S/X on partition). Read the link.

answered Oct 12 '22 23:10

Remus Rusanu

Related questions
                            
                                How To Refresh/Clear the DistributedCache When Using Hue + Beeswax To Run Hive Queries That Define Custom UDFs?
                            
                                Hive: work around for non equi left join
                            
                                Delta/Incremental Load in Hive
                            
                                Configured the HA Cluster with Hive-2.0.1(Derby Support) shows redundant database names?
                            
                                Connecting to Hive using python's Jaydebeapi
                            
                                Hive query too slow and failed
                            
                                Read data from remote hive on spark over JDBC returns empty result
                            
                                Presto: cast array<struct<key:string,value:array<string>>> into map<string,array<string>>
                            
                                Spark and Hive in Hadoop 3: Difference between metastore.catalog.default and spark.sql.catalogImplementation
                            
                                JSON SerDe for Hive that supports JSON arrays
                            
                                Hive alter location statement not working
                            
                                Apache Phoenix vs Hive-Spark
                            
                                howto add hive properties at runtime in spark-shell
                            
                                Spring-Batch for a massive nightly / hourly Hive / MySQL data processing
                            
                                Missing Hive Execution Jar: /usr/local/hadoop/hive/lib/hive-exec-*.jar
                            
                                Impala cannot find com.mysql.jdbc.Driver
                            
                                How to insert data into Parquet table in Hive
                            
                                Get sequential number of a row (rank) within a partition without using ROW_NUMBER() OVER function
                            
                                Cannot validate serde : org.openx.data.jsonserde.jsonserde
                            
                                Is it possible to concat a string field after group by in Hive

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What will happen if a hive(0.13) SELECT and INSERT OVERWRITE are running at the same time

Tags:

hive

fanwu72

People also ask

1 Answers

Remus Rusanu

Recent Activity

Donate For Us