I have a complex SELECT query and a huge table. While this SELECT statement is running, an UPDATE statement arrives and tries to update the same table. As far as I understand, an UPDATE requires an exclusive lock, so the UPDATE statement will have to wait until the SELECT has finished.

1. Am I right?
2. What can I do to execute the complex SELECT and still let the UPDATE run? (Currently I don't care about dirty data.)


SQL Server - does [SELECT] lock [UPDATE]?

2 Answers

Yes - to a degree.

How long a SELECT holds on to a shared lock depends on the isolation level of the transaction:

READ UNCOMMITTED - no shared lock is acquired at all - UPDATE is not blocked
READ COMMITTED - shared lock is acquired just for the duration of reading the data - UPDATE might be blocked for a very short period of time
REPEATABLE READ and SERIALIZABLE - shared lock is acquired and held on to until the end of the transaction - UPDATE is blocked until the SELECT transaction ends
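Since the asker does not mind dirty data, the first option in the list above can be sketched as follows. This is only an illustration: dbo.BigTable and its columns are hypothetical names, not taken from the question.

```sql
-- Hypothetical table and column names; substitute your own.
-- The setting applies to the current session until it is changed again.
SET TRANSACTION ISOLATION LEVEL READ UNCOMMITTED;

SELECT CustomerId, SUM(Amount) AS Total
FROM dbo.BigTable
GROUP BY CustomerId;

-- Per-table alternative: the NOLOCK hint requests the same behavior
-- for a single table reference only.
SELECT CustomerId, SUM(Amount) AS Total
FROM dbo.BigTable WITH (NOLOCK)
GROUP BY CustomerId;

-- Return the session to the default level afterwards.
SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
```

Keep in mind that a read without shared locks can return uncommitted values, and during a scan it may also miss rows or see a row twice if concurrent changes move data around.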

Technically, the UPDATE statement first acquires an update (U) lock, which is compatible with a shared lock (as used by the SELECT), while it reads the current values of the rows to be updated.

Once that's done, the update lock is converted to an exclusive (X) lock so that the new data can be written to the table. An exclusive lock is not compatible with a shared lock, so this conversion is the point at which the UPDATE has to wait if the SELECT still holds a shared lock on the row.
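To see which shared, update and exclusive locks are actually held or waited for, one option is to query the sys.dm_tran_locks view from a third session while the SELECT and UPDATE are running. A minimal sketch, assuming you have permission to view server state:

```sql
-- Run in a separate session, in the database that holds the table.
SELECT request_session_id,   -- session that holds or requests the lock
       resource_type,        -- e.g. KEY, RID, PAGE, OBJECT
       request_mode,         -- e.g. S, U, X, IS, IX
       request_status        -- whether the lock is granted, waiting, or being converted
FROM sys.dm_tran_locks
WHERE resource_database_id = DB_ID()
ORDER BY request_session_id, resource_type;
```

A row whose request is still waiting, next to a granted row on the same resource from another session, shows who is blocking whom.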


answered Sep 28 '22 04:09

marc_s

When you run the two statements concurrently (a SELECT and an UPDATE), the actual behavior is essentially unpredictable, because neither operation is instantaneous. To simplify, think of your table as a list that the SELECT traverses one row at a time, while the UPDATE tries to modify one or more rows. If the UPDATE modifies a row behind the SELECT, nothing happens (no blocking), because the SELECT has already moved past that point. If the UPDATE targets the row the SELECT is looking at right now, the UPDATE has to wait for the SELECT to move on, which happens very quickly; the UPDATE then unblocks and succeeds while the SELECT moves ahead. But if the UPDATE modifies a row ahead of the SELECT, the UPDATE succeeds and the SELECT, when it eventually reaches that row, stops, blocked. The SELECT then has to wait until the transaction that did the UPDATE commits.

That is the simplified story; real life is much more complicated. The SELECT can have multiple read points (parallel plans). Both the SELECT and the UPDATE choose an access path, meaning they may use one or more secondary indexes to locate the rows. Complex queries may contain operators that cause multiple lookups into a table (e.g. joins). Both the SELECT and the UPDATE can do bookmark lookups to fetch BLOB data, which significantly changes the locking behavior. Cardinality estimation may cause the SELECT to run with coarse-grained locks (e.g. a table-level shared lock). The UPDATE can trigger lock escalation, and the escalation can fail or succeed. Choosing different access paths can lead to deadlocks. False lock contention can occur due to hash collisions. A myriad of variables have a say in this, and I haven't even mentioned the higher isolation levels (REPEATABLE READ, SERIALIZABLE).
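The "row ahead of the SELECT" case can be reproduced with two sessions. The following is a sketch under stated assumptions: dbo.BigTable, Id and Amount are hypothetical names, and the database uses the default READ COMMITTED level without row versioning.

```sql
-- Session 1: update one row and leave the transaction open.
BEGIN TRANSACTION;
UPDATE dbo.BigTable
SET Amount = Amount + 1
WHERE Id = 42;          -- the exclusive lock on this row is held until COMMIT or ROLLBACK

-- Session 2: scan the table.
SELECT Id, Amount
FROM dbo.BigTable;      -- waits once the scan reaches the locked row

-- Session 1: release the lock; session 2 then continues.
COMMIT TRANSACTION;
```

Run the statements in two separate connections, in the order shown.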

Perhaps you should use SNAPSHOT isolation and stop worrying about this issue?
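A minimal sketch of what switching to SNAPSHOT isolation could look like. MyDatabase and dbo.BigTable are hypothetical names, and the database option needs to be enabled once before sessions can request this level.

```sql
-- One-time database setting (hypothetical database name).
ALTER DATABASE MyDatabase SET ALLOW_SNAPSHOT_ISOLATION ON;

-- In the session that runs the long SELECT:
SET TRANSACTION ISOLATION LEVEL SNAPSHOT;

BEGIN TRANSACTION;
SELECT CustomerId, SUM(Amount) AS Total
FROM dbo.BigTable
GROUP BY CustomerId;    -- reads row versions, so it neither blocks nor is blocked by the UPDATE
COMMIT TRANSACTION;
```

Under this level the SELECT sees committed data as of the start of its transaction rather than dirty data. The trade-off is that SQL Server has to keep row versions in tempdb, which adds some overhead to writes.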

answered Sep 28 '22 05:09

Remus Rusanu


Tags: performance, sql, sql-server, sql-server-2005, locking

Royi Namir
