According to the <code>UPDATE</code> documentation, an <code>UPDATE</code> always acquires an exclusive lock on the whole table. However, I am wondering if the exclusive lock is acquired before the rows to be updated are determined or only just before the actual update. My concrete problem is that I have a nested <code>SELECT</code> in my <code>UPDATE</code> like this: <pre class="prettyprint"><code>UPDATE Tasks SET Status = 'Active' WHERE Id = (SELECT TOP 1 Id FROM Tasks WHERE Type = 1 AND (SELECT COUNT(*) FROM Tasks WHERE Status = 'Active') = 0 ORDER BY Id) </code></pre> Now I am wondering whether it is really guaranteed that there is exactly one task with <code>Status = 'Active'</code> afterwards if in parallel the same statement may be executed with another Type: <pre class="prettyprint"><code>UPDATE Tasks SET Status = 'Active' WHERE Id = (SELECT TOP 1 Id FROM Tasks WHERE Type = 2 -- <== The only difference AND (SELECT COUNT(*) FROM Tasks WHERE Status = 'Active') = 0 ORDER BY Id) </code></pre> If for both statements the rows to change would be determined before the lock is acquired, I could end up with two active tasks which I must prevent. If this is the case, how can I prevent it? Can I prevent it without setting the transaction level to <code>SERIALIZABLE</code> or messing with lock hints? From the answer to Is a single SQL Server statement atomic and consistent? I learned that the problem arises when the nested <code>SELECT</code> accesses another table. However, I'm not sure if I have to care about this issue if only the updated table is concerned.

If you want exactly one task with static = active, then set up the table to ensure this is true. Use a filtered unique index: <pre class="prettyprint"><code>create unique index unq_tasks_status_filter_active on tasks(status) where status = 'Active'; </code></pre> A second concurrent <code>update</code> might fail, but you will be ensured of uniqueness. Your application code can process such failed updates, and re-try. Relying on the actual execution plans of the updates might be dangerous. That is why it is safer to have the database do such validations. Underlying implementation details could vary, depending on the environment and version of SQL Server. For instance, what works in a single threaded, single processor environment may not work in a parallel environment. What works with one isolation level may not work with another. EDIT: And, I cannot resist. For efficiency purposes, consider writing the query as: <pre class="prettyprint"><code>UPDATE Tasks SET Status = 'Active' WHERE NOT EXISTS (SELECT 1 FROM Tasks WHERE Status = 'Active' ) AND Id = (SELECT TOP 1 Id FROM Tasks WHERE Type = 2 -- <== The only difference ORDER BY Id ); </code></pre> Then place indexes on <code>Tasks(Status)</code> and <code>Tasks(Type, Id)</code>. In fact, with the right query, you might find that the query is so fast (despite the update on the index) that your worry about current updates is greatly mitigated. This would not solve a race condition, but it might at least make it rare. And if you are capturing errors, then with the unique filtered index, you could just do: <pre class="prettyprint"><code>UPDATE Tasks SET Status = 'Active' WHERE Id = (SELECT TOP 1 Id FROM Tasks WHERE Type = 2 -- <== The only difference ORDER BY Id ); </code></pre> This will return an error if a row already is active. Note: all these queries and concepts can be applied to "one active per group". This answer is addressing the question that you asked. If you have a "one active per group" problem, then consider asking another question.

Is a transaction that only updates a single table always isolated?

Tags:

sql

sql-server

transaction-isolation

According to the UPDATE documentation, an UPDATE always acquires an exclusive lock on the whole table. However, I am wondering if the exclusive lock is acquired before the rows to be updated are determined or only just before the actual update.

My concrete problem is that I have a nested SELECT in my UPDATE like this:

UPDATE Tasks
SET Status = 'Active'
WHERE Id = (SELECT TOP 1 Id 
            FROM Tasks
            WHERE Type = 1
                AND (SELECT COUNT(*) 
                     FROM Tasks 
                     WHERE Status = 'Active') = 0
            ORDER BY Id)

Now I am wondering whether it is really guaranteed that there is exactly one task with Status = 'Active' afterwards if in parallel the same statement may be executed with another Type:

UPDATE Tasks
SET Status = 'Active'
WHERE Id = (SELECT TOP 1 Id 
            FROM Tasks
            WHERE Type = 2           -- <== The only difference
                AND (SELECT COUNT(*) 
                     FROM Tasks 
                     WHERE Status = 'Active') = 0
            ORDER BY Id)

If for both statements the rows to change would be determined before the lock is acquired, I could end up with two active tasks which I must prevent.

If this is the case, how can I prevent it? Can I prevent it without setting the transaction level to SERIALIZABLE or messing with lock hints?

From the answer to Is a single SQL Server statement atomic and consistent? I learned that the problem arises when the nested SELECT accesses another table. However, I'm not sure if I have to care about this issue if only the updated table is concerned.

644

asked Jan 08 '16 14:01

lex82

1 Answers

If you want exactly one task with static = active, then set up the table to ensure this is true. Use a filtered unique index:

create unique index unq_tasks_status_filter_active on tasks(status)
    where status = 'Active';

A second concurrent update might fail, but you will be ensured of uniqueness. Your application code can process such failed updates, and re-try.

Relying on the actual execution plans of the updates might be dangerous. That is why it is safer to have the database do such validations. Underlying implementation details could vary, depending on the environment and version of SQL Server. For instance, what works in a single threaded, single processor environment may not work in a parallel environment. What works with one isolation level may not work with another.

EDIT:

And, I cannot resist. For efficiency purposes, consider writing the query as:

UPDATE Tasks
    SET Status = 'Active'
    WHERE NOT EXISTS (SELECT 1
                      FROM Tasks
                      WHERE Status = 'Active'
                     ) AND
          Id = (SELECT TOP 1 Id 
                FROM Tasks
                WHERE Type = 2           -- <== The only difference
                ORDER BY Id
               );

Then place indexes on Tasks(Status) and Tasks(Type, Id). In fact, with the right query, you might find that the query is so fast (despite the update on the index) that your worry about current updates is greatly mitigated. This would not solve a race condition, but it might at least make it rare.

And if you are capturing errors, then with the unique filtered index, you could just do:

UPDATE Tasks
    SET Status = 'Active'
    WHERE Id = (SELECT TOP 1 Id 
                FROM Tasks
                WHERE Type = 2           -- <== The only difference
                ORDER BY Id
               );

This will return an error if a row already is active.

Note: all these queries and concepts can be applied to "one active per group". This answer is addressing the question that you asked. If you have a "one active per group" problem, then consider asking another question.

186

answered Oct 20 '22 00:10

Gordon Linoff

Related questions
                            
                                Intersect in SQL Server
                            
                                How to create the fastest possible database on a SQL Server 2012 cluster, sacrificing any durability
                            
                                How to get all child of each records of a self-referenced table
                            
                                Select percentage of rows from SQL table?
                            
                                Database Transactions: Difference between 'write skew' and 'lost update'
                            
                                Why am I seeing "COLLATION 'xxx' is not valid for CHARACTER SET 'yyy'"
                            
                                SQL Server Select Distinct and Order By with CASE
                            
                                Why remainder(35,10) is -5 when remainder(25,10) is 5 in oracle?
                            
                                In SQL coding can DEFERRABLE be used in TRIGGER ? How does DEFERRABLE work?
                            
                                Find the largest sum of three sequential values in SQL?
                            
                                Saving psql output to csv file
                            
                                How to Handle DATEDIFF(MINUTE, '00:00', '24:20') Like scenario?
                            
                                How to get the id of a database entity with Persistent?
                            
                                Get records that have all sub records
                            
                                How to display the string Enum values instead of the number value using SQL
                            
                                how to edit column on laravel?
                            
                                default value for $_POST[];
                            
                                How to do a where exists in nested query with SQLAlchemy?
                            
                                SQL plan compilation and truth tables
                            
                                DENSE_RANK according to particular order

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With