Why can write skew happen in Repeatable Read?

Tags: database, isolation-level, acid

Wikipedia says:

Repeatable read:
In this isolation level, a lock-based concurrency control DBMS implementation keeps read and write locks (acquired on selected data) until the end of the transaction. However, range-locks are not managed, so phantom reads can occur.

Write skew is possible at this isolation level, a phenomenon where two writes are allowed to the same column(s) in a table by two different writers (who have previously read the columns they are updating), resulting in the column having data that is a mix of the two transactions.

I'm curious why write skew can happen under Repeatable Read. The article says that read and write locks are kept until the end of the transaction, and that write skew happens when the writers have previously read the columns they are updating. So how can one transaction acquire a write lock while another still holds a read lock?

479

asked Jan 24 '18 08:01

handora

1 Answer

The repeatable read isolation level guarantees that a row retrieved twice within the same transaction always has the same values. Many databases implement this by having each transaction read from a consistent snapshot of the database.

Some databases can detect lost updates (a special case of write skew) automatically, for example PostgreSQL at the repeatable read level and SQL Server at its snapshot isolation level, but others cannot (e.g., the InnoDB engine in MySQL at repeatable read).

Back to the write skew problem. There are situations that most database engines cannot detect at the repeatable read isolation level. One case is when two concurrent transactions read the same objects and then modify two different objects, which creates a race condition.

I'll take an example from the book Designing Data-Intensive Applications. Here is the scenario:

You are writing an application for doctors to manage their on-call shifts at a hospital. The hospital usually tries to have several doctors on call at any one time, but it absolutely must have at least one doctor on call. Doctors can give up their shifts (e.g., if they are sick themselves), provided that at least one colleague remains on call in that shift

The next interesting question is how to implement this on top of a database. Here is some pseudocode that mixes SQL with application logic:

BEGIN TRANSACTION;
    -- current_on_call = number of rows returned by this query
    SELECT * FROM doctors
        WHERE on_call = true
        AND shift_id = 1234;

    -- application-side check, not SQL
    if (current_on_call >= 2) {
        UPDATE doctors
        SET on_call = false WHERE name = 'Alice' AND shift_id = 1234;
    }
COMMIT;

Here is the illustration: [figure: Flow Data]

As the illustration shows, Alice and Bob run the SQL above concurrently. Both read that two doctors are on call, so both pass the check. However, they modify different rows: Alice updates Alice's record and Bob updates Bob's record. Since no row is written by both transactions, a database at the repeatable read isolation level has no way to know that the condition (at least two doctors on call) no longer holds by the time each update is applied. Write skew has happened.
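To make the race concrete, here is a minimal toy model in Python. It is only a sketch under stated assumptions: `SnapshotDB` and `SnapshotTxn` are hypothetical names, each transaction reads from a private snapshot, and a commit is rejected only when another transaction has already committed a write to the same row. No real engine is implemented this way, but the sketch shows why disjoint write sets slip through.

```python
class SnapshotDB:
    """Toy store (hypothetical): committed rows plus a per-row version counter."""
    def __init__(self, rows):
        self.rows = dict(rows)                       # {doctor name: on_call flag}
        self.version = {name: 0 for name in rows}    # bumped on each committed write


class SnapshotTxn:
    """Toy transaction (hypothetical) that reads from a snapshot taken at BEGIN."""
    def __init__(self, db):
        self.db = db
        self.snapshot = dict(db.rows)    # consistent snapshot of the committed rows
        self.seen = dict(db.version)     # row versions visible in that snapshot
        self.writes = {}

    def count_on_call(self):
        return sum(self.snapshot.values())   # always answered from the snapshot

    def set_on_call(self, name, value):
        self.writes[name] = value

    def commit(self):
        # First committer wins, but only rows that this transaction wrote are checked.
        for name in self.writes:
            if self.db.version[name] != self.seen[name]:
                raise RuntimeError("write-write conflict on " + name)
        for name, value in self.writes.items():
            self.db.rows[name] = value
            self.db.version[name] += 1


db = SnapshotDB({"Alice": True, "Bob": True})
alice, bob = SnapshotTxn(db), SnapshotTxn(db)

# Both transactions see two doctors on call, so both pass the check.
if alice.count_on_call() >= 2:
    alice.set_on_call("Alice", False)
if bob.count_on_call() >= 2:
    bob.set_on_call("Bob", False)

alice.commit()
bob.commit()          # no conflict: the write sets {Alice} and {Bob} are disjoint
on_call_after = sum(db.rows.values())
print(on_call_after)  # 0: nobody is on call, the invariant is broken
```

If both transactions had written the same row, the second `commit()` would raise, which corresponds to the lost update case that some engines do detect.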

There are two common ways to solve this problem:

Method 1: manually lock all the rows that the check depends on (every doctor on call for the shift). Then either Bob or Alice has to wait until the other finishes their transaction.

Here is some pseudocode using a SELECT ... FOR UPDATE query.

BEGIN TRANSACTION;
    -- important: FOR UPDATE locks all rows returned by this query
    -- current_on_call = number of rows returned
    SELECT * FROM doctors
        WHERE on_call = true
        AND shift_id = 1234 FOR UPDATE;

    -- application-side check, not SQL
    if (current_on_call >= 2) {
        UPDATE doctors
        SET on_call = false WHERE name = 'Alice' AND shift_id = 1234;
    }
COMMIT;
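
The effect of the explicit lock can be sketched with a toy Python model. This is an assumption-laden illustration, not database code: a single `threading.Lock` (the hypothetical `shift_rows_lock`) stands in for the row locks taken by FOR UPDATE, and the `doctors` dictionary stands in for the table. Real engines differ in what the waiting transaction sees once it gets the lock, and some abort it so that it has to retry.

```python
import threading

doctors = {"Alice": True, "Bob": True}   # on_call flag per doctor for one shift
shift_rows_lock = threading.Lock()        # stands in for the FOR UPDATE row locks
gave_up = {}                              # records whether each request succeeded


def give_up_shift(name):
    # Holding the lock from the read until the write mirrors
    # SELECT ... FOR UPDATE followed by UPDATE and COMMIT.
    with shift_rows_lock:
        current_on_call = sum(doctors.values())
        if current_on_call >= 2:
            doctors[name] = False
            gave_up[name] = True
        else:
            gave_up[name] = False


threads = [threading.Thread(target=give_up_shift, args=(name,))
           for name in ("Alice", "Bob")]
for t in threads:
    t.start()
for t in threads:
    t.join()

remaining_on_call = sum(doctors.values())
print(remaining_on_call)  # 1: exactly one doctor stays on call
```

Whichever thread gets the lock first gives up the shift; the other one then sees only one doctor on call and is refused, so the invariant holds regardless of scheduling.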

Method 2: use a stricter isolation level. MySQL, PostgreSQL, and SQL Server (T-SQL) all provide a serializable isolation level.
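
As a rough intuition for what a serializable level adds, here is a toy Python sketch that also tracks what each transaction read and aborts a commit if any of those rows were changed by a transaction that committed in the meantime. This is a simplified optimistic validation scheme chosen for illustration; `ValidatingStore`, `ValidatingTxn`, and `SerializationFailure` are hypothetical names, and real engines use other mechanisms such as two-phase locking or serializable snapshot isolation.

```python
class SerializationFailure(Exception):
    """Raised when a commit would break serializability in this toy model."""


class ValidatingStore:
    def __init__(self, rows):
        self.rows = dict(rows)     # {doctor name: on_call flag}
        self.commit_log = []       # one set of written row names per commit


class ValidatingTxn:
    def __init__(self, store):
        self.store = store
        self.snapshot = dict(store.rows)
        self.start = len(store.commit_log)   # commits that existed at BEGIN
        self.reads = set()
        self.writes = {}

    def count_on_call(self):
        self.reads.update(self.snapshot)     # the count depends on every row
        return sum(self.snapshot.values())

    def set_on_call(self, name, value):
        self.writes[name] = value

    def commit(self):
        # Abort if a transaction that committed after our BEGIN wrote a row we read.
        for written in self.store.commit_log[self.start:]:
            if written & self.reads:
                raise SerializationFailure("a row this transaction read was changed")
        self.store.rows.update(self.writes)
        self.store.commit_log.append(set(self.writes))


store = ValidatingStore({"Alice": True, "Bob": True})
alice, bob = ValidatingTxn(store), ValidatingTxn(store)

if alice.count_on_call() >= 2:
    alice.set_on_call("Alice", False)
if bob.count_on_call() >= 2:
    bob.set_on_call("Bob", False)

alice.commit()
try:
    bob.commit()
    bob_aborted = False
except SerializationFailure:
    bob_aborted = True     # Bob's premise (two doctors on call) is stale

still_on_call = sum(store.rows.values())
print(bob_aborted, still_on_call)  # True 1
```

In a real application the aborted transaction would be retried, and on retry Bob would see only one doctor on call and keep his shift.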

131

answered Nov 02 '22 00:11

hqt
