What is the difference between Non-Repeatable Read and Phantom Read?

Read phenomena

Dirty reads: read UNCOMMITED data from another transaction
Non-repeatable reads: read COMMITTED data from an UPDATE query from another transaction
Phantom reads: read COMMITTED data from an INSERT or DELETE query from another transaction

Note : DELETE statements from another transaction, also have a very low probability of causing Non-repeatable reads in certain cases. It happens when the DELETE statement unfortunately, removes the very same row which your current transaction was querying. But this is a rare case, and far more unlikely to occur in a database which have millions of rows in each table. Tables containing transaction data usually have high data volume in any production environment.

Also we may observe that UPDATES may be a more frequent job in most use cases rather than actual INSERT or DELETES (in such cases, danger of non-repeatable reads remain only - phantom reads are not possible in those cases). This is why UPDATES are treated differently from INSERT-DELETE and the resulting anomaly is also named differently.

There is also an additional processing cost associated with handling for INSERT-DELETEs, rather than just handling the UPDATES.

Benefits of different isolation levels

READ_UNCOMMITTED prevents nothing. It's the zero isolation level
READ_COMMITTED prevents just one, i.e. Dirty reads
REPEATABLE_READ prevents two anomalies: Dirty reads and Non-repeatable reads
SERIALIZABLE prevents all three anomalies: Dirty reads, Non-repeatable reads and Phantom reads

Then why not just set the transaction SERIALIZABLE at all times? Well, the answer to the above question is: SERIALIZABLE setting makes transactions very slow, which we again don't want.

In fact transaction time consumption is in the following rate:

SERIALIZABLE > REPEATABLE_READ > READ_COMMITTED > READ_UNCOMMITTED

So READ_UNCOMMITTED setting is the fastest.

Summary

Actually we need to analyze the use case and decide an isolation level so that we optimize the transaction time and also prevent most anomalies.

Note that databases by default may have REPEATABLE_READ setting. Admins and architects may have an affinity towards choosing this setting as default, to exhibit better performance of the platform.

There is a difference in the implementation between these two kinds isolation levels.
For "non-repeatable read", row-locking is needed.
For "phantom read"，scoped-locking is needed, even a table-locking.
We can implement these two levels by using two-phase-locking protocol.

Related questions
                            
                                Surrogate vs. natural/business keys [closed]
                            
                                IN vs ANY operator in PostgreSQL
                            
                                How SID is different from Service name in Oracle tnsnames.ora
                            
                                How to get ERD diagram for an existing database? [closed]
                            
                                Fastest hash for non-cryptographic uses?
                            
                                How do I reset a sequence in Oracle?
                            
                                Throw an error preventing a table update in a MySQL trigger
                            
                                How to check if a table exists in a given schema
                            
                                Is there a good reason I see VARCHAR(255) used so often (as opposed to another length)?
                            
                                What scalability problems have you encountered using a NoSQL data store? [closed]
                            
                                Difference between database and schema
                            
                                Maximum number of records in a MySQL database table
                            
                                How to find out the MySQL root password
                            
                                How to check which locks are held on a table
                            
                                Storing time-series data, relational or non?
                            
                                Copy values from one column to another in the same table
                            
                                Which is faster/best? SELECT * or SELECT column1, colum2, column3, etc
                            
                                Binary Data in MySQL [closed]
                            
                                How do I move a redis database from one server to another?
                            
                                How does the HyperLogLog algorithm work?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the difference between Non-Repeatable Read and Phantom Read?

Tags:

database

oracle

transactions

isolation-level

transaction-isolation

People also ask

Read phenomena

Benefits of different isolation levels

Summary

Recent Activity

Donate For Us