I am developing an online reservation system. To simplify, let's say that users can book multiple items and each item can be booked only once. Items are first added to the shopping cart. The app uses a <code>MySQL</code> / <code>InnoDB</code> database. According to the MySQL documentation, the default isolation level is <code>REPEATABLE READ</code>. Here is the checkout procedure I've come up with so far:
<blockquote> <ol>
<li>Begin transaction</li>
<li>Select items in the shopping cart (with <code>for update</code> lock). Records from the <code>cart-item</code> and <code>items</code> tables are fetched at this step.</li>
<li>Check that the items haven't been booked by anybody else. Basically, check if <code>quantity > 0</code>. It's more complicated in the real application, so I put it here as a separate step.</li>
<li>Update items, set <code>quantity = 0</code>. Also perform other essential database manipulations.</li>
<li>Make payment (via an external API like PayPal or Stripe). No user interaction is necessary, as payment details can be collected before checkout.</li>
<li>If everything went fine, commit the transaction; roll back otherwise.</li>
<li>Continue with non-essential logic. Send an e-mail etc. in case of success, redirect on error.</li>
</ol> </blockquote>
I am unsure whether that is sufficient. I'm worried about the following:
<ol>
<li>Will another user who tries to book the same item at the same time be handled correctly? Will their transaction <code>T2</code> wait until <code>T1</code> is done?</li>
<li>Payment using PayPal or Stripe may take some time. Wouldn't this become a problem in terms of performance?</li>
<li>Will item availability be shown correctly all the time (items should be available until checkout succeeds)? Should these read-only selects use a <code>shared lock</code>?</li>
<li>Is it possible that MySQL rolls back a transaction by itself? Is it generally better to retry automatically, or to display an error message and let the user try again?</li>
<li>I guess it's enough if I do <code>SELECT ... FOR UPDATE</code> on the <code>items</code> table. This way both a request caused by a double click and a request from another user will have to wait until the transaction finishes. They'll wait because they also use <code>FOR UPDATE</code>. Meanwhile, a vanilla <code>SELECT</code> will just see a snapshot of the database from before the transaction, with no delay, right?</li>
<li>If I use <code>JOIN</code> in <code>SELECT ... FOR UPDATE</code>, will records in both tables be locked?</li>
<li>I'm a bit confused about the "SELECT ... FOR UPDATE on non-existent rows" section of Willem Renzema's answer. When may it become important? Could you provide an example?</li>
</ol>
Here are some resources I've read: How to deal with concurrent updates in databases?, MySQL: Transactions vs Locking Tables, Do database transactions prevent race conditions?, Isolation (database systems), InnoDB Locking and Transaction Model, A beginner’s guide to database locking and the lost update phenomena. Rewrote my original question to make it more general. Added follow-up questions.

<blockquote> <ol>
<li>Begin transaction</li>
<li>Select items in shopping cart (with for update lock)</li>
</ol> </blockquote>
So far so good. This will at least prevent the user from checking out in multiple sessions (trying to check out the same cart multiple times, which is a good way to deal with double clicks).
<blockquote> <ol start="3">
<li>Check if items haven't been booked by other user</li>
</ol> </blockquote>
How do you check? With a standard <code>SELECT</code> or with a <code>SELECT ... FOR UPDATE</code>? Based on step 5, I'm guessing you are checking a reserved column on the item, or something similar. The problem here is that the <code>SELECT ... FOR UPDATE</code> in step 2 is NOT going to apply the <code>FOR UPDATE</code> lock to everything else. It applies only to what is <code>SELECT</code>ed: the <code>cart-item</code> table. Based on the name, that is going to be a different record for each cart/user. This means that other transactions will NOT be blocked from proceeding.
<blockquote> <ol start="4">
<li>Make payment</li>
<li>Update items marking them as reserved</li>
<li>If everything went fine commit transaction, rollback otherwise</li>
</ol> </blockquote>
Following the above, based on the information you've provided, you may end up with multiple people buying the same item if you aren't using <code>SELECT ... FOR UPDATE</code> in step 3.
<h3>Suggested Solution</h3>
<ol>
<li>Begin transaction</li>
<li> <code>SELECT ... FOR UPDATE</code> the <code>cart-item</code> table.</li>
</ol>
This will lock out a double click. What you select here should be some kind of "cart ordered" column. If you do this, a second transaction will pause here, wait for the first to finish, and then read what the first transaction saved to the database. Make sure to end the checkout process here if the <code>cart-item</code> table says the cart has already been ordered.
<ol start="3">
<li> <code>SELECT ... FOR UPDATE</code> the table where you record if an item has been reserved.</li>
</ol>
This will block OTHER carts/users from reading those items with a locking read until your transaction ends. Based on the result, if the items are not reserved, continue:
<ol start="4">
<li><code>UPDATE ...</code> the table in step 3, marking the item as reserved. Do any other <code>INSERT</code>s and <code>UPDATE</code>s you need, as well.</li>
<li>Make payment. Issue a rollback if the payment service says the payment didn't work.</li>
<li>Record payment, if success.</li>
<li>Commit transaction </li>
</ol>
Make sure you don't do anything that might fail between steps 5 and 7 (like sending emails). Otherwise, if the transaction gets rolled back, the customer may end up having made a payment that was never recorded. Step 3 is the important step for making sure two (or more) people don't order the same item. If two people do try, the second person's web page will "hang" while the first checkout is processed. When the first finishes, the second will read the "reserved" column, and you can return a message to the user that someone has already purchased that item.
<h3>Payment in transaction or not</h3>
This is subjective. Generally, you want to close transactions as quickly as possible, to avoid multiple people being locked out from interacting with the database at once. However, in this case, you actually do want them to wait. It's just a matter of how long. If you choose to commit the transaction before payment, you'll need to record your progress in some intermediate table, run the payment, and then record the result. Be aware that if the payment fails, you'll then have to manually undo the item reservation records that you updated.
<h3>SELECT ... FOR UPDATE on non-existent rows</h3>
Just a word of warning, in case your table design involves inserting rows that you first need to check with <code>SELECT ... FOR UPDATE</code>: if a row doesn't exist, that transaction will NOT cause other transactions to wait if they also <code>SELECT ... FOR UPDATE</code> the same non-existent row. So, make sure to always serialize your requests by first doing a <code>SELECT ... FOR UPDATE</code> on a row that you know exists. Then you can <code>SELECT ... FOR UPDATE</code> on the row that may or may not exist yet. (Don't try to do just a <code>SELECT</code> on the row that may or may not exist, as you'll be reading the state of the row at the time the transaction started, not at the moment you run the <code>SELECT</code>. So <code>SELECT ... FOR UPDATE</code> on non-existent rows is still something you need to do in order to get the most up-to-date information; just be aware that it will not cause other transactions to wait.)
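To make the warning about non-existent rows concrete, here is a minimal two-session sketch. The `reservations` table, its columns, and the literal ids are hypothetical and only serve the illustration; the behaviour shown assumes InnoDB at the default REPEATABLE READ isolation level.

```sql
-- Hypothetical table: reservations(item_id PRIMARY KEY, user_id); no row for item 7 yet.

-- Session A
START TRANSACTION;
SELECT * FROM reservations WHERE item_id = 7 FOR UPDATE;   -- empty result; takes a gap lock

-- Session B
START TRANSACTION;
SELECT * FROM reservations WHERE item_id = 7 FOR UPDATE;   -- empty result; does NOT wait for A

-- Session A
INSERT INTO reservations (item_id, user_id) VALUES (7, 1); -- blocks on B's gap lock

-- Session B
INSERT INTO reservations (item_id, user_id) VALUES (7, 2); -- deadlock; one session fails with error 1213

-- Safer ordering: every session first locks a row that is known to exist.
START TRANSACTION;
SELECT id FROM items WHERE id = 7 FOR UPDATE;              -- concurrent sessions queue up here
SELECT * FROM reservations WHERE item_id = 7 FOR UPDATE;   -- current state of the maybe-missing row
-- Application: INSERT the reservation only if the previous SELECT returned nothing.
COMMIT;
```

In the first half neither locking read blocks the other, so the conflict only surfaces at the `INSERT`. In the second half the existing `items` row acts as the serialization point.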

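The suggested solution in the first answer could be sketched roughly as follows. This is only an outline under assumed names: `cart_item(cart_id, item_id, ordered)`, `items(id, quantity)` and `payments(cart_id, provider_ref)` are hypothetical tables, the literal ids are placeholders, and the checks marked "Application" happen in application code between statements.

```sql
START TRANSACTION;

-- Step 2: lock this cart's rows; a double click blocks here until the first checkout ends.
SELECT item_id, ordered
FROM cart_item
WHERE cart_id = 42
FOR UPDATE;
-- Application: if ordered = 1, ROLLBACK and stop.

-- Step 3: lock the item rows returned by step 2; other carts' locking reads wait here.
SELECT id, quantity
FROM items
WHERE id IN (7, 9)
FOR UPDATE;
-- Application: if any quantity = 0, ROLLBACK and report that the item is taken.

-- Step 4: reserve the items and mark the cart as ordered.
UPDATE items SET quantity = 0 WHERE id IN (7, 9);
UPDATE cart_item SET ordered = 1 WHERE cart_id = 42;

-- Step 5: the application calls the payment API here; on failure it issues ROLLBACK.

-- Step 6: record the payment (the reference value is a placeholder).
INSERT INTO payments (cart_id, provider_ref) VALUES (42, 'example-ref');

-- Step 7
COMMIT;
```

Locking the cart rows before the item rows in every checkout keeps the lock order consistent, which should reduce the chance of deadlocks between concurrent checkouts.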
How to properly use transactions and locks to ensure database integrity?

Tags: database, mysql, concurrency, locking, transactions



asked Nov 22 '16 19:11

Paul

2 Answers


answered Oct 21 '22 15:10

Willem Renzema

1. Other user that tries to book same item at the same time will be handled correctly. Will his transaction T2 wait until T1 is done?

Yes. While an active transaction holds a FOR UPDATE lock on a record, statements in other transactions that need a lock on that record (SELECT ... FOR UPDATE, SELECT ... LOCK IN SHARE MODE, UPDATE, DELETE) are suspended until either the active transaction commits or rolls back, or the "Lock wait timeout" is exceeded.
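A small two-session sketch of that waiting behaviour, assuming a hypothetical `items(id, quantity)` table in which item 7 starts with `quantity = 1`:

```sql
-- Session T1
START TRANSACTION;
SELECT quantity FROM items WHERE id = 7 FOR UPDATE;   -- row 7 is now exclusively locked by T1

-- Session T2
START TRANSACTION;
SELECT quantity FROM items WHERE id = 7 FOR UPDATE;   -- blocks until T1 ends or the lock wait times out

-- Session T1
UPDATE items SET quantity = 0 WHERE id = 7;
COMMIT;

-- Session T2: the blocked SELECT now returns, and because it is a locking read
-- it sees the value T1 just committed (quantity = 0).
ROLLBACK;
```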

2. Payment using PayPal or Stripe may take some time. Wouldn't this become a problem in terms of performance?

This will not be a problem, as this is exactly what is necessary. Checkout transactions that touch the same items should be executed sequentially, i.e. a later checkout should not proceed before the earlier one finishes.

3. Items availability will be shown correctly all the time (items should be available until checkout succeeds). Should these read-only selects use shared lock?

The REPEATABLE READ isolation level ensures that changes made by a transaction are not visible to others until that transaction is committed. Therefore item availability will be displayed correctly: nothing will be shown as unavailable before it is actually paid for. No locks are necessary.

SELECT ... LOCK IN SHARE MODE would make the read wait for a running checkout, and would make a checkout wait until the reading transaction has finished. This could slow down checkouts without any payoff.
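The difference between the two kinds of read can be seen in a short sketch (again with a hypothetical `items(id, quantity)` table and a placeholder id):

```sql
-- Session T1 (checkout in progress, not committed yet)
START TRANSACTION;
SELECT quantity FROM items WHERE id = 7 FOR UPDATE;
UPDATE items SET quantity = 0 WHERE id = 7;

-- Session R (availability page)
SELECT quantity FROM items WHERE id = 7;                      -- returns at once with the last committed value
SELECT quantity FROM items WHERE id = 7 LOCK IN SHARE MODE;   -- waits until T1 commits or rolls back
```

The plain `SELECT` is a consistent (non-locking) read, so it neither waits for T1 nor shows T1's uncommitted update.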

4. Is it possible that MySQL rolls back a transaction by itself? Is it generally better to retry automatically or display an error message and let user try again?

It is possible. A transaction is rolled back when a deadlock is detected. When the "Lock wait timeout" is exceeded, InnoDB by default rolls back only the statement that was waiting; the whole transaction is rolled back only if innodb_rollback_on_timeout is enabled. In either case it is a good idea to retry automatically.
By default, suspended statements fail after 50 seconds.
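For reference, the relevant settings can be inspected as below. The 10-second value is just an example, not a recommendation.

```sql
-- Lock wait timeout in seconds (50 by default); can be changed per session.
SELECT @@innodb_lock_wait_timeout;
SET SESSION innodb_lock_wait_timeout = 10;

-- Decides whether a lock wait timeout rolls back the whole transaction
-- or only the waiting statement; it is a server startup option.
SELECT @@innodb_rollback_on_timeout;

-- Errors the application can catch in order to retry:
--   1205 (ER_LOCK_WAIT_TIMEOUT): lock wait timeout exceeded
--   1213 (ER_LOCK_DEADLOCK):     deadlock found, transaction rolled back
```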

5. I guess it's enough if I do SELECT ... FOR UPDATE on items table. This way both request caused by double click and other user will have to wait till transaction finishes. They'll wait because they also use FOR UPDATE. Meanwhile vanilla SELECT will just see a snapshot of db before the transaction, with no delay though, right?

Yes, SELECT ... FOR UPDATE on the items table should be enough.
Yes, these selects wait, because FOR UPDATE takes an exclusive lock.
Yes, a plain SELECT just returns the last committed value from its snapshot, without seeing the uncommitted changes; this happens immediately.

6. If I use JOIN in SELECT ... FOR UPDATE, will records in both tables be locked?

Yes. SELECT ... FOR UPDATE, SELECT ... LOCK IN SHARE MODE, UPDATE and DELETE lock all the records they read, so whatever we JOIN is included. See the MySQL docs.

What's interesting (at least for me) is that everything scanned while processing the SQL statement gets locked, no matter whether it is selected or not. For example, WHERE id < 10 may also lock the record with id = 10!

If you have no indexes suitable for your statement and MySQL must scan the entire table to process the statement, every row of the table becomes locked, which in turn blocks all inserts by other users to the table. It is important to create good indexes so that your queries do not unnecessarily scan many rows.
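One way to check what a locking statement actually scans and locks is sketched below. It assumes MySQL 8.0 or later, where the `performance_schema.data_locks` table is available, plus the hypothetical `items(id, quantity)` table used for illustration.

```sql
-- See which index (if any) the locking read would use and roughly how many rows it scans.
EXPLAIN SELECT id, quantity FROM items WHERE id < 10 FOR UPDATE;

-- Session 1: take the locks and keep the transaction open.
START TRANSACTION;
SELECT id, quantity FROM items WHERE id < 10 FOR UPDATE;

-- Session 2: list the locks currently held or requested.
SELECT object_name, index_name, lock_type, lock_mode, lock_status, lock_data
FROM performance_schema.data_locks;

-- Session 1
ROLLBACK;
```

If the second query lists far more record locks than rows you expected to touch, that is a hint the statement is missing a suitable index.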

answered Oct 21 '22 14:10

Paul
