When you do:
    @transaction.atomic
    def update_db():
        do_bulk_update()
while the function is running, does it lock the database?
I'm asking regarding django's atomic transaction: https://docs.djangoproject.com/en/1.10/topics/db/transactions/#autocommit-details
Django provides a single API to control database transactions. Atomicity is the defining property of database transactions. atomic allows us to create a block of code within which the atomicity on the database is guaranteed. If the block of code is successfully completed, the changes are committed to the database.
An atomic transaction is an indivisible and irreducible series of database operations such that either all occurs, or nothing occurs. A guarantee of atomicity prevents updates to the database occurring only partially, which can cause greater problems than rejecting the whole series outright.
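In Django, that all-or-nothing behaviour looks like this with the context-manager form of atomic. A minimal sketch, assuming a hypothetical Account model with a numeric balance field:

    from django.db import transaction

    def transfer(source, dest, amount):
        # Account is a hypothetical model with a numeric `balance` field.
        with transaction.atomic():
            source.balance -= amount
            source.save()
            dest.balance += amount
            dest.save()
            # If anything in this block raises, both saves are rolled back
            # together; otherwise both are committed when the block exits.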
In terms of Django, optimistic concurrency control can be implemented by overriding the save method on your model class... And, of course, for either of these concurrency mechanisms to be robust, you have to consider transactional control.
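A minimal sketch of that optimistic approach, assuming a hypothetical model with an integer version column; the write only goes through if nobody else has bumped the version since we read the row:

    from django.db import models

    class Document(models.Model):
        # Hypothetical model for illustration.
        body = models.TextField()
        version = models.IntegerField(default=0)

        def save(self, *args, **kwargs):
            if self.pk is None:
                return super().save(*args, **kwargs)
            # Optimistic concurrency control: only update the row if the
            # version we originally read is still the current one.
            updated = Document.objects.filter(
                pk=self.pk, version=self.version
            ).update(body=self.body, version=self.version + 1)
            if not updated:
                raise RuntimeError('concurrent modification detected')
            self.version += 1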
An atomic transaction is a single, irreducible component of a classic transaction, such as making a purchase. WS-AT ensures that if a single atomic transaction fails, the whole transaction fails: A partial transaction cannot take place.
(I'm assuming modern SQL databases in this answer.)
Transactions are not locks, but they hold locks that are acquired automatically during operations. Django does not add any locking by default, so the answer is no: atomic does not lock the database.
E.g. if you were to do:

    @transaction.atomic
    def update_db():
        cursor.execute("UPDATE app_model SET model_name = 'bob' WHERE model_id = 1;")
        # some other stuff...

you will have locked the app_model row with id 1 for the duration of "other stuff". But the row is not locked until that query executes, so if you want to ensure consistency you should probably use locks explicitly, as in the sketch below.
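In Django the usual way to take such a lock explicitly is select_for_update(), which issues SELECT ... FOR UPDATE and holds the row lock from that query until the end of the atomic block. A minimal sketch, assuming a hypothetical AppModel model corresponding to the app_model table above:

    from django.db import transaction

    @transaction.atomic
    def update_db():
        # The row is locked as soon as this query runs, and stays locked
        # until the transaction commits at the end of the block.
        obj = AppModel.objects.select_for_update().get(pk=1)
        obj.model_name = 'bob'
        obj.save()
        # some other stuff... (other writers to this row block until commit)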
As said, transactions are not locks, because that would be awful for performance. They are, in the first instance, lighter-weight mechanisms for ensuring that if you make a load of changes that wouldn't make sense one at a time to other users of the database, those changes appear to happen all at once, i.e. atomically. Transactions do not block other users from mutating the database, and in general do not even block other users from mutating the same rows you may be reading.
See this guide and your database's docs (e.g. postgres) for more details on how transactions are protected.
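To see the non-blocking behaviour concretely: with READ COMMITTED on postgres, a plain SELECT does not wait for a row lock held by an uncommitted writer. A small psycopg2 sketch (connection parameters are assumptions, using the my_table table from the demo further down):

    import psycopg2

    # Two independent connections to the same database.
    conn_a = psycopg2.connect(user='daphtdazz', host='localhost',
                              port=5432, database='db_test')
    conn_b = psycopg2.connect(user='daphtdazz', host='localhost',
                              port=5432, database='db_test')

    cur_a = conn_a.cursor()
    cur_a.execute('update my_table set age = 75 where id = 1;')  # lock taken, not committed

    cur_b = conn_b.cursor()
    cur_b.execute('select age from my_table where id = 1;')  # does not block...
    print(cur_b.fetchone())  # ...and still shows the last committed value

    conn_a.commit()  # only now is the new value visible to other transactions
    conn_a.close()
    conn_b.close()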
Django itself does the following when you use the atomic decorator (referring to the code); a hand-rolled sketch of these steps follows the list.

1. Disables autocommit. Autocommit is an application-level feature which always commits transactions immediately, so to the application it looks like there is never a transaction outstanding. Disabling it tells the database to start a new transaction. At this point psycopg2 for postgres sets the isolation level of the transaction to READ COMMITTED, which means that any reads in the transaction will only return committed data: if another transaction writes, you won't see that change until it commits. It does mean, though, that if that other transaction commits during your transaction, a second read may see the changed value. Obviously this means that the database is not locked.
2. Runs your code. Any queries / mutations you make are not committed.
3. Commits the transaction.
4. Re-enables autocommit.
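Those steps correspond roughly to the following hand-rolled version using Django's low-level transaction API (a simplified sketch, not literally what atomic executes):

    from django.db import connection, transaction

    def update_db_by_hand():
        transaction.set_autocommit(False)  # 1. a new transaction begins
        try:
            with connection.cursor() as cursor:
                # 2. run your code; nothing is committed yet
                cursor.execute("UPDATE app_model SET model_name = 'bob' WHERE model_id = 1;")
            transaction.commit()           # 3. commit the transaction
        except Exception:
            transaction.rollback()         # undo everything on error
            raise
        finally:
            transaction.set_autocommit(True)  # 4. re-enable autocommit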
This is a simplified picture: atomic blocks can be nested, and for an inner block Django instead tries to use savepoints, so we can revert back to them if we "roll back" the inner "transaction", but as far as the database connection is concerned we are still in the same transaction.
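A short sketch of how that behaves from the application side (AppModel is again hypothetical):

    from django.db import transaction

    def outer():
        with transaction.atomic():          # outermost block: a real BEGIN
            AppModel.objects.create(model_name='alice')
            try:
                with transaction.atomic():  # nested block: only a SAVEPOINT
                    AppModel.objects.create(model_name='bob')
                    raise ValueError('oops')
            except ValueError:
                pass  # 'bob' is rolled back to the savepoint; 'alice' survives
        # the outer transaction commits here: only 'alice' ends up in the database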
As said, the database may give your transaction some automatic locks, as outlined in this doc. To demonstrate this, consider the following code that operates on a postgres database with one table and one row in it:
    my_table
    id | age
    ---+----
     1 |  50
And then you run this code:
    import psycopg2 as Database
    from multiprocessing import Process
    from time import sleep
    from contextlib import contextmanager


    @contextmanager
    def connection():
        conn = Database.connect(
            user='daphtdazz', host='localhost', port=5432, database='db_test'
        )
        try:
            yield conn
        finally:
            conn.close()


    def connect_and_mutate_after_seconds(seconds, age):
        with connection() as conn:
            curs = conn.cursor()
            print('execute update age to %d...' % (age,))
            curs.execute('update my_table set age = %d where id = 1;' % (age,))
            print('sleep after update age to %d...' % (age,))
            sleep(seconds)
            print('commit update age to %d...' % (age,))
            conn.commit()


    def dump_table():
        with connection() as conn:
            curs = conn.cursor()
            curs.execute('select * from my_table;')
            print('table: %s' % (curs.fetchall(),))


    if __name__ == '__main__':
        p1 = Process(target=connect_and_mutate_after_seconds, args=(2, 99))
        p1.start()
        sleep(0.6)
        p2 = Process(target=connect_and_mutate_after_seconds, args=(1, 100))
        p2.start()
        p2.join()
        dump_table()
        p1.join()
        dump_table()
You get:
    execute update age to 99...
    sleep after update age to 99...
    execute update age to 100...
    commit update age to 99...
    sleep after update age to 100...
    commit update age to 100...
    table: [(1, 100)]
    table: [(1, 100)]
and the point is that the second process is started before the first one completes, but after it has executed its update command, so the second process has to wait for the row lock, which is why we don't see "sleep after update age to 100" until after the commit for age 99.
If you put the sleep before the exec, you get:
    sleep before update age to 99...
    sleep before update age to 100...
    execute update age to 100...
    commit update age to 100...
    table: [(24, 3), (100, 2)]
    execute update age to 99...
    commit update age to 99...
    table: [(24, 3), (99, 2)]
This indicates that the lock had not been acquired by the time the second process got to its update, which therefore happened first, even though it happened during the first process's transaction.
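If you did want the lock held from the very start of the transaction, before any mutation, you could take it explicitly with SELECT ... FOR UPDATE. A sketch of how connect_and_mutate_after_seconds above might be adapted:

    def connect_lock_and_mutate_after_seconds(seconds, age):
        with connection() as conn:
            curs = conn.cursor()
            # Take the row lock up front; concurrent writers block from here,
            # even though nothing has been changed yet.
            curs.execute('select * from my_table where id = 1 for update;')
            sleep(seconds)
            curs.execute('update my_table set age = %d where id = 1;' % (age,))
            conn.commit()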