The Wikipedia article for Distributed transaction isn't very helpful. Can you give a high-level description with more details of what a distributed transaction is? Also, can you give me an example of why an application or database should perform a transaction that updates data on two or more networked computers? I understand the classic bank example; I care more about distributed transactions in Web-scale databases like Dynamo, Bigtable, HBase, or Cassandra.

Distributed transactions span multiple physical systems, whereas standard transactions do not. Synchronization amongst the systems becomes a need which traditionally would not exist in a standard transaction. From your Wikipedia reference... <blockquote> ...a distributed transaction can be seen as a database transaction that must be synchronized (or provide ACID properties) among multiple participating databases which are distributed among different physical locations... </blockquote>

Usually, transactions occur on one database server: <pre class="prettyprint"><code>BEGIN TRANSACTION SELECT something FROM myTable UPDATE something IN myTable COMMIT </code></pre> A distributed transaction involves multiple servers: <pre class="prettyprint"><code>BEGIN TRANSACTION UPDATE amount = amount - 100 IN bankAccounts WHERE accountNr = 1 UPDATE amount = amount + 100 IN someRemoteDatabaseAtSomeOtherBank.bankAccounts WHERE accountNr = 2 COMMIT </code></pre> The difficulty comes from the fact that the servers must communicate to ensure that transactional properties such as atomicity are satisfied on both servers: If the transaction succeeds, the values must be updated on both servers. If the transaction fails, the transaction must be rollbacked on both servers. It must never happen that the values are updated on one server but not updated on the other.

What is a "distributed transaction"?

2 Answers

Distributed transactions span multiple physical systems, whereas standard transactions do not. Synchronization amongst the systems becomes a need which traditionally would not exist in a standard transaction.

From your Wikipedia reference...

...a distributed transaction can be seen as a database transaction that must be synchronized (or provide ACID properties) among multiple participating databases which are distributed among different physical locations...

answered Sep 30 '22 23:09

Aaron McIver

Usually, transactions occur on one database server:

BEGIN TRANSACTION SELECT something FROM myTable UPDATE something IN myTable COMMIT

A distributed transaction involves multiple servers:

BEGIN TRANSACTION UPDATE amount = amount - 100 IN bankAccounts WHERE accountNr = 1 UPDATE amount = amount + 100 IN someRemoteDatabaseAtSomeOtherBank.bankAccounts WHERE accountNr = 2 COMMIT

The difficulty comes from the fact that the servers must communicate to ensure that transactional properties such as atomicity are satisfied on both servers: If the transaction succeeds, the values must be updated on both servers. If the transaction fails, the transaction must be rollbacked on both servers. It must never happen that the values are updated on one server but not updated on the other.

answered Sep 30 '22 21:09

Heinzi

Related questions
                            
                                To Do or Not to Do: Store Images in a Database [duplicate]
                            
                                Can I add to an existing lazy database in R without having to recreate everything?
                            
                                Are there any REAL advantages to NoSQL over RDBMS for structured data on one machine?
                            
                                Any experiences with Protocol Buffers?
                            
                                Difference between ANSI and Unicode drivers of MySQL
                            
                                Guice, JDBC and managing database connections
                            
                                How does `db.serialize` work in `node-sqlite3`
                            
                                Cassandra - transaction support
                            
                                Why do Rails migrations define foreign keys in the application but not in the database?
                            
                                Where does Chrome save its SQLite database to?
                            
                                Unit testing database application with business logic performed in the UI
                            
                                Is there data visualisation tool for postgresql which is capable of displaying inter schema relations as well? [closed]
                            
                                LinqDataSource - Can you limit the amount of records returned?
                            
                                Symfony2: get the id of the persisted object
                            
                                How to create multi environment DB's with Firestore
                            
                                Difference between different types of SQL? [closed]
                            
                                Laravel :: Best way to update a foreign key
                            
                                How can I browse my Heroku database?
                            
                                Unknown column in 'having clause'
                            
                                How are bitmap indexes helpful?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is a "distributed transaction"?

Tags:

database

commit

distributed-transactions

xa

Zombie

People also ask

2 Answers

Aaron McIver

Heinzi

Recent Activity

Donate For Us