How does three-phase commit avoid blocking?

Tags:

I am trying to understand how three-phase commit avoids blocking

Consider the following two failure scenarios:

Scenario 1: In phase 2 the coordinator sends preCommit messages to all cohorts and has gotten an ack from all except cohort A. Network problems prevent cohort A from receiving the coordinator's preCommit message. Cohort A times out waiting for the preCommit message and chooses to abort. Then both the coordinator and cohort A crash.

Scenario 2: The protocol reaches phase 3. The coordinator sends a doCommit message to cohort A. But before it can send more doCommit messages the coordinator crashes. Cohort A commits its part of the transaction then crashes.

As far as I can tell the remaining cohorts have the exact same state at the end of scenario 1 and scenario 2. So when a recovery coordinator steps in how can it find out from the remaining cohorts whether we are in scenario 1 and abort or we are in scenario 2 and commit and thus avoid blocking?

229

asked Jan 29 '14 07:01

user782220

2 Answers

For further reading, here's a sentence from the abstract of a paper on the subject, Analysis and Verification of Two-Phase Commit & Three-Phase Commit Protocols, by Muhammad Atif, to whet your appetite:

We also apply our method to its “amended” variant, the Three-Phase Commit Protocol (3PC) and prove it to be erroneous for simultaneous site failures

I found this paper to provide a foothold into the literature. There's no small amount of it on this subject, if you want to delve in.

111

answered Oct 05 '22 23:10

eh9

In the two-phase commit the coordinator sends a prepare message to all participants (nodes) and waits for their answers. The coordinator then sends their answers to all other sites. Every participant waits for these answers from the coordinator before committing to or aborting the transaction.

The two-phase commit protocol also has limitations in that it is a blocking protocol. For example, participants will block resource processes while waiting for a message from the coordinator. If for any reason this fails, the participant will continue to wait and may never resolve its transaction. Therefore the resource could be blocked indefinitely. On the other hand, a coordinator will also block resources while waiting for replies from participants. In this case, a coordinator can also block in definitely if no acknowledgement is received from the participant.

However, the three-phase protocol introduces a third phase called the pre-commit. The aim of this is to 'remove the uncertainty period for participants that have committed and are waiting for the global abort or commit message from the coordinator.

When receiving a pre-commit message, participants know that all others have voted to commit.  If a pre-commit message has not been received the participant will abort and release any blocked resources.

answered Oct 05 '22 22:10

apomene

Related questions
                            
                                Is it possible to initialize std::vector over already allocated memory?
                            
                                Using grequests to make several thousand get requests to sourceforge, get "Max retries exceeded with url"
                            
                                Static final int v/s static int
                            
                                Extend Blueprint class?
                            
                                Make sure numpy is using MKL library on mac pro
                            
                                Perform a web.config transform before publishing with MSBuild
                            
                                TSLint double vs triple equality
                            
                                C++ error -- expression must have integral or enum type -- getting this from a string with concatenation?
                            
                                Spring RestTemplate HTTP Post with parameters cause 400 bad request error
                            
                                Does pandas.Series.unique() preserve order?
                            
                                Difference between Task and async Task
                            
                                Cannot instantiate @InjectMocks field named exception with java class

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With