I was reading about GFS and its consistency model but I'm failing to grasp some of it. In particular, can someone provide me with a specific example scenario (or an explanation of why it cannot happen) for each of the file region states the paper describes:
- consistent: all clients see the same data, regardless of which replica they read from
- defined: consistent, and the region contains what the mutation wrote in its entirety; this is the outcome of a mutation that succeeds without interference from concurrent writers
- consistent but undefined: the outcome of concurrent successful mutations; all clients see the same data, but it may be a mix of fragments from several mutations rather than what any single one wrote
- inconsistent (and hence undefined): the outcome of a failed mutation; different clients may see different data
I'm quoting from http://research.google.com/archive/gfs.html. Check out Table 1, which is a summary of the possible outcomes for writes/appends:
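For reference, that table ("File Region State After Mutation") classifies the outcomes roughly as follows:

                          Write                       Record Append
    Serial success        defined                     defined interspersed with inconsistent
    Concurrent successes  consistent but undefined    defined interspersed with inconsistent
    Failure               inconsistent                inconsistent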
"If a record append fails at any replica, the client retries the operation. As a result, replicas of the same chunk may contain different data possibly including duplicates of the same record in whole or in part." So any failure on a replica (e.g. timeout) will cause a duplicate record at least on the other replicas. This can happen without concurrent writes.
The same situation that causes a duplicate record also causes an inconsistent (and hence undefined) region. If a replica failed to acknowledge the mutation, it may not have performed it. In that case, when the client retries the append, this replica will have to add padding in place of the missing data so that the record can be written at the right offset. So one replica will have padding while the others will have the previously written record in this region.
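To make that concrete, here is a small, self-contained simulation of one failed record append followed by a client retry. This is not GFS code; the replica names, the choice of primary, and the byte-buffer model are made up purely to illustrate the mechanism described above.

```python
# Toy model: three replicas of one chunk, each just a growing byte buffer.
RECORD = b"payload!"
replicas = {"A": bytearray(), "B": bytearray(), "C": bytearray()}

def record_append(data, fail_on=frozenset()):
    """The primary ("A") picks the append offset; every replica then tries to
    apply the mutation at that offset. Returns True only if all replicas acked."""
    offset = len(replicas["A"])            # primary's current chunk length
    ok = True
    for name, chunk in replicas.items():
        if name in fail_on:                # e.g. a timeout: mutation not applied
            ok = False
            continue
        chunk.extend(b"\x00" * (offset - len(chunk)))   # pad up to the offset
        chunk.extend(data)
    return ok

# First attempt times out on replica C, so the client retries the whole append.
if not record_append(RECORD, fail_on={"C"}):
    record_append(RECORD)

for name, chunk in replicas.items():
    print(name, bytes(chunk))
# A, B: b'payload!payload!'         -> the record appears twice (duplicate)
# C:    8 bytes of padding + b'payload!' -> padding where the first attempt failed
# The first 8-byte region is inconsistent across replicas; the retried one is defined.
```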
A failed write can cause an inconsistent (hence undefined) region as well. More interestingly, successful concurrent writes can cause consistent but undefined regions. "If a write by the application is large or straddles a chunk boundary, GFS client code breaks it down into multiple write operations. They [...] may be interleaved with and overwritten by concurrent operations from other clients. Therefore, the shared file region may end up containing fragments from different clients, although the replicas will be identical because the individual operations are completed successfully in the same order on all replicas. This leaves the file region in consistent but undefined state [...]."
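Here is a similar toy sketch of that case (again with made-up names and a pretend 4-byte chunk boundary): two clients each issue one large write over the same region, the client library splits each write at the chunk boundary, and the piece-writes interleave. Every replica applies the pieces in the same order, so the replicas end up identical, yet the region matches neither client's write.

```python
CHUNK = 4  # pretend the chunk boundary falls every 4 bytes

def pieces(offset, data):
    """Split one logical write into per-chunk piece writes."""
    out = []
    while data:
        take = CHUNK - (offset % CHUNK)
        out.append((offset, data[:take]))
        offset, data = offset + take, data[take:]
    return out

write1 = pieces(0, b"AAAAAAAA")   # client 1: write 8 bytes at offset 0
write2 = pieces(0, b"BBBBBBBB")   # client 2: write 8 bytes at offset 0

# Suppose the piece-writes get serialized in this interleaved order;
# GFS applies them in the same order on every replica.
ops = [write1[0], write2[0], write2[1], write1[1]]

replicas = [bytearray(8) for _ in range(3)]
for offset, data in ops:
    for r in replicas:
        r[offset:offset + len(data)] = data

print([bytes(r) for r in replicas])
# Every replica holds b'BBBBAAAA': all clients read the same bytes (consistent),
# but the region reflects neither write in its entirety (undefined).
```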
I don't think it really has to do with concurrent appends but with the at-least-once semantics of their system.

Failure is a fundamental problem of large distributed systems. In the presence of failure, a sender may not know whether the computer on the other end of the network fully received its message.

For such occasions, distributed systems guarantee that a message is delivered either at most once or at least once.

In this case, it appears GFS decided upon at-least-once delivery to the storage nodes.
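As a rough sketch of what that means in practice (the function names and failure rate below are invented for illustration; the paper's actual suggestion is that readers detect padding via checksums and discard duplicates via unique record identifiers):

```python
import random

log = []   # the "chunk": every delivered record lands here, duplicates and all

def unreliable_append(record):
    """Apply the record, but sometimes lose the acknowledgement."""
    log.append(record)
    return random.random() > 0.3      # ~30% of acknowledgements get lost

def append_at_least_once(record, max_retries=10):
    """Retry until an ack arrives; the record may be applied more than once."""
    for _ in range(max_retries):
        if unreliable_append(record):
            return
    raise RuntimeError("append failed after retries")

for i in range(5):
    # each record carries a unique id so a reader can discard duplicates
    append_at_least_once({"id": i, "payload": f"record-{i}"})

seen, deduped = set(), []
for rec in log:
    if rec["id"] not in seen:         # reader-side duplicate filtering
        seen.add(rec["id"])
        deduped.append(rec)

print(f"{len(log)} records stored, {len(deduped)} distinct after filtering")
```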