What are the use cases for a Vector Clock versus a Version Vector?

I have been having trouble finding an example of what use cases are suitable for Vector Clocks and Version Vectors, and how they might differ. I understand that they largely work in the same way, with Vector Clocks using receive and send functions, and Version Vectors using a sync function instead, but I do not understand the differences between the two options. Is it just two different ways of expressing the same thing, or are there real differences in use cases between them?

I was only able to find one question that was somewhat related: "When do I use a consensus algorithm like Paxos vs using a something like a Vector Clock?"

Even though the linked answer states the following, and references a short article, the differences are still unclear to me.

"You might want to use a version vector for a leaderless distributed storage. You might use vector clocks for the same (although it's a worse fit; the article also suggests you use it for consistent snapshots, for implementing causal ordering in general distributed systems etc)."

asked Oct 24 '19 15:10

arabellel

1 Answer

I had the same question, and it is still not entirely clear to me, but what I've found is that version vectors are better suited to tracking causality between the replicas in a specific network of replicated nodes, where the only thing you are interested in is which replica's state came first and which came after.

By contrast, a vector clock establishes the causal order of events across an arbitrary, open-ended sequence of events in a distributed system.

In that sense, using ever-growing integers for version vectors is more than is needed if we just want to determine which node, A or B, is more up to date. Take a situation where initially we have A[2,2] and B[2,2], so the two nodes are in sync.

From the version vector perspective, A[3,2] > B[2,2] means the same as A[10,2] > B[2,2]: in both cases A is simply ahead of B. That would explain why a fixed, bounded set of values is enough for version vectors, and why the only operation that matters is syncing versions.
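The comparison described above can be sketched as follows. This is a minimal illustration, assuming a version vector is a plain list of integers with one slot per replica; the names `Order`, `compare`, and `sync` are made up for this example and do not come from any particular library.

```python
from enum import Enum


class Order(Enum):
    EQUAL = "equal"
    BEFORE = "before"
    AFTER = "after"
    CONCURRENT = "concurrent"


def compare(a, b):
    """Compare two equal-length version vectors element by element."""
    a_ahead = any(x > y for x, y in zip(a, b))
    b_ahead = any(y > x for x, y in zip(a, b))
    if a_ahead and b_ahead:
        return Order.CONCURRENT  # each side has updates the other lacks
    if a_ahead:
        return Order.AFTER
    if b_ahead:
        return Order.BEFORE
    return Order.EQUAL


def sync(a, b):
    """Merge two version vectors by taking the element-wise maximum."""
    return [max(x, y) for x, y in zip(a, b)]


# How far ahead A is does not change the verdict, only that it is ahead.
small_gap = compare([3, 2], [2, 2])   # Order.AFTER
large_gap = compare([10, 2], [2, 2])  # Order.AFTER
conflict = compare([3, 2], [2, 3])    # Order.CONCURRENT
merged = sync([3, 2], [2, 3])         # [3, 3]
```

Note that the result depends only on which entries are larger, not on by how much, which matches the observation that A[3,2] and A[10,2] relate to B[2,2] in the same way.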

From the vector clock perspective, there is a difference between A[10,2] and A[3,2]: it means that seven more events happened on that node in the meantime. That would explain why vector clocks need to count every event, and why they have send and receive operations to propagate the clocks across the network.
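The send and receive operations mentioned above can be sketched like this. It is a rough illustration rather than a reference implementation, assuming a fixed number of nodes and the common convention that local events, sends, and receives each increment the node's own entry; the `VectorClock` class and its method names are hypothetical.

```python
class VectorClock:
    def __init__(self, node_index, size):
        self.i = node_index        # this node's slot in the vector
        self.clock = [0] * size    # one counter per node

    def local_event(self):
        """Count an internal event on this node."""
        self.clock[self.i] += 1

    def send(self):
        """Count the send event and return the timestamp to attach to the message."""
        self.clock[self.i] += 1
        return list(self.clock)

    def receive(self, timestamp):
        """Merge the sender's timestamp, then count the receive event."""
        self.clock = [max(x, y) for x, y in zip(self.clock, timestamp)]
        self.clock[self.i] += 1


a = VectorClock(0, 2)
b = VectorClock(1, 2)

a.local_event()    # a.clock is [1, 0]
msg = a.send()     # a.clock is [2, 0]; msg is [2, 0]
b.receive(msg)     # b.clock is [2, 1]
a.local_event()    # a.clock is [3, 0]
```

Here every event moves a counter, so the size of the gap between two timestamps carries information, unlike in the version vector case.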

In any case, like you, I am still missing a clear document that explains the difference between the two and when to use one rather than the other.

answered Nov 10 '22 01:11

Luis

Tags: synchronization, data-structures, distributed-computing, replication, distributed-system
