How to elect a master node among the nodes running in a cluster?

Tags:

cloud

I'm writing a managed cloud stack (on top of hardware-level cloud providers like EC2), and a problem I will face soon is:

How do several identical nodes decide which one of them becomes a master? (I.e. think of 5 servers running on EC2. One of them has to become a master, and other ones have to become slaves.)

I read a description of the algorithm used by MongoDB, and it seems pretty complicated, and also depends on a concept of votes — i.e. two nodes left alone won't be able to decide anything. Also their approach has a significant delay before it produces the results.

I wonder if there are any less complicated, KISS-embrasing approaches? Are they used widely, or are they risky to adopt?
Suppose we already have a list of servers. Then we can just elect the one that is up and has a numerically smallest IP address. What are downsides of this approach?
Why is MongoDB's algorithm so complicated?

This is a duplicate of How to elect new Master in Cluster?, which gives less details and has not been answered for 6 months, so I feel it is appropriate to start a new question.

(The stack I'm working on is open-source, but it's on a very early stage of development so not giving a link here.)

UPDATE: based on the answers, I have designed a simple consensus algorithm, you can find a JavaScript (CoffeeScript) implementation on GitHub: majority.js.

806

asked Dec 23 '10 23:12

Andrey Tarantsov

2 Answers

Leader election algorithms typically consider the split brain as a fault case to support. If you assume that it's not the nodes that fail but the networking, you may run into the case where all nodes are up, but fail to talk to each other. Then, you may end up with two masters.

If you can exclude "split brain" from your fault model (i.e. if you consider only node failures), your algorithm (leader is the one with the smallest address) is fine.

160

answered Sep 18 '22 14:09

Martin v. Löwis

Use Apache ZooKeeper. It solves exactly this problem (and many more).

answered Sep 17 '22 14:09

Spike Gronim

Related questions
                            
                                how to measure running time of algorithms in python [duplicate]
                            
                                How does Firefox's 'awesome' bar match strings?
                            
                                Smoothing data from a sensor
                            
                                Why is O(n) better than O( nlog(n) )?
                            
                                Quickselect Algorithm - Simplified Explanation
                            
                                What are the rules for the "Ω(n log n) barrier" for sorting algorithms?
                            
                                PyMC: Taking advantage of sparse model structure in Adaptive Metropolis MCMC
                            
                                How can I generate an "unlimited" world?
                            
                                How Can I Best Guess the Encoding when the BOM (Byte Order Mark) is Missing?
                            
                                Travelling Salesman with multiple salesmen?
                            
                                Efficient algorithm for finding all maximal subsets
                            
                                Distribute points on a circle as evenly as possible
                            
                                Towers of Hanoi with K pegs
                            
                                Sorting algorithm to implement highest total combinations
                            
                                Compare two integer arrays with same length
                            
                                Why is it important to delete files in-order to remove them faster?
                            
                                Is it possible to invert an array with constant extra space?
                            
                                Anyone know anything about OLAP Internals?
                            
                                Hashing of pointer values
                            
                                Check if given string can be created by a set of characters cut out from magazine article

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With