I'm trying to implement a cluster using Erlang as the glue that holds it all together. I like the idea that it creates a fully connected graph of nodes, but upon reading different articles online, it seems as though this doesn't scale well (having a max of 50 - 100 nodes). Did the developers of OTP impose this limitation on purpose? I do know that you can setup nodes to have explicit connections only as well as have hidden nodes, etc. But, it seems as though the default out-of-the-box setup isn't very scalable. So to the questions: <ol> <li>If you had 5 nodes (A, B, C, D, E) that all had explicit connections such that A-B-C-D-E. Does Erlang/OTP allow A to talk directly to E or does A have to pass messages from B through D to get to E, and thus that's the reason for the fully connected graph? Again, it makes sense but it doesn't scale well from what I've seen.</li> <li>If one was to try and go for a scalable and fault-tolerant system, what are your options? It seems as though, if you can't create a fully connected graph because you have too many nodes, the next best thing would be to create a tree of some kind. But, this doesn't seem very fault-tolerant because if the root or any parent of children nodes dies, you would lose a significant portion of your cluster.</li> <li>In looking into supervisors and workers, all of the examples I've seen apply this to processes on a single node. Could it be applied to a cluster of nodes to help implement fault-tolerance?</li> <li>Can nodes be part of several clusters?</li> </ol> Thanks for your help, if there is a semi-recent website or blogpost (roughly 1-year old) that I've missed, I'd be happy to look at those. But, I've scoured the internet pretty well.

<ol> <li>Yes, you can send messages to a process on any remote node in a cluster, for example, by using its process identifier (pid). This is called location transparency. And yes, it scales well (see Riak, CouchDB, RabbitMQ, etc).</li> <li>Note that one node can run hundred thousands of processes. Erlang has proven to be very scalable and was built for fault tolerance. There are other approaches to build bigger, e.g. SOA approach of CloudI (see comments). You also could build clusters that use hidden nodes if you really really need to. </li> <li>At the node level you would take a different approach, for example, build identical nodes that are easy to replace if they fail and the work is taken over by the remaining nodes. Check out how Riak handles this (look into <code>riak_core</code> and check the blog post Introducing Riak Core).</li> <li>Nodes can leave and enter a cluster but cannot be part of multiple clusters at the same time. Connected nodes share one cluster cookie which is used to identify connected nodes. You can set the cookie while the VM is running (see Distributed Erlang). </li> </ol> Read http://learnyousomeerlang.com/ for greater good.

Erlang clusters

Tags:

erlang

cloud

distributed-computing

cluster-computing

I'm trying to implement a cluster using Erlang as the glue that holds it all together. I like the idea that it creates a fully connected graph of nodes, but upon reading different articles online, it seems as though this doesn't scale well (having a max of 50 - 100 nodes). Did the developers of OTP impose this limitation on purpose? I do know that you can setup nodes to have explicit connections only as well as have hidden nodes, etc. But, it seems as though the default out-of-the-box setup isn't very scalable.

So to the questions:

If you had 5 nodes (A, B, C, D, E) that all had explicit connections such that A-B-C-D-E. Does Erlang/OTP allow A to talk directly to E or does A have to pass messages from B through D to get to E, and thus that's the reason for the fully connected graph? Again, it makes sense but it doesn't scale well from what I've seen.
If one was to try and go for a scalable and fault-tolerant system, what are your options? It seems as though, if you can't create a fully connected graph because you have too many nodes, the next best thing would be to create a tree of some kind. But, this doesn't seem very fault-tolerant because if the root or any parent of children nodes dies, you would lose a significant portion of your cluster.
In looking into supervisors and workers, all of the examples I've seen apply this to processes on a single node. Could it be applied to a cluster of nodes to help implement fault-tolerance?
Can nodes be part of several clusters?

Thanks for your help, if there is a semi-recent website or blogpost (roughly 1-year old) that I've missed, I'd be happy to look at those. But, I've scoured the internet pretty well.

689

asked Nov 03 '12 18:11

SolomonS

1 Answers

Yes, you can send messages to a process on any remote node in a cluster, for example, by using its process identifier (pid). This is called location transparency. And yes, it scales well (see Riak, CouchDB, RabbitMQ, etc).
Note that one node can run hundred thousands of processes. Erlang has proven to be very scalable and was built for fault tolerance. There are other approaches to build bigger, e.g. SOA approach of CloudI (see comments). You also could build clusters that use hidden nodes if you really really need to.
At the node level you would take a different approach, for example, build identical nodes that are easy to replace if they fail and the work is taken over by the remaining nodes. Check out how Riak handles this (look into riak_core and check the blog post Introducing Riak Core).
Nodes can leave and enter a cluster but cannot be part of multiple clusters at the same time. Connected nodes share one cluster cookie which is used to identify connected nodes. You can set the cookie while the VM is running (see Distributed Erlang).

Read http://learnyousomeerlang.com/ for greater good.

192

answered Oct 08 '22 00:10

Tilman

Related questions
                            
                                What profilers and analyzers are there for Erlang/OTP?
                            
                                Actor-based distributed concurrency libraries for Ocaml and other languages [closed]
                            
                                Why must/should UI frameworks be single threaded?
                            
                                How Do You Determine The PID of the Parent of a Process
                            
                                What weaknesses can be found in using Erlang?
                            
                                Erlang Programming: Will Learning Prolog Help?
                            
                                Elixir - What does the 'use' keyword do?
                            
                                Is Erlang a Constraint-Logic programming language?
                            
                                Anonymous variables in Erlang
                            
                                Why doesn't Haskell have symbols (a la ruby) / atoms (a la erlang)?
                            
                                How do I convert an integer to a binary in Erlang?
                            
                                Elixir call Axis2 Java SOAP Web Service with detergentex and detergent
                            
                                Converting Erlang-C port example to Erlang-Golang
                            
                                How do you compile an Erlang program into a standalone windows executable?
                            
                                How does one use cached data in a functional language such as Erlang?
                            
                                rabbitmqctl Error: unable to connect to node rabbit@myserver nodedown
                            
                                How to encrypt Erlang rpc calls (and Mnesia replication) and other traffic?
                            
                                What's the best way to unit test concurrent Erlang code?
                            
                                Mailbox Processor on Distributed Systems
                            
                                Erlang -- How to convert a fun() object to a String

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With