I have an use case where I am looking to replicate a single database on multiple servers (for HA and scalability purposes), Would there be any disadvantage to run a 3 node replica instead of a 3 nodes cluster ?

Couchdb docs 11.2 provides an example cluster configuration of: <pre class="prettyprint"><code>[cluster] q=8 r=2 w=2 n=3 </code></pre> <blockquote> q - The number of shards. r - The number of copies of a document with the same revision that have to be read before CouchDB returns with a 200 and the document. If there is only one copy of the document accessible, then that is returned with 200. w - The number of nodes that need to save a document before a write is returned with 201. If the nodes saving the document is 0, 202 is returned. n - The number of copies there is of every document. Replicas. </blockquote> The behavior of your 3 part replica should be equivalent to: <pre class="prettyprint"><code>[cluster] q=1 r=1 w=1 n=3 </code></pre> when replicating correctly. This is a possible configuration of clustering, but not an optimal as it lacks: <ul> <li>the benefit of confirmation that multiple nodes and a majority of nodes have confirmed a save before it is acknowledged.</li> <li>the benefit of confirmation that multiple nodes and a majority of nodes have confirmed a revision is correct before it is returned.</li> <li>Expandability of the database beyond a single node's storage via sharding.</li> <li>The ability to change to any configuration equivalent to cluster parameters with q, r or w > 1 without switching to a cluster.</li> </ul> Indirectly, the limits on acknowledgements make more potential conflicts to resolve between the replicas if the replicas are actually used for network scalability, and greater odds an actual inconsistency in the form of lost records if a node fails between acknowledging a save and passing it on to the other replicas.

Which version of CouchDB will you be using? If 2.0.0+, there's probably no reason not to use true clustering. The only reason I can think of to use replicas instead of clustering would be for ease of configuration, or because your db (i.e. CouchDB < 2.0.0) doesn't support it. But if you use clustering, even on just 3 nodes now, you're already set up for greater expansion later, just by adding more nodes. Is there a reason you might not want to use a cluster?

Cluster vs replication

2 Answers

Couchdb docs 11.2 provides an example cluster configuration of:

[cluster]
  q=8
  r=2
  w=2
  n=3

q - The number of shards.

r - The number of copies of a document with the same revision that have to be read before CouchDB returns with a 200 and the document. If there is only one copy of the document accessible, then that is returned with 200.

w - The number of nodes that need to save a document before a write is returned with 201. If the nodes saving the document is 0, 202 is returned.

n - The number of copies there is of every document. Replicas.

The behavior of your 3 part replica should be equivalent to:

[cluster]
  q=1
  r=1
  w=1
  n=3

when replicating correctly. This is a possible configuration of clustering, but not an optimal as it lacks:

the benefit of confirmation that multiple nodes and a majority of nodes have confirmed a save before it is acknowledged.
the benefit of confirmation that multiple nodes and a majority of nodes have confirmed a revision is correct before it is returned.
Expandability of the database beyond a single node's storage via sharding.
The ability to change to any configuration equivalent to cluster parameters with q, r or w > 1 without switching to a cluster.

Indirectly, the limits on acknowledgements make more potential conflicts to resolve between the replicas if the replicas are actually used for network scalability, and greater odds an actual inconsistency in the form of lost records if a node fails between acknowledging a save and passing it on to the other replicas.

answered Oct 19 '22 15:10

lossleader

Which version of CouchDB will you be using? If 2.0.0+, there's probably no reason not to use true clustering.

The only reason I can think of to use replicas instead of clustering would be for ease of configuration, or because your db (i.e. CouchDB < 2.0.0) doesn't support it.

But if you use clustering, even on just 3 nodes now, you're already set up for greater expansion later, just by adding more nodes.

Is there a reason you might not want to use a cluster?

answered Oct 19 '22 16:10

Flimzy

Related questions
                            
                                Stateless pagination in CouchDB?
                            
                                Using both graph db and document db
                            
                                Uninstall CouchDB Completely Mac OSX [closed]
                            
                                Couchdb external authentication
                            
                                PouchDB delete data on device without affecting remote sync
                            
                                How can I trigger an AWS Lambda in response to CouchDB change events using only AWS?
                            
                                How to create a TEXT index in CouchDB 2.0?
                            
                                Bulk insert into Hyperledger Fabric keeps timing out
                            
                                Creating a couchdb standalone attachment using cURL
                            
                                What is the "revpos" value used for in CouchDB attachments?
                            
                                Should I make my CouchDB database server public-facing?
                            
                                CouchDB: Linking a document that references an array of different document types
                            
                                What kind of application would CouchDB be most useful/performant for?
                            
                                What is the best way to iterate or recurse through huge amounts of huge functions without exceeding the stack limit?
                            
                                RDBMS vs NoSQL for CRM, CMS and other financial Systems [closed]
                            
                                How to manage pouchdb and couchdb synchronization?
                            
                                Change notification in CouchDB when a field is set
                            
                                Can I replicate between CouchBase on Android and Couch DB running on Linux?
                            
                                couchdb: map in design document gives compilation_error
                            
                                Can I use Couchdb as a webserver and the only backend?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cluster vs replication

Tags:

couchdb

database-cluster

romainrbr

People also ask

2 Answers

lossleader

Flimzy

Recent Activity

Donate For Us