I have a cluster of 3 ElasticSearch nodes running on AWS EC2. These nodes are setup using OpsWorks/Chef. My intent is to design this cluster to be very resilient and elastic (nodes can come in and out when needed). From everything I've read about ElasticSearch, it seems like no one recommends putting a load balancer in front of the cluster; instead, it seems like the recommendation is to do one of two things: <ol> <li>Point your client at the URL/IP of one node, let ES do the load balancing for you and hope that node never goes down.</li> <li>Hard-code the URLs/IPs of ALL your nodes into your client app and have the app handle the failover logic.</li> </ol> My background is mostly in web farms where it's just common sense to create a huge pool of autonomous web servers, throw an ELB in front of them and let the load balancer decide what nodes are alive or dead. Why does ES not seem to support this same architecture?

I believe load balancing an Elasticsearch cluster is a good idea (designing a fault tolerant system, resilient to single node failure.) To architect your cluster you'll need background on the two primary functions of Elasticsearch: 1. Writing and updating documents and 2. Querying Documents. Writing / indexing documents in elasticsearch: <ol> <li>When a new document comes into Elasticsearch to be indexed, Elasticsearch determines the "primary shard" the document should be assigned to using the "Shard Routing Algorithm" </li> <li>The Lucene process associated with the shard "maps" the fields in the document;</li> <li>The Lucene process adds the document to the shard's Lucene "inverted index"</li> <li>Any "replica shard(s)" then receive the document; the replica shard "maps" the document and adds the document to the replica shard's Lucene "inverted index"</li> </ol> Querying documents in Elasticsearch: <ol> <li>By default, when a query is sent to Elasticsearch, the query hits a node -- this becomes the "query node" or the "gateway query node" for that query</li> <li>The node broadcasts the query to every shard in the index (primary & replica)</li> <li>each shard performs query on the shard's local Lucene inverted index.</li> <li>each shard returns the top 10 - 20 results to the "gateway query node" </li> <li>the "gateway query node" then performs a merge-sort on the combined results returned from the other shards, </li> <li>once the merge-sort is finished, the "gateway query node" and returns results to the client <ul> <li>the merge-sort is CPU and Memory resource heavy</li> </ul> </li> </ol> Architect a Load Balancer for Writes / Indexing / Updates Elasticsearch self manages the location of shards on nodes. The "master node" keeps and updates the "shard routing table". The "master node" provides a copy of the shard routing table to other nodes in the cluster. Generally, you don't want your master node doing much more than health checks for the cluster and updating routing tables, and managing shards. It's probably best to point the load balancer for writes to the "data nodes" (Data nodes are nodes that contain data = shards) and let the data nodes use their shard routing tables to get the writes to the correct shards. Architecting for Queries Elasticsearch has created a special node type: "client node", which contains "no data", and cannot become a "master node". The client node's function is to perform the final resource heavy merge-sort at the end of the query. For AWS you'd probably use a c3 or c4 instance type as a "client node" Best practice is to point the load balancer for queries to client nodes. Cheers! References: <ol> <li>Elasticsearch Node Types</li> <li>Elasticsearch: Shard Routing Algorithm</li> <li>Elasticsearch: Replica Shards</li> <li>Elasticsearch: Cluster State i.e. the Shard Routing Table</li> <li>ElasticHQ - Introduction to Elasticsearch Video</li> <li>Elasticsearch: Shard numbers and Cluster Scaling</li> </ol>

Is using a load balancer with ElasticSearch unnecessary?

Tags:

amazon-web-services

amazon-ec2

nosql

lucene

elasticsearch

I have a cluster of 3 ElasticSearch nodes running on AWS EC2. These nodes are setup using OpsWorks/Chef. My intent is to design this cluster to be very resilient and elastic (nodes can come in and out when needed).

From everything I've read about ElasticSearch, it seems like no one recommends putting a load balancer in front of the cluster; instead, it seems like the recommendation is to do one of two things:

Point your client at the URL/IP of one node, let ES do the load balancing for you and hope that node never goes down.
Hard-code the URLs/IPs of ALL your nodes into your client app and have the app handle the failover logic.

My background is mostly in web farms where it's just common sense to create a huge pool of autonomous web servers, throw an ELB in front of them and let the load balancer decide what nodes are alive or dead. Why does ES not seem to support this same architecture?

827

asked Jul 15 '14 06:07

user2719100

1 Answers

I believe load balancing an Elasticsearch cluster is a good idea (designing a fault tolerant system, resilient to single node failure.)

To architect your cluster you'll need background on the two primary functions of Elasticsearch: 1. Writing and updating documents and 2. Querying Documents.

Writing / indexing documents in elasticsearch:

When a new document comes into Elasticsearch to be indexed, Elasticsearch determines the "primary shard" the document should be assigned to using the "Shard Routing Algorithm"
The Lucene process associated with the shard "maps" the fields in the document;
The Lucene process adds the document to the shard's Lucene "inverted index"
Any "replica shard(s)" then receive the document; the replica shard "maps" the document and adds the document to the replica shard's Lucene "inverted index"

Querying documents in Elasticsearch:

By default, when a query is sent to Elasticsearch, the query hits a node -- this becomes the "query node" or the "gateway query node" for that query
The node broadcasts the query to every shard in the index (primary & replica)
each shard performs query on the shard's local Lucene inverted index.
each shard returns the top 10 - 20 results to the "gateway query node"
the "gateway query node" then performs a merge-sort on the combined results returned from the other shards,
once the merge-sort is finished, the "gateway query node" and returns results to the client
- the merge-sort is CPU and Memory resource heavy

Architect a Load Balancer for Writes / Indexing / Updates

Elasticsearch self manages the location of shards on nodes. The "master node" keeps and updates the "shard routing table". The "master node" provides a copy of the shard routing table to other nodes in the cluster.

Generally, you don't want your master node doing much more than health checks for the cluster and updating routing tables, and managing shards.

It's probably best to point the load balancer for writes to the "data nodes" (Data nodes are nodes that contain data = shards) and let the data nodes use their shard routing tables to get the writes to the correct shards.

Architecting for Queries

Elasticsearch has created a special node type: "client node", which contains "no data", and cannot become a "master node". The client node's function is to perform the final resource heavy merge-sort at the end of the query.

For AWS you'd probably use a c3 or c4 instance type as a "client node"

Best practice is to point the load balancer for queries to client nodes.

Cheers!

References:

Elasticsearch Node Types
Elasticsearch: Shard Routing Algorithm
Elasticsearch: Replica Shards
Elasticsearch: Cluster State i.e. the Shard Routing Table
ElasticHQ - Introduction to Elasticsearch Video
Elasticsearch: Shard numbers and Cluster Scaling

104

answered Sep 19 '22 11:09

Manchego

Related questions
                            
                                How to set up an OAuth2 Authentication Provider with AWS API Gateway?
                            
                                What is difference between AWS S3 Select and AWS Athena?
                            
                                How to temporarily switch profiles for AWS CLI?
                            
                                Name an EC2 Instance in the CloudFormation template?
                            
                                AWS malformed policy error
                            
                                How to make CloudFront never cache index.html on S3 bucket
                            
                                AWS EC2 Ubuntu 12.04.1 LTS: deb command-not-found [closed]
                            
                                running git clone against AWS CodeCommits gets me a 403 error
                            
                                Getting "EndpointDisabled" from Amazon SNS
                            
                                How do I delete an object on AWS S3 using Javascript?
                            
                                Installing PostgreSQL Client v10 on AWS Amazon Linux (EC2) AMI
                            
                                MongoDB connections from AWS Lambda
                            
                                Amazon Web Service (AWS) account (trial version) without credit card info [closed]
                            
                                Linking containers between task definitions in AWS ECS?
                            
                                How to create a new version of a Lambda function using CloudFormation?
                            
                                How can one determine the current region within an AWS Lambda function?
                            
                                AWS S3 object listing
                            
                                502 Bad Gateway Deploying Express Generator Template on Elastic Beanstalk
                            
                                Attaching and mounting existing EBS volume to EC2 instance filesystem issue
                            
                                Should I use Amazon's AWS Virtual Private Cloud (VPC) [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With