We've been fighting with HAProxy for a few days now in Amazon EC2; the experience has so far been great, but we're stuck on squeezing more performance out of the software load balancer. We're not exactly Linux networking whizzes (we're a .NET shop, normally), but we've held our own so far, attempting to set proper ulimits and inspecting kernel messages and tcpdumps for any irregularities. Even so, we've hit a plateau of about 1,700 requests/sec, at which point client timeouts abound (we've been using and tweaking httperf for this purpose). A coworker and I were listening to the most recent Stack Overflow podcast, in which the Reddit founders note that their entire site runs off one HAProxy node, and that it hasn't become a bottleneck so far. Ack! Either they're somehow not seeing that many concurrent requests, we're doing something horribly wrong, or the shared nature of EC2 is limiting the network stack of the EC2 instance (we're using a large instance type). Given that both Joel and the Reddit founders agree that the network will likely be the limiting factor, is it possible that's the limitation we're seeing?
Any thoughts are greatly appreciated!
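For context, the httperf runs were along these lines (a sketch with illustrative numbers and a hypothetical hostname, not the exact command):

    # Open 60,000 connections at 1,000 conns/sec, one request per connection,
    # counting anything slower than 5s as a timeout. --hog lets httperf use
    # the full ephemeral port range instead of only ports 1024-5000.
    httperf --hog --server lb.example.com --port 80 --uri / \
        --rate 1000 --num-conns 60000 --num-calls 1 --timeout 5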
Edit: It looks like the actual issue was not, in fact, with the load balancer node! The culprit was the nodes running httperf. Because httperf builds and tears down a socket for each request, it spends a good amount of CPU time in the kernel. As we pushed the request rate higher, the TCP FIN timeout (tcp_fin_timeout, 60s by default) was keeping sockets around too long, and the default ip_local_port_range was too narrow for this usage scenario. Basically, after a few minutes of the client (httperf) node constantly creating and destroying sockets, it ran out of unused ports, and subsequent 'requests' errored out at that stage, yielding low requests/sec numbers and a large number of errors.
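One quick way to confirm this kind of ephemeral-port exhaustion while a test is running (a rough sketch; the exact counts will vary):

    # Count client sockets stuck in TIME_WAIT; if this approaches the size of
    # the ephemeral port range, new outbound connections start to fail.
    netstat -an | grep -c TIME_WAIT

    # Show the current ephemeral port range and FIN timeout
    cat /proc/sys/net/ipv4/ip_local_port_range
    cat /proc/sys/net/ipv4/tcp_fin_timeout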
We had also looked at nginx, but we've been working with RightScale, and they've got drop-in scripts for HAProxy. Oh, and we've got too tight a deadline [of course] to switch out components unless it proves absolutely necessary. Mercifully, being on AWS lets us test out another setup using nginx in parallel (if warranted) and make the switch overnight later on.
This page describes each of the sysctl variables fairly well (ip_local_port_range and tcp_fin_timeout were tuned, in this case).
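In practice the tuning amounts to something like the following on the client nodes (the values here are illustrative, not a recommendation):

    # Widen the ephemeral port range so more concurrent outbound sockets fit
    sudo sysctl -w net.ipv4.ip_local_port_range="1024 65535"

    # Reclaim sockets in FIN-WAIT-2 sooner than the 60-second default
    sudo sysctl -w net.ipv4.tcp_fin_timeout=30

    # To persist across reboots, add the same keys to /etc/sysctl.conf:
    #   net.ipv4.ip_local_port_range = 1024 65535
    #   net.ipv4.tcp_fin_timeout = 30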
What is Elastic Load Balancing? Elastic Load Balancing automatically distributes your incoming traffic across multiple targets, such as EC2 instances, containers, and IP addresses, in one or more Availability Zones. It monitors the health of its registered targets and routes traffic only to the healthy targets.
Elastic Load Balancing supports the following types of load balancers: Application Load Balancers, Network Load Balancers, and Classic Load Balancers.
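As a rough illustration, creating an Application Load Balancer with the AWS CLI looks something like this (the name and subnet IDs are placeholders):

    # Create an Application Load Balancer spanning two subnets
    aws elbv2 create-load-balancer \
        --name my-alb \
        --type application \
        --subnets subnet-0123456789abcdef0 subnet-0fedcba9876543210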
With the AWS Management Console, the option to enable cross-zone load balancing is selected by default. After you create a Classic Load Balancer, you can enable or disable cross-zone load balancing at any time. For more information, see Enable cross-zone load balancing in the User Guide for Classic Load Balancers.
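Outside the console, the same setting can be toggled with the AWS CLI; a sketch, assuming a Classic Load Balancer named my-classic-lb (placeholder):

    # Enable cross-zone load balancing on an existing Classic Load Balancer
    aws elb modify-load-balancer-attributes \
        --load-balancer-name my-classic-lb \
        --load-balancer-attributes "{\"CrossZoneLoadBalancing\":{\"Enabled\":true}}"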
Not answering the question directly, but EC2 now supports load balancing through Elastic Load Balancing, rather than requiring you to run your own load balancer on an EC2 instance.
EDIT: Amazon's Route 53 DNS service now offers a way to point a top-level domain at an ELB with an "alias" record. Since Amazon knows the current IP address of the ELB, it can return an A record for that current IP rather than having to use a CNAME record, while still being free to change the IP from time to time.
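For illustration, such an alias record can be created through the Route 53 API/CLI along these lines (the zone IDs, domain, and ELB DNS name below are placeholders; note that the alias target's HostedZoneId is the ELB's region-specific canonical zone, not your own hosted zone):

    # change-batch.json -- point the zone apex at an ELB via an alias A record
    {
      "Changes": [{
        "Action": "UPSERT",
        "ResourceRecordSet": {
          "Name": "example.com.",
          "Type": "A",
          "AliasTarget": {
            "HostedZoneId": "Z35EXAMPLE",
            "DNSName": "my-elb-1234567890.us-east-1.elb.amazonaws.com.",
            "EvaluateTargetHealth": false
          }
        }
      }]
    }

    # Apply the change to your own hosted zone (Z1EXAMPLE is a placeholder)
    aws route53 change-resource-record-sets \
        --hosted-zone-id Z1EXAMPLE \
        --change-batch file://change-batch.json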