I did a load test of my Rails application yesterday, running 8 dynos with 3 concurrent Unicorn processes on each. This is the New Relic output:
As you can see, my Rails stack itself has a pretty good response time (DB, Web, etc), but the queue time is super terrible.
What can I do about this? Is this inherent in Heroku performance, or does it just mean I need to add more dynos?
Any advice appreciated.
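For context, a setup like the one described (3 Unicorn workers per dyno on Heroku) is usually wired up with a config/unicorn.rb along these lines. This is a sketch of a typical configuration, not the asker's actual file; the values and hook bodies are assumptions:

```ruby
# config/unicorn.rb -- typical Heroku Unicorn setup (values assumed, not the asker's real config)
worker_processes 3        # matches "3 concurrent Unicorn processes" per dyno
timeout 30                # kill workers that hang longer than the router will wait
preload_app true          # load the app once in the master, then fork workers

before_fork do |server, worker|
  # Disconnect in the master so each worker opens its own DB connection after forking
  ActiveRecord::Base.connection.disconnect! if defined?(ActiveRecord::Base)
end

after_fork do |server, worker|
  ActiveRecord::Base.establish_connection if defined?(ActiveRecord::Base)
end
```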
Basically, break the problem down into its parts and test each part. Simply throwing a bunch of requests at a cluster of Unicorns isn't necessarily a good way to measure throughput; you have to consider many variables (side note: check out "Programmers Need To Learn Statistics Or I Will Kill Them All" by Zed Shaw).
Also, you're leaving out critical information needed to solve the mystery: details about your test setup and your traffic that only you can provide.
Queuing time, if I understand Heroku's setup correctly, is essentially the time new requests sit waiting for an available Unicorn worker (or, to be more accurate with Unicorn, how long requests sit on the socket before a worker grabs them). If you're load testing and feeding the system more than it can handle, then even though your app may serve the requests it's ready for very quickly, there will still be a backlog of requests waiting for an available worker to process them.
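If you want to see that wait in your own logs rather than only in New Relic, a tiny Rack middleware can approximate it. The sketch below assumes Heroku's router sets an X-Request-Start header containing a Unix timestamp in milliseconds; the header format has changed over the years, so verify it before trusting the numbers:

```ruby
require "logger"

# Sketch: log roughly how long a request sat in the queue before a worker
# picked it up (assumes X-Request-Start is epoch milliseconds).
class QueueTimeLogger
  def initialize(app, logger: Logger.new($stdout))
    @app = app
    @logger = logger
  end

  def call(env)
    if (start = env["HTTP_X_REQUEST_START"])
      waited_ms = (Time.now.to_f * 1000) - start.to_f
      @logger.info("queue wait ~#{waited_ms.round}ms")
    end
    @app.call(env)
  end
end

# In config/application.rb (assumed placement):
#   config.middleware.insert_before 0, QueueTimeLogger
```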
Depending on your original setup, vary the test parameters one at a time (for example, the number of dynos, the number of Unicorn workers per dyno, and the request rate) and record the results of each run.
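One small change that makes those runs easier is reading the worker count from an environment variable (WEB_CONCURRENCY is a common convention, not something from the question), so you can change it between runs with heroku config:set WEB_CONCURRENCY=4 instead of redeploying a code change; dyno count scales separately with heroku ps:scale web=N:

```ruby
# config/unicorn.rb -- parameterize the worker count for experimentation
# (WEB_CONCURRENCY is an assumed, conventional variable name)
worker_processes Integer(ENV["WEB_CONCURRENCY"] || 3)
```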
Also, find out whether the test results chart you're showing above is an average, a 95th percentile with standard deviations, or some other measurement.
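If all you have is a pile of raw response times, the difference between those measurements is easy to compute yourself. Here's a quick sketch with made-up sample data, just to show how differently they can read:

```ruby
# Compare mean, median, and 95th percentile for a set of response times (ms).
# The sample data is invented purely to show how much the tail skews the mean.
times  = [42, 45, 47, 51, 55, 60, 62, 70, 300, 1200]
sorted = times.sort

mean   = times.sum.to_f / times.size
median = sorted[sorted.size / 2]
p95    = sorted[(sorted.size * 0.95).ceil - 1]

puts "mean=#{mean.round}ms median=#{median}ms p95=#{p95}ms"
# => mean=193ms median=60ms p95=1200ms
```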
Only after you've broken the problem down into its component parts will you know, with any predictability, whether adding more Unicorn workers will help. Looking at this basic chart and asking, "Should I just add more unicorns?" is like having a slow computer and asking, "Should I just add more RAM to my machine?" It may help, but you're skipping the step of actually understanding why something is slow, and adding more of something won't give you any deeper understanding of why it was slow in the first place. Because of this (and especially on Heroku), you might wind up overpaying for dynos you don't need. If you can get to the root of what's causing the longer-than-expected queuing times, you'll be in much better shape.
This approach, of course, isn't unique to Heroku. Running experiments, tweaking variables, and recording the outcome measurements will let you pick apart what's going on inside those performance numbers. Understanding the "why" will enable you to take specific, educated steps that have mostly predictable effects on overall performance.
After all of that you may find that yes, the best way to improve performance in your specific case is to add more Unicorn workers, but at least you'll know why and when to do so, and you'll have a really solid guess as to how many to add.
I had essentially written out another question of my own, then sat back and realized I had edited this exact question a week before, and that I knew the answer to both.
What jefflunt said is basically 100% true, but since I'm here, I'll spell it out.
There are two solutions, and they basically boil down to the same exact concept.
Granted, this is only the roughest of frameworks for gauging the problem, especially because traffic is always weighted somehow, and taking an average (rather than the median) is usually a better gauge because it gives more weight to the slow requests out at the 95th percentile. But it will get you close to the right number for understanding what kind of capacity you need.
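As a rough illustration of that kind of capacity math (my own back-of-envelope sketch with assumed numbers, not figures from this question): the number of workers you need is roughly your request rate multiplied by how long each request takes.

```ruby
# Back-of-envelope capacity sketch (all numbers assumed for illustration).
requests_per_second = 100    # assumed peak traffic
avg_response_time_s = 0.150  # assumed average (or p95) response time in seconds

workers_needed   = (requests_per_second * avg_response_time_s).ceil
workers_per_dyno = 3         # as in the question's setup
dynos_needed     = (workers_needed.to_f / workers_per_dyno).ceil

puts "~#{workers_needed} Unicorn workers, i.e. ~#{dynos_needed} dynos at #{workers_per_dyno} workers each"
# => ~15 Unicorn workers, i.e. ~5 dynos at 3 workers each
```

Provision fewer workers than that and requests start backing up waiting for a free Unicorn, which is exactly the queue time New Relic is reporting.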