Reasons for NOT scaling-up vs. -out?

Tags:

scalability

As a programmer I make revolutionary findings every few years. I'm either ahead of the curve, or behind it by about π in the phase. One hard lesson I learned was that scaling OUT is not always better; quite often the biggest performance gains came when we regrouped and scaled up.

What reasons do you have for scaling out vs. up? Price, performance, vision, projected usage? If so, how did this work for you?

We once scaled out to several hundred nodes, serializing and caching the necessary data out to each node and running maths processes on the records. Many, many billions of records needed to be (cross-)analyzed. It was the perfect business and technical case to employ scale-out. We kept optimizing until we processed about 24 hours of data in 26 hours of wall-clock time, which still meant falling further behind every day. Really long story short, we leased a gigantic (for the time) IBM pSeries, put Oracle Enterprise on it, indexed our data and ended up processing the same 24 hours of data in about 6 hours. Revolution for me.

So many enterprise systems are OLTP and the data are not sharded, yet many people still want to cluster or scale out. Is this a reaction to new techniques or to perceived performance?

Do applications in general today, or our programming mantras, lend themselves better to scale-out? Should we always take this trend into account in the future?

834

asked Nov 02 '09 22:11

Jé Queue

2 Answers

Because scaling up:

Is ultimately limited by the size of the box you can actually buy.
Can become extremely cost-ineffective, e.g. one machine with 128 cores and 128 GB of RAM is vastly more expensive than 16 machines with 8 cores and 8 GB of RAM each.
Does not work well for some things, such as IO read operations.

By scaling out, if your architecture is right, you can also achieve high availability. A machine with 128 cores and 128 GB of RAM is very expensive, and buying a second, redundant one is extortionate.

And also to some extent, because that's what Google do.

130

answered Sep 22 '22 14:09

MarkR

Scaling out is best for embarrassingly parallel problems. It takes some work, but a number of web services fit that category (hence the current popularity). Otherwise you run into Amdahl's law: the serial part of the work caps the overall speed-up, so to go faster you have to scale up, not out. I suspect you ran into that problem. IO-bound operations also tend to do well with scaling out, largely because waiting for IO increases the percentage of the work that is parallelizable.
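To put rough numbers on Amdahl's law, here is a minimal sketch. The function names and the parallel fractions in the loop are made up for illustration; they are not measurements from the system described in the question.

```python
def amdahl_speedup(parallel_fraction: float, workers: int) -> float:
    """Best-case speed-up under Amdahl's law.

    parallel_fraction: share of the runtime that can be spread across workers (0..1).
    workers: number of nodes or cores doing the parallel part.
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if workers < 1:
        raise ValueError("workers must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    # The serial part takes the same time no matter how many workers exist;
    # only the parallel part shrinks as workers are added.
    return 1.0 / (serial_fraction + parallel_fraction / workers)


def max_speedup(parallel_fraction: float) -> float:
    """Ceiling on the speed-up as the number of workers grows without bound."""
    serial_fraction = 1.0 - parallel_fraction
    return float("inf") if serial_fraction == 0.0 else 1.0 / serial_fraction


if __name__ == "__main__":
    # Hypothetical parallel fractions, chosen only to show the trend.
    for p in (0.50, 0.90, 0.99):
        for n in (8, 64, 512):
            print(f"p={p:.2f} workers={n:4d} speed-up={amdahl_speedup(p, n):7.2f}")
        print(f"p={p:.2f} ceiling={max_speedup(p):.1f}")
```

With 90% of the work parallelizable, no number of extra nodes gets past roughly a 10x speed-up. Past that point the only lever left is making the serial part itself faster, which is what a bigger box (or a better index) does.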

answered Sep 22 '22 14:09

Kathy Van Stone

