How to use GNU make --max-load on a multicore Linux machine?

Tags:

From the documentation for GNU make: http://www.gnu.org/software/make/manual/make.html#Parallel

When the system is heavily loaded, you will probably want to run fewer jobs than when it is lightly loaded. You can use the ‘-l’ option to tell make to limit the number of jobs to run at once, based on the load average. The ‘-l’ or ‘--max-load’ option is followed by a floating-point number. For example,
 -l 2.5
will not let make start more than one job if the load average is above 2.5. The ‘-l’ option with no following number removes the load limit, if one was given with a previous ‘-l’ option.

More precisely, when make goes to start up a job, and it already has at least one job running, it checks the current load average; if it is not lower than the limit given with ‘-l’, make waits until the load average goes below that limit, or until all the other jobs finish.

From the Linux man page for uptime: http://www.unix.com/man-page/Linux/1/uptime/

System load averages is the average number of processes that are either in a runnable or uninterruptable state. A process in a runnable state is either using the CPU or waiting to use the CPU. A process in uninterruptable state is waiting for some I/O access, eg waiting for disk. The averages are taken over the three time intervals. Load averages are not normalized for the number of CPUs in a system, so a load average of 1 means a single CPU system is loaded all the time while on a 4 CPU system it means it was idle 75% of the time.

I have a parallel makefile and I want to do the obvious thing: have make to keep adding processes until I am getting full CPU usage but I'm not inducing thrashing.

Many (all?) machines today are multicore, so that means that the load average is not the number make should be checking, as that number needs to be adjusted for the number of cores.

Does this mean that the --max-load (aka -l) flag to GNU make is now useless? What are people doing who are running parallel makefiles on multicore machines?

216

asked Dec 17 '12 06:12

Daniel

1 Answers

My short answer: --max-load is useful if you're willing to invest the time it takes to make good use of it. With its current implementation there's no simple formula to pick good values, or a pre-fab tool for discovering them.

The build I maintain is fairly large. Before I started maintaining it the build was 6 hours. With -j64 on a ramdisk, now it finishes in 5 minutes (30 on an NFS mount with -j12). My goal here was to find reasonable caps for -j and -l that allows our developers to build quickly but doesn't make the server (build server or NFS server) unusable for everyone else.

To begin with:

If you choose a reasonable -jN value (on your machine) and find a reasonable upper bound for load average (on your machine), they work nicely together to keep things balanced.
If you use a very large -jN value (or unspecified; eg, -j without a number) and limit the load average, gmake will:
- continue spawning processes (gmake 3.81 added a throttling mechanism, but that only helps mitigate the problem a little) until the max # of jobs is reached or until the load average goes above your threshold
- while the load average is over your threshold:
  - do nothing until all sub-processes are finished
  - spawn one job at a time
- do it all over again

On Linux at least (and probably other *nix variants), load average is an exponential moving average (UNIX Load Average Reweighed, Neil J. Gunther) that represents the avg number of processes waiting for CPU time (can be caused by too many processes, waiting for IO, page faults, etc). Since it's an exponential moving average, it's weighted such that newer samples have a stronger influence on the current value than older samples.

If you can identify a good "sweet spot" for the right max load and number of parallel jobs (through a combination of educated guesses and empirical testing), assuming you have a long running build: your 1 min avg will hit an equilibrium point (won't fluctuate much). However, if your -jN number is too high for a given max load average, it'll fluctuate quite a bit.

Finding that sweet spot is essentially equivalent to finding optimal parameters to a differential equation. Since it will be subject to initial conditions, the focus is on finding parameters that get the system to stay at equilibrium as opposed to coming up with a "target" load average. By "at equilibrium" I mean: 1m load avg doesn't fluctuate much.

Assuming you're not bottlenecked by limitations in gmake: When you've found a -jN -lM combination that gives a minimum build time: that combination will be pushing your machine to its limits. If the machine needs to be used for other purposes ...

compiling

... you may want to scale it back a bit when you're finished optimizing.

Without regard to load avg, the improvements I saw in build time with increasing -jN appeared to be [roughly] logarithmic. That is to say, I saw a larger difference between -j8 and -j12 than between -j12 and -j16.

Things peaked for me somewhere between -j48 and -j64 (on the Solaris machine it was about -j56) because the initial gmake process is single-threaded; at some point that thread cannot start new jobs faster than they finish.

My tests were performed on:

A non-recursive build
- recursive builds may see different results; they won't run into the bottleneck I did around -j64
- I've done my best to minimize the amount of make-isms (variable expansions, macros, etc) in recipes because recipe parsing occurs in the same thread that spawns parallel jobs. The more complicated recipes are, the more time it spends in the parser instead of spawning/reaping jobs. For example:
  - No $(shell ...) macros are used in recipes; those are ran during the 1st parsing pass and cached
  - Most variables are assigned with := to avoid recursive expansion
Solaris 10/sparc
- 256 cores
- no virtualization/logical domains
- the build ran on a ramdisk
x86_64 linux
- 32-core (4x hyper threaded)
- no virtualization
- the build ran on a fast local drive

answered Oct 18 '22 02:10

Brian Vandenberg

Related questions
                            
                                GNU Make. Variables with a scope limited to a single Makefile
                            
                                how to remedy mno cygwin error?
                            
                                How to run pre- and post-recipes for every target using GNU Make?
                            
                                Makefile always recompiling a file
                            
                                Is there a configuration file for gnu make?
                            
                                multiple Target-specific Variable Values
                            
                                Calling make from within a makefile
                            
                                Makefile call function. How to get all arguments
                            
                                What is the -lrt flag in gnu-make?
                            
                                Makefile pattern rule fails?
                            
                                *** No rule to make target 'class.cpp', needed by `build/....x86/class.o` Stop. error in Ubuntu
                            
                                How to generate list of make targets automatically by globbing subdirectories?
                            
                                How does GNU make keep track of file changes?
                            
                                How to add custom targets in a qmake generated Makefile?
                            
                                How to synthesize line breaks in GNU Make warnings or errors?
                            
                                How can i split string with dot in makefile
                            
                                Checking if file exists before removing it in makefile
                            
                                Why .SECONDARY does not work with patterns (%) while .PRECIOUS does?
                            
                                Automake Variables to tidy up Makefile.am
                            
                                Automatic initialization and update of submodules in Makefile

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to use GNU make --max-load on a multicore Linux machine?

Tags:

gnu-make

multicore

uptime

Daniel

People also ask

1 Answers

Brian Vandenberg

Recent Activity

Donate For Us