Question says it all. Does anyone know if the following...
size_t div(size_t value) { const size_t x = 64; return value / x; }
...is optimized into?
size_t div(size_t value) { return value >> 6; }
Do compilers do this? (My interest lies in GCC). Are there situations where it does and others where it doesn't?
I would really like to know, because every time I write a division that could be optimized like this I spend some mental energy wondering whether precious nothings of a second are wasted doing a division where a shift would suffice.
Compiler optimization is generally implemented using a sequence of optimizing transformations, algorithms which take a program and transform it to produce a semantically equivalent output program that uses fewer resources or executes faster.
The C/C++ compiler compiles each source file separately and produces the corresponding object file. This means the compiler can only apply optimizations on a single source file rather than on the whole program. However, some important optimizations can be performed only by looking at the whole program.
With -Os the compiler optimizes to reduce the size of the binary rather than execution speed. If you do not specify an optimization option, GCC attempts to reduce compilation time and to make debugging always yield the result expected from reading the source code.
GCC has a range of optimization levels, plus individual options to enable or disable particular optimizations. The overall optimization level is controlled by the command-line option -On, where n is the required optimization level; -O0 (the default) disables optimization.
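To see for yourself what a given level produces, you can ask GCC to emit assembly and inspect it. A minimal sketch (the file name div.cpp and the output names are just placeholders):

// div.cpp
#include <cstddef>

std::size_t div(std::size_t value) {
    const std::size_t x = 64;
    return value / x;   // does this become a shift?
}

// Generate assembly at different optimization levels and compare:
//   g++ -O0 -S div.cpp -o div_O0.s
//   g++ -O1 -S div.cpp -o div_O1.s
// Then look for a shrq (shift) versus a divq (hardware divide) in the output.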
Even with g++ -O0 (yes, -O0!), this happens. Your function compiles down to:
_Z3divm:
.LFB952:
        pushq   %rbp
.LCFI0:
        movq    %rsp, %rbp
.LCFI1:
        movq    %rdi, -24(%rbp)
        movq    $64, -8(%rbp)
        movq    -24(%rbp), %rax
        shrq    $6, %rax
        leave
        ret
Note the shrq $6, which is a right shift by 6 places.
With -O1, the unnecessary junk is removed:

_Z3divm:
.LFB1023:
        movq    %rdi, %rax
        shrq    $6, %rax
        ret
Results on g++ 4.3.3, x64.
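Incidentally, a bare shift is only an exact replacement here because size_t is unsigned; for signed types, division by 64 and a plain arithmetic shift disagree on negative values, so the compiler has to emit a small fix-up. A minimal sketch illustrating that (function names are just for illustration):

#include <cassert>
#include <cstddef>

std::size_t div_by_64(std::size_t value) { return value / 64; }
std::size_t shift_by_6(std::size_t value) { return value >> 6; }

int main() {
    // For unsigned types the two forms agree on every value,
    // which is why a bare shrq is enough.
    for (std::size_t v = 0; v < 1000000; ++v)
        assert(div_by_64(v) == shift_by_6(v));
    // For signed types they differ: -1 / 64 == 0, but -1 >> 6 == -1
    // (assuming arithmetic right shift), so signed division by a
    // power of 2 needs an extra adjustment before the shift.
}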
Most compilers will go even further than reducing division by powers of 2 into shifts - they'll often convert integer division by a constant into a series of multiplication, shift, and addition instructions to get the result instead of using the CPU's built-in divide instruction (if there even is one).
For example, MSVC converts division by 71 to the following:
// volatile int y = x / 71;
8b 0c 24          mov  ecx, DWORD PTR _x$[esp+8] ; load x into ecx
b8 49 b4 c2 e6    mov  eax, -423447479           ; magic happens starting here...
f7 e9             imul ecx                       ; edx:eax = x * 0xe6c2b449
03 d1             add  edx, ecx                  ; edx = x + edx
c1 fa 06          sar  edx, 6                    ; edx >>= 6 (with sign fill)
8b c2             mov  eax, edx                  ; eax = edx
c1 e8 1f          shr  eax, 31                   ; eax >>= 31 (no sign fill)
03 c2             add  eax, edx                  ; eax += edx
89 04 24          mov  DWORD PTR _y$[esp+8], eax
So, you get a divide by 71 with a multiply, a couple shifts and a couple adds.
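If you want to convince yourself that this multiply/shift/add sequence really divides by 71, here is a sketch that mirrors the instruction sequence above in C++ (the function name div71 is just illustrative; the magic constant is the one from the MSVC listing, and right-shifting negative values is assumed to behave like sar, as it does on mainstream compilers):

#include <cassert>
#include <cstdint>

std::int32_t div71(std::int32_t x) {
    const std::int32_t magic = -423447479;                      // 0xe6c2b449
    std::int64_t prod = static_cast<std::int64_t>(x) * magic;   // imul: edx:eax = x * magic
    std::int32_t hi = static_cast<std::int32_t>(prod >> 32);    // take the high half (edx)
    hi += x;                                                     // add edx, ecx
    hi >>= 6;                                                    // sar edx, 6
    std::int32_t sign =
        static_cast<std::int32_t>(static_cast<std::uint32_t>(hi) >> 31); // shr eax, 31
    return hi + sign;                                            // add eax, edx
}

int main() {
    // Spot-check the sequence against ordinary division.
    for (std::int32_t x = -100000; x <= 100000; ++x)
        assert(div71(x) == x / 71);
}

The final shr/add pair is the usual correction that makes the truncation round toward zero for negative dividends, matching C's integer-division semantics.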
For more details on what's going on, consult Henry Warren's "Hacker's Delight" book or its companion website.
There's an online added chapter that provides some additional information about division by constants using multiply/shift/add with magic numbers, and a page with a little JavaScript program that'll calculate the magic numbers you need.
The companion site for the book is well worth reading (as is the book) - particularly if you're interested in bit-level micro optimizations.
Another article that I discovered just now that discusses this optimization: http://blogs.msdn.com/devdev/archive/2005/12/12/502980.aspx