Understanding of MSVS C++ compiler optimizations

Tags:

I do not understand what is happening in this code. The C code is:

#include <stdio.h>

int main()
{
    const int mul = 100;
    int x;
    printf_s("Input a number\r\n");
    scanf_s("%i", &x);
    printf_s("%i/%i = %i\r\n", x, mul, x / mul);
    return 0;
}

I expected that the resulting assembly will be some simple shifts and add/sub operations, but there are some magic constants like 51EB851Fh, multiplications, etc. What is happening here?

; int __cdecl main()
_main proc near

x= dword ptr -8
var_4= dword ptr -4

push    ebp
mov     ebp, esp
sub     esp, 8
mov     eax, ___security_cookie
xor     eax, ebp
mov     [ebp+var_4], eax
push    offset Format   ; "Input a number\r\n"
call    ds:__imp__printf_s
lea     eax, [ebp+x]
push    eax
push    offset aI       ; "%i"
call    ds:__imp__scanf_s
mov     ecx, [ebp+x]
mov     eax, 51EB851Fh
imul    ecx
sar     edx, 5
mov     eax, edx
shr     eax, 1Fh
add     eax, edx
push    eax
push    64h
push    ecx
push    offset aIII     ; "%i/%i = %i\r\n"
call    ds:__imp__printf_s
mov     ecx, [ebp+var_4]
add     esp, 1Ch
xor     ecx, ebp        ; cookie
xor     eax, eax
call    @__security_check_cookie@4 ; __security_check_cookie(x)
mov     esp, ebp
pop     ebp
retn
_main endp

976

asked Jul 08 '14 09:07

Alex Zhukovskiy

1 Answers

Processors are not very good at dividing, an idiv can take between 11 and 18 cycles. As opposed to shifts and multiplies, they usually only take a single cycle.

So the optimizer replaced your division by a multiplication using fixed-point math, taking advantage of a 32-bit multiply producing a 64-bit result into edx:eax. Back-of-the-envelope: n / 100 == n * 0.32 / 32 == n * (0.32 * pow(2,32)) / 32 / pow(2,32). Those divisions are very cheap, just a right-shift. And the multiplier becomes 0.32 * pow(2,32) ~= 1374389535 == 0x51EB851F

127

answered Oct 17 '22 02:10

Hans Passant

Related questions
                            
                                How can I zero just the padding bytes of a class?
                            
                                Convert json value to int in c++
                            
                                QTcpServer can only be accessed through localhost
                            
                                evaluation order initialization array in c++
                            
                                Iterating over a list in pseudo-random order without storing a shuffled list
                            
                                boost::exception - how to print details?
                            
                                Passing data through tessellation shaders to the fragment shader
                            
                                How to store formatting settings with an IOStream?
                            
                                Why will two-phase lookup fail to choose overloaded version of 'swap'?
                            
                                Basic timer with std::thread and std::chrono
                            
                                How to read a specific line from QPlainTextEdit
                            
                                How to fill sockaddr_storage?
                            
                                Qt ISODate formatted date/time including timezone
                            
                                Is x/a the same as x*(1/a) for floats?
                            
                                Performing set_difference on unordered sets
                            
                                passing allocated pointer before it allocated
                            
                                Visual Studio: How to use platform toolset as preprocessor directive?
                            
                                Performance: boost.compute v.s. opencl c++ wrapper
                            
                                sparse vector in C++? [closed]
                            
                                Eigen Matrix vs Numpy Array multiplication performance

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Understanding of MSVS C++ compiler optimizations

Tags:

c++

c

optimization

visual-studio

assembly

Alex Zhukovskiy

People also ask

1 Answers

Hans Passant

Recent Activity

Donate For Us