I've often noticed gcc converting multiplications into shifts in the executable. Something similar might happen when multiplying an int and a float. For example, 2 * f might simply increment the exponent of f by 1, saving some cycles. Do compilers generally do this, perhaps if one requests it (e.g. via -ffast-math)?
Are compilers generally smart enough to do this, or do I need to do it myself using the scalb*() or ldexp()/frexp() function families?
Floating-point decimal values generally do not have an exact binary representation; this is a consequence of how floating-point numbers are stored in binary. For this reason, you may experience some loss of precision, and some floating-point operations may produce unexpected results.
A float has 23 stored bits of mantissa (24 significant bits counting the implicit leading 1), and 2^23 is 8,388,608. Those bits let you store all 6-digit decimal numbers and most 7-digit ones. This means that floating-point numbers have between 6 and 7 decimal digits of precision, regardless of the exponent.
The result of multiplying a float and an integer is always a float: the integer operand is converted to floating point before the multiplication. If you need to round the result to the nearest integer afterwards, you can use the round() function.
To store negative exponents without a separate sign bit, floating-point uses a biased exponent, which is the actual exponent plus a constant bias. 32-bit floating-point uses a bias of 127. For example, for the exponent 7, the biased exponent is 7 + 127 = 134 = 10000110 in binary. For the exponent −4, the biased exponent is −4 + 127 = 123 = 01111011 in binary.
For example, 2 * f might simply increment the exponent of f by 1, saving some cycles.
This simply isn't true.
First, you have too many corner cases: zero, infinity, NaN, and denormals. Then you have the performance issue.
The misconception here is that incrementing the exponent would be faster than doing a multiplication; it is not.
If you look at the hardware instructions, there is no direct way to increment the exponent. So what you would need to do instead is:
1. Move the value from the floating-point unit to the integer unit.
2. Add 1 to the exponent field using integer arithmetic.
3. Move the value back to the floating-point unit.
There is generally a medium to large latency for moving data between the integer and floating-point execution units. So in the end, this "optimization" becomes much worse than a simple floating-point multiply.
So the reason why the compiler doesn't do this "optimization" is because it isn't any faster.
On modern CPUs, multiplication typically has one-per-cycle throughput and low latency. If the value is already in a floating point register, there's no way you'll beat that by juggling it around to do integer arithmetic on the representation. If it's in memory to begin with, and if you're assuming neither the current value nor the correct result would be zero, denormal, nan, or infinity, then it might be faster to perform something like
addl $0x100000, 4(%eax) # x86 asm example
to multiply by two; the only time I could see this being beneficial is if you're operating on a whole array of floating-point data that's bounded away from zero and infinity, and scaling by a power of two is the only operation you'll be performing (so you don't have any existing reason to be loading the data into floating point registers).