Pre-calculating in gcc-4.8 (C++11)

Tags: c++, gcc, c++11

When I tested this code with the older gcc-4.4.0 and gcc-4.6.4, the compiler applied a smart optimization and pre-calculated the result for constant inputs:

#include <iostream>
#include <chrono>
using namespace std;

const auto N = 1000000000ULL;  // constexpr was tested too, no effect

unsigned long long s(unsigned long long n)
{
    auto s = 0ULL;
    for (auto i = 0ULL; i < n; i++)
        s += i;
    return s;
}

int main()
{
    auto t1 = std::chrono::high_resolution_clock::now();

    auto x = s(N);

    auto t2 = std::chrono::high_resolution_clock::now();
    auto t = std::chrono::duration_cast<std::chrono::nanoseconds>(t2-t1).count();
    cout << "Result: " << x << " -- time (ms): " << t / 1e6 << endl;
}

N is a constant, so the compiler can evaluate the function s at compile time and assign the result to x. (No run-time calculation is needed for N.)

Results with different versions of gcc (and one version of clang):

The latest clang (clang-3.4), pre-calculated: 0.001532 ms.
The old gcc-4.4.0, pre-calculated: 0.013517 ms.
The old gcc-4.6.4, pre-calculated: 0.001 ms.
The newer gcc-4.8.0+ does not calculate it at compile time: 1313.78 ms!

Question:

Is this optimization omitted in 4.8.1? Why?
Is there any compiler command/switch to enable it (if it's disabled by default)?
~~If it's omitted, how can I force the compiler to do this pre-calculation?~~

Note(1): I tested both the -O2 and -O3 switches; neither had any effect.

Note(2): By "forcing", I mean using the compiler's commands and switches.

Note(3): Function s is just an example; it can be replaced by more complicated functions.

asked Oct 13 '13 20:10

4 Answers

I've submitted it as a bug. Yes, it's a regression in version 4.8, and it was fixed in newer revisions 5 weeks ago. Follow it here:

Bug 58717 - [4.8 Regression] SCEV final value replacement no longer triggers
Bug 57511 - [4.8 Regression] Missing SCEV final value replacement
Fixed in revision 202168

answered Oct 02 '22 20:10

The C++11 way to deal with computations at compile time is to use constexpr. Sadly, constexpr functions are somewhat limited in what they can do. In C++11, the body of a constexpr function may contain only empty statements, static_assert() declarations, typedefs, using declarations/directives, and exactly one return statement (I got temporarily confused because I was looking at the C++14 draft, which has the rules relaxed). That is, you'd need to formulate your function recursively. On the plus side, if a constexpr function is called with constant-expression arguments, it can be evaluated at compile time, and it must be when the result is needed in a constant expression.

Other than that, you might want to help out the compiler with its constant folding. For example, it could help to

make the function s() an inline function.
declare N as constexpr unsigned long long N = 1000000000ULL;
make sure you use a suitable optimization level.

answered Oct 02 '22 20:10

Dietmar Kühl

Is this optimization omitted in 4.8.1?

It looks like it is gone. It is still present in 4.7.2 though.

Why? [From one of your comments:] I think that optimization was excellent and doesn't hurt anything.

It is most likely accidental and the gcc developers don't know about it.

I can think of a good reason why I would want to at least put an upper bound on this optimization. I got bitten by MSVC back in 2009: when I gave it machine-generated C code, the compiler tried to optimize it and struggled with it for minutes. Obviously, it was desperately attempting some optimization that should have been limited in some way, so that the compiler wouldn't struggle for minutes over a 7KB source file. My point is: you may want to limit optimizations that can potentially increase your compile times too much.

However, that doesn't seem to be the case here. I have tried it with fairly small values of N, and the optimization is not performed then either.

If it's omitted, how can I force the compiler to do this pre-calculation?
Note(2): Forcing, I mean compiler's commands and switches

I couldn't trick gcc 4.8.1 into doing this optimization. I will submit a bug report unless somebody says soon that it is a known issue or that it can be enabled with some compiler flag.

answered Oct 02 '22 21:10

Ali
