Here's some code (full program follows later in the question):
template <typename T>
T fizzbuzz(T n) {
    T count(0);
#if CONST
    const T div(3);
#else
    T div(3);
#endif
    for (T i(0); i <= n; ++i) {
        if (i % div == T(0)) count += i;
    }
    return count;
}
Now, if I call this template function with int, then I get a factor of 6 performance difference depending on whether I define CONST or not:
$ gcc --version
gcc (GCC) 3.4.4 (cygming special, gdc 0.12, using dmd 0.125)
$ make -B wrappedint CPPFLAGS="-O3 -Wall -Werror -DWRAP=0 -DCONST=0" &&
time ./wrappedint
g++ -O3 -Wall -Werror -DWRAP=0 -DCONST=0 wrappedint.cpp -o wrappedint
484573652
real 0m2.543s
user 0m2.059s
sys 0m0.046s
$ make -B wrappedint CPPFLAGS="-O3 -Wall -Werror -DWRAP=0 -DCONST=1" &&
time ./wrappedint
g++ -O3 -Wall -Werror -DWRAP=0 -DCONST=1 wrappedint.cpp -o wrappedint
484573652
real 0m0.655s
user 0m0.327s
sys 0m0.046s
Examining the disassembly shows that in the fast (const) case the modulo has been turned into a multiply-and-shift sequence, whereas in the slow (non-const) case it uses idivl.
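For illustration only (this is not part of the original program), the "multiply and shift" trick looks roughly like the following sketch. The constant and shift are the standard ones for unsigned 32-bit division by 3; the signed sequence the compiler actually emits has an extra adjustment for negative values.

#include <cassert>
#include <stdint.h>

// Roughly what the strength reduction amounts to: for unsigned 32-bit x,
// x / 3 equals a 64-bit multiply by the magic constant 0xAAAAAAAB followed
// by a shift, and the remainder is then recovered without any idivl.
inline uint32_t div3(uint32_t x) { return (uint32_t)((x * 0xAAAAAAABULL) >> 33); }
inline uint32_t mod3(uint32_t x) { return x - 3 * div3(x); }

int main() {
    for (uint32_t x = 0; x < 100000; ++x)
        assert(mod3(x) == x % 3);   // spot-check the identity
}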
Even worse, if I try to wrap my integer in a class, the optimisation doesn't happen whether I use const or not. The code always uses idivl and runs slowly:
#include <iostream>

struct WrappedInt {
    int v;
    explicit WrappedInt(const int &val) : v(val) {}
    bool operator<=(const WrappedInt &rhs) const { return v <= rhs.v; }
    bool operator==(const WrappedInt &rhs) const { return v == rhs.v; }
    WrappedInt &operator++() { ++v; return *this; }
    WrappedInt &operator+=(const WrappedInt &rhs) { v += rhs.v; return *this; }
    WrappedInt operator%(const WrappedInt &rhs) const { return WrappedInt(v % rhs.v); }
};

std::ostream &operator<<(std::ostream &s, WrappedInt w) {
    return s << w.v;
}
template <typename T>
T fizzbuzz(T n) {
    T count(0);
#if CONST
    const T div(3);
#else
    T div(3);
#endif
    for (T i(0); i <= n; ++i) {
        if (i % div == T(0)) count += i;
    }
    return count;
}
int main() {
#if WRAP
    WrappedInt w(123456789);
    std::cout << fizzbuzz(w) << "\n";
#else
    std::cout << fizzbuzz<int>(123456789) << "\n";
#endif
}
My questions are:
1) Is there a simple principle of C++ itself, or gcc's optimisation, which explains why this happens, or is it just a case of "various heuristics run, this is the code you get"?
2) Is there any way to make the compiler realise that my locally-declared and never-referenced const WrappedInt can be treated as a compile-time const value? I want this thing to be a straight replacement for int in templates.
3) Is there a known way of wrapping an int such that the compiler can discard the wrapping when optimising? The goal is that WrappedInt will be a policy-based template. But if a "do-nothing" policy results in essentially arbitrary 6x speed penalties over int, I'm better off special-casing that situation and using int directly.
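For context, a minimal sketch of the kind of "do-nothing policy" wrapper question 3 describes might look like this (the names are purely illustrative, not from the original code); ideally the NullPolicy instantiation would compile down to plain int arithmetic:

struct NullPolicy {
    static void check(int) {}   // a real policy might range-check or log here
};

template <typename Policy>
struct PolicyInt {
    int v;
    explicit PolicyInt(int val) : v(val) { Policy::check(v); }
    bool operator<=(const PolicyInt &rhs) const { return v <= rhs.v; }
    bool operator==(const PolicyInt &rhs) const { return v == rhs.v; }
    PolicyInt &operator++() { ++v; Policy::check(v); return *this; }
    PolicyInt &operator+=(const PolicyInt &rhs) { v += rhs.v; Policy::check(v); return *this; }
    PolicyInt operator%(const PolicyInt &rhs) const { return PolicyInt(v % rhs.v); }
};

// e.g. fizzbuzz(PolicyInt<NullPolicy>(123456789)); printing would additionally
// need an operator<< like the WrappedInt one above.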
The -O option to gcc controls compiler optimisation. At -O0 (the default, equivalent to not passing -O at all), no optimisation is performed; this keeps compilation fast and debugging predictable. At -O2, GCC performs nearly all supported optimisations that do not involve a space-speed tradeoff, increasing both compilation time and the performance of the generated code. So yes, with optimisation enabled, compilers generate very good code for simple calculations like this.
I'm guessing it's just the severely old GCC version you are running. The oldest compiler I have on my machine, gcc 4.1.2, compiles both the non-const and the wrapped versions the fast way (and does so at only -O1).
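If you are stuck on an older compiler, one possible workaround sketch (not from the answer above, and untested on anything as old as gcc 3.4.4) is to bake the divisor into a non-type template parameter, so it is a compile-time constant by construction rather than a local the optimiser has to track:

// Hypothetical variant, not from the original post.
template <typename T, int Div>
T fizzbuzz_fixed(T n) {
    T count(0);
    for (T i(0); i <= n; ++i) {
        if (i % T(Div) == T(0)) count += i;
    }
    return count;
}

// usage: std::cout << fizzbuzz_fixed<int, 3>(123456789) << "\n";

Whether the WrappedInt instantiation then gets strength-reduced still depends on the compiler inlining the wrapped operator% down to a plain v % 3.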