Let's say I have a class that's something like this: <pre class="prettyprint"><code>class View { public: View(DataContainer &c) : _c(c) { } inline Elem getElemForCoords(double x, double y) { int idx = /* some computation here... */; return _c.data[idx]; } private: DataContainer& _c; }; </code></pre> If I have a function using this class, is the compiler allowed to optimize it away entirely and just inline the data access? Is the same still true if View::_c happens to be a std::shared_ptr?

<blockquote> If I have a function using this class, is the compiler allowed to optimize it away entirely and just inline the data access? Is the same still true if View::_c happens to be a std::shared_ptr? </blockquote> Absolutely, yes, and yes; as long as it doesn't violate the as-if rule (as already pointed out by Pentadecagon). Whether this optimization really happens is a much more interesting question; it is allowed by the standard. For this code: <pre class="prettyprint"><code>#include <memory> #include <vector> template <class DataContainer> class View { public: View(DataContainer& c) : c(c) { } int getElemForCoords(double x, double y) { int idx = x*y; // some dumb computation return c->at(idx); } private: DataContainer& c; }; template <class DataContainer> View<DataContainer> make_view(DataContainer& c) { return View<DataContainer>(c); } int main(int argc, char* argv[]) { auto ptr2vec = std::make_shared<std::vector<int>>(2); auto view = make_view(ptr2vec); return view.getElemForCoords(1, argc); } </code></pre> I have verified, by inspecting the assembly code (<code>g++ -std=c++11 -O3 -S -fwhole-program optaway.cpp</code>), that the <code>View</code> class is like it is not there, it adds zero overhead. <hr> Some unsolicited advice. <ul> <li>Inspect the assembly code of your programs; you will learn a lot and start worrying about the right things. <code>shared_ptr</code> is a heavy-weight object (compared to, for example, <code>unique_ptr</code>), partly because of all that multi-threading machinery under the hood. If you look at the assembly code, you will worry much more about the overhead of the shared pointer and less about element access. ;)</li> <li>The <code>inline</code> in your code is just noise, that function is implicitly inline anyway. Please don't trash your code with the inline keyword; the optimizer is free to treat it as whitespace anyway. Use link time optimization instead (<code>-flto</code> with gcc). GCC and Clang are surprisingly smart compilers and generate good code.</li> <li>Profile your code instead of guessing and doing premature optimization. Perf is a great tool. </li> <li>Want speed? Measure. (by Howard Hinnant)</li> </ul>

Can C++ compilers optimize away a class?

Tags:

c++

compiler-optimization

c++11

Let's say I have a class that's something like this:

class View
{
public:
    View(DataContainer &c)
        : _c(c)
    {
    }

    inline Elem getElemForCoords(double x, double y)
    {
        int idx = /* some computation here... */;
        return _c.data[idx];
    }

private:
    DataContainer& _c;
};

If I have a function using this class, is the compiler allowed to optimize it away entirely and just inline the data access?

Is the same still true if View::_c happens to be a std::shared_ptr?

723

asked Apr 15 '14 19:04

Siegfried Gevatter

1 Answers

If I have a function using this class, is the compiler allowed to optimize it away entirely and just inline the data access?

Is the same still true if View::_c happens to be a std::shared_ptr?

Absolutely, yes, and yes; as long as it doesn't violate the as-if rule (as already pointed out by Pentadecagon). Whether this optimization really happens is a much more interesting question; it is allowed by the standard. For this code:

#include <memory>
#include <vector>

template <class DataContainer>
class View {
public:
    View(DataContainer& c) : c(c) { }

    int getElemForCoords(double x, double y) {
        int idx = x*y; // some dumb computation
        return c->at(idx);
    }
private:
    DataContainer& c;
};

template <class DataContainer>
View<DataContainer> make_view(DataContainer& c) {
  return View<DataContainer>(c);
}

int main(int argc, char* argv[]) {

  auto ptr2vec = std::make_shared<std::vector<int>>(2);

  auto view = make_view(ptr2vec);

  return view.getElemForCoords(1, argc);
}

I have verified, by inspecting the assembly code (g++ -std=c++11 -O3 -S -fwhole-program optaway.cpp), that the View class is like it is not there, it adds zero overhead.

Some unsolicited advice.

Inspect the assembly code of your programs; you will learn a lot and start worrying about the right things. shared_ptr is a heavy-weight object (compared to, for example, unique_ptr), partly because of all that multi-threading machinery under the hood. If you look at the assembly code, you will worry much more about the overhead of the shared pointer and less about element access. ;)
The inline in your code is just noise, that function is implicitly inline anyway. Please don't trash your code with the inline keyword; the optimizer is free to treat it as whitespace anyway. Use link time optimization instead (-flto with gcc). GCC and Clang are surprisingly smart compilers and generate good code.
Profile your code instead of guessing and doing premature optimization. Perf is a great tool.
Want speed? Measure. (by Howard Hinnant)

104

answered Oct 08 '22 07:10

Ali

Related questions
                            
                                global variable in a .so library
                            
                                How to construct Voronoi diagram on the sphere with CGAL easily?
                            
                                C++ Console Application1.exe has triggered a breakpoint
                            
                                Do I need to make multiple executables for targeting different instruction sets?
                            
                                Why is boost::enable_shared_from_raw so undocumented?
                            
                                Simplify (a + b) XOR (c + b) [closed]
                            
                                const reference is bad C++ 11
                            
                                How to debug nodejs addon by gdb
                            
                                Are GLSL compilers well optimized
                            
                                g++ -no optimization- skips asm code after goto
                            
                                Set CXXFLAGS in Rcpp Makevars
                            
                                Replicating GLM::perspective in code
                            
                                Interoperability between Boost and C++11
                            
                                What does operand like 32i64 mean?
                            
                                How to call a stored procedure with a parameter of type table
                            
                                DFS in boost::graph with changing the graphs content
                            
                                Set insert doing a weird number of comparisons
                            
                                Why the template version is chosen below by the compiler?
                            
                                def __init__(self, *args, **kwargs) initialization of class in python
                            
                                Is there a simulator/emulator of Xeon Phi?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With