return by value inline functions

Tags: c++, optimization, memory, inline

I'm implementing some math types and I want to optimize the operators to minimize the amount of memory created, destroyed, and copied. To demonstrate I'll show you part of my Quaternion implementation.

class Quaternion
{
public:
    double w,x,y,z;

    ...

    Quaternion operator+(const Quaternion &other) const;
};

I want to know how the following two implementations differ from each other. I do have a += implementation that operates in place, so no new object is created, but for some higher-level operations on quaternions it is useful to use + rather than +=.

__forceinline Quaternion Quaternion::operator+( const Quaternion &other ) const
{
    return Quaternion(w+other.w,x+other.x,y+other.y,z+other.z);
}

and

__forceinline Quaternion Quaternion::operator+( const Quaternion &other ) const
{
    Quaternion q(w+other.w,x+other.x,y+other.y,z+other.z);
    return q;
}

My C++ is completely self-taught, so when it comes to some optimizations I'm unsure what to do, because I do not know exactly how the compiler handles these things. Also, how do these mechanics translate to non-inline implementations?

Any other criticisms of my code are welcome.

asked Aug 21 '09 19:08

Mark

3 Answers

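Your first example allows the compiler to potentially use something called "Return Value Optimization" (RVO).

The second example allows the compiler to potentially use something called "Named Return Value Optimization" (NRVO). These two optimizations are closely related: in both cases the compiler constructs the result directly in the storage the caller provides for the return value, instead of building it locally and copying it out.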

Some details of Microsoft's implementation of NRVO can be found here:

http://msdn.microsoft.com/en-us/library/ms364057.aspx

Note that the article indicates that NRVO support started with VS 2005 (MSVC 8.0). It doesn't specifically say whether the same applies to RVO, but I believe that MSVC applied RVO before version 8.0.

This article about Move Constructors by Andrei Alexandrescu has good information about how RVO works (and when and why compilers might not use it).

Including this bit:

you'll be disappointed to hear that each compiler, and often each compiler version, has its own rules for detecting and applying RVO. Some apply RVO only to functions returning unnamed temporaries (the simplest form of RVO). The more sophisticated ones also apply RVO when there's a named result that the function returns (the so-called Named RVO, or NRVO).

In essence, when writing code, you can count on RVO being portably applied to your code depending on how you exactly write the code (under a very fluid definition of "exactly"), the phase of the moon, and the size of your shoes.

The article was written in 2003 and compilers should be much improved by now; hopefully, the phase of the moon is less important to when the compiler might use RVO/NRVO (maybe it's down to day-of-the-week). As noted above it appears that MS didn't implement NRVO until 2005. Maybe that's when someone working on the compiler at Microsoft got a new pair of more comfortable shoes a half-size larger than before.

Your examples are simple enough that I'd expect both to generate equivalent code with more recent compiler versions.
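If you want to check this on your own compiler rather than take my word for it, one option is to count copies. Below is a minimal sketch, assuming a stand-in type `Quat` (hypothetical, not your actual class) whose copy and move constructors bump a counter; `add_unnamed`, `add_named`, and `count_copies` are likewise names made up for the illustration. As far as I know, C++17 and later guarantee that the unnamed-temporary form performs no copy at all, while the named form is still left to the compiler, so the numbers you see may vary with compiler and flags.

```cpp
#include <cassert>

// Hypothetical stand-in for the question's Quaternion. The counter exists
// only for this illustration, to make copies and moves observable.
struct Quat {
    double w, x, y, z;
    static int copies;  // bumped by both the copy and the move constructor

    Quat(double w_, double x_, double y_, double z_)
        : w(w_), x(x_), y(y_), z(z_) {}
    Quat(const Quat& o) : w(o.w), x(o.x), y(o.y), z(o.z) { ++copies; }
    Quat(Quat&& o) noexcept : w(o.w), x(o.x), y(o.y), z(o.z) { ++copies; }
};
int Quat::copies = 0;

// Returns an unnamed temporary: a candidate for RVO.
Quat add_unnamed(const Quat& a, const Quat& b) {
    return Quat(a.w + b.w, a.x + b.x, a.y + b.y, a.z + b.z);
}

// Returns a named local: a candidate for NRVO.
Quat add_named(const Quat& a, const Quat& b) {
    Quat q(a.w + b.w, a.x + b.x, a.y + b.y, a.z + b.z);
    return q;
}

// Reports how many copy/move constructions a single call to f triggers.
template <typename F>
int count_copies(F f) {
    const Quat a(1, 2, 3, 4), b(5, 6, 7, 8);
    Quat::copies = 0;
    Quat r = f(a, b);
    assert(r.w == 6 && r.z == 12);  // sanity check on the sum itself
    return Quat::copies;
}
```

Calling `count_copies(add_unnamed)` and `count_copies(add_named)` at different optimization levels shows whether your compiler treats the two forms differently.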

answered Sep 20 '22 03:09

Michael Burr

Between the two implementations you presented, there really is no difference. Any compiler doing any sort of optimizations whatsoever will optimize your local variable out.

As for the += operator, a slightly more involved discussion about whether or not you want your Quaternions to be immutable objects is probably required... I would always lean towards creating objects like this as immutable objects. (But then again, I'm more of a managed coder as well.)
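If you do decide to keep both operators, a common C++ idiom is to write + in terms of += so the arithmetic lives in one place. This is only a sketch of that idiom, assuming a simplified `Quat` aggregate (a hypothetical name, not the class from the question); it is not a claim about how the original code is or should be structured.

```cpp
// Hypothetical simplified quaternion: a plain aggregate with no constructors.
struct Quat {
    double w, x, y, z;

    // In-place addition: modifies *this and creates no new object.
    Quat& operator+=(const Quat& o) {
        w += o.w; x += o.x; y += o.y; z += o.z;
        return *this;
    }
};

// Non-member + built on +=. The left operand is taken by value, so the
// copy happens once at the call site and the operands stay untouched.
inline Quat operator+(Quat lhs, const Quat& rhs) {
    lhs += rhs;
    return lhs;
}
```

With this layout, `a + b` leaves `a` and `b` unchanged, which gives value-like behaviour at the call site while `+=` stays available where mutation is wanted.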

answered Sep 17 '22 03:09

LorenVS

If these two implementations do not generate exactly the same assembly code when optimization is turned on, you should consider using a different compiler. :) And I don't think it matters whether or not the function is inlined.

By the way, be aware that __forceinline is a Microsoft-specific keyword and therefore not portable. I would just use plain old standard inline and let the compiler decide.
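If you really do want to force inlining on more than one compiler, the usual workaround is a small macro. Here is a sketch, assuming the macro name `QUAT_FORCE_INLINE` and the helper `twice` (both made up for this example); it uses `__forceinline` on MSVC, the `always_inline` attribute on GCC and Clang, and falls back to plain `inline` elsewhere.

```cpp
// Hypothetical macro name; pick whatever fits your code base.
#if defined(_MSC_VER)
#  define QUAT_FORCE_INLINE __forceinline
#elif defined(__GNUC__) || defined(__clang__)
#  define QUAT_FORCE_INLINE inline __attribute__((always_inline))
#else
#  define QUAT_FORCE_INLINE inline  // standard fallback: only a hint
#endif

// Example use on a trivial helper function.
QUAT_FORCE_INLINE double twice(double v) {
    return v + v;
}
```

Even with such a macro, I would measure before relying on it; for a function this small, optimizing compilers will usually inline it on their own.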

answered Sep 17 '22 03:09

Dima
