Is There a Reason Standard Algorithms Take Lambdas by Value? [duplicate]

I asked a question here, "Lambda Works on Latest Visual Studio, but Doesn't Work Elsewhere", and was told that my code was implementation defined, since paragraph 10 of the standard's 25.1 [algorithms.general] says:

Unless otherwise specified, algorithms that take function objects as arguments are permitted to copy those function objects freely. Programmers for whom object identity is important should consider using a wrapper class that points to a noncopied implementation object such as reference_wrapper<T>

I'd just like to know why this is the case. We're told our whole lives to take objects by reference, so why does the standard take function objects by value and, even worse in my linked question, make copies of those objects? Is there some advantage to doing it this way that I don't understand?

222

asked Dec 12 '16 19:12

Jonathan Mee

1 Answer

The standard library assumes function objects and iterators are essentially free to copy.

std::ref provides a method to turn a function object into a pseudo-reference with a compatible operator() that uses reference instead of value semantics. So nothing of large value is lost.
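As a minimal sketch of that point (the CallCounter type and the two helper functions are hypothetical names made up for illustration), compare handing a stateful function object to std::for_each directly with handing it over wrapped in std::ref:

```cpp
#include <algorithm>
#include <functional>
#include <vector>

// Hypothetical stateful function object: counts how often it is invoked.
struct CallCounter {
    int calls = 0;
    void operator()(int) { ++calls; }
};

// The algorithm receives a copy of `counter`, so every increment
// happens on that copy and the caller's object is left untouched.
int count_by_value(const std::vector<int>& v) {
    CallCounter counter;
    std::for_each(v.begin(), v.end(), counter);
    return counter.calls;
}

// The algorithm receives a std::reference_wrapper. Copies of the wrapper
// all refer to the caller's `counter`, so the increments land there.
int count_by_ref(const std::vector<int>& v) {
    CallCounter counter;
    std::for_each(v.begin(), v.end(), std::ref(counter));
    return counter.calls;
}
```

For a three-element vector, count_by_value returns 0 while count_by_ref returns 3: the identity of the function object is preserved only when you explicitly ask for it.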

If you have been taught all your life to take objects by reference, reconsider. Unless there is a good reason otherwise, take objects by value. Reasoning about values is far easier; references are pointers into any state anywhere in your program.

The conventional use of a reference, as a pointer to a local object that no other active reference refers to in the context where it is used, is not something that either a reader of your code or the compiler can presume. If you reason about references this way, they don't add a ridiculous amount of complexity to your code.

But if you reason about them that way, you are going to have bugs when your assumption is violated, and they will be subtle, gross, unexpected, and horrible.

A classic example is the number of operator= that break when this and the argument refer to the same object. But any function that takes two references or pointers of the same type has the same issue.
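A small sketch of the two-reference problem, using hypothetical helpers add_twice_ref and add_twice_val that are not from the original answer. Both are meant to add the same amount to a target twice; only the by-value version still does so when the caller passes the same object for both parameters:

```cpp
// Reads `amount` through a reference. If `amount` aliases `target`,
// the first += also changes what the second += reads.
void add_twice_ref(int& target, const int& amount) {
    target += amount;
    target += amount;
}

// Takes `amount` by value. The copy is made before `target` is modified,
// so aliasing at the call site cannot affect it.
void add_twice_val(int& target, int amount) {
    target += amount;
    target += amount;
}
```

Starting from x == 1, add_twice_ref(x, x) leaves x at 4 (1 + 1, then 2 + 2), while add_twice_val(x, x) leaves it at 3. Nothing in the reference version's signature warns the reader, or the compiler, about the difference.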

But even one reference can break your code. Let's look at sort. In pseudo-code:

void sort( Iterator start, Iterator end, Ordering order )

Now, let's make Ordering a reference:

void sort( Iterator start, Iterator end, Ordering const& order )

How about this one?

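// needs <functional> and <utility>
// Each ordering swaps itself with the other one whenever it is invoked.
std::function< bool(int, int) > alice;
std::function< bool(int, int) > bob;
alice = [&]( int x, int y ) { std::swap(alice, bob); return x<y; };
bob = [&]( int x, int y ) { std::swap(alice, bob); return x>y; };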

Now, call sort( begin(vector), end(vector), alice ).

Every time order is invoked, the referred-to order object swaps meaning. Now this is pretty ridiculous, but when you took Ordering by const&, the optimizer had to take that possibility into account and rule it out on every invocation of your ordering code!

You wouldn't do the above (and in fact this particular implementation is UB, as it would violate any reasonable requirements of std::sort); but the compiler has to prove you didn't do something "like that" (change the code in ordering) every time it follows order or invokes it! That means constantly reloading the state of order, or inlining and proving you did no such insanity.

Pulling off the same trick when the function object is taken by value is an order of magnitude harder (and basically requires something like std::ref). The optimizer has a function object, it is local, and its state is local. Anything stored within it is local, and the compiler and optimizer know exactly who can legally modify it.

Every function you write that takes a const& and ever leaves its "local scope" (say, by calling a C library function) cannot assume the state behind the const& stayed the same when control returns. It must reload the data from wherever the pointer points.
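The reload requirement can be made visible with a small sketch. Everything named here (g_limit, bump_limit, Callback and the two sum_around_call functions) is hypothetical, and the callback simply stands in for any call the compiler cannot see through:

```cpp
// Stand-in for an opaque call, such as a C library function.
using Callback = void (*)();

int g_limit = 10;

// Modifies the global that the caller may have passed by reference.
void bump_limit() { g_limit += 5; }

// `limit` may refer to memory that cb() changes, so the second read
// has to go back to memory and can observe a different value.
int sum_around_call_ref(const int& limit, Callback cb) {
    int before = limit;
    cb();
    return before + limit;
}

// `limit` is a local copy that cb() cannot reach, so both reads
// are guaranteed to see the same value.
int sum_around_call_val(int limit, Callback cb) {
    int before = limit;
    cb();
    return before + limit;
}
```

With g_limit at 10, sum_around_call_ref(g_limit, bump_limit) returns 25, because the second read sees 15. After resetting g_limit to 10, sum_around_call_val(g_limit, bump_limit) returns 20. The const in const& only stops the function from writing through the reference; it promises nothing about anyone else.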

Now, I did say pass by value unless there is a good reason, and there are many good reasons. Your type being very expensive to move or copy is a great one. So is writing data to the object, or actually wanting to see it change each time you read it.

But the default behavior should be pass-by-value. Only move to references if you have a good reason, because the costs are distributed and hard to pin down.

101

answered Nov 15 '22 11:11

Yakk - Adam Nevraumont

Tags: c++, pass-by-reference, pass-by-value, algorithm, lambda
