The return value of a function is usually stored on the stack or in a register. But for a large structure, it has to be on the stack. How much copying has to happen in a real compiler for this code? Or is it optimized away? For example: <pre class="prettyprint"><code>struct Data { unsigned values[256]; }; Data createData() { Data data; // initialize data values... return data; } </code></pre> (Assuming the function cannot be inlined..)

<blockquote> But for a large structure, it has to be on the <strike>heap</strike> stack. </blockquote> Indeed so! A large structure declared as a local variable is allocated on the stack. Glad to have that cleared up. As for avoiding copying, as others have noted: <ul> <li>Most calling conventions deal with "function returning struct" by passing an additional parameter that points the location in the caller's stack frame in which the struct should be placed. This is definitely a matter for the calling convention and not the language.</li> <li>With this calling convention, it becomes possible for even a relatively simple compiler to notice when a code path is definitely going to return a struct, and for it to fix assignments to that struct's members so that they go directly into the caller's frame and don't have to be copied. The key is for the compiler to notice that all terminating code paths through the function return the same struct variable. If that's the case, the compiler can safely use the space in the caller's frame, eliminating the need for a copy at the point of return.</li> </ul>

How do C compilers implement functions that return large structures?

Tags:

c

compiler-optimization

abi

compiler-theory

calling-convention

The return value of a function is usually stored on the stack or in a register. But for a large structure, it has to be on the stack. How much copying has to happen in a real compiler for this code? Or is it optimized away?

For example:

struct Data {     unsigned values[256]; };  Data createData()  {     Data data;     // initialize data values...     return data; }

(Assuming the function cannot be inlined..)

585

asked Jan 28 '10 15:01

Steve Hanov

2 Answers

None; no copies are done.

The address of the caller's Data return value is actually passed as a hidden argument to the function, and the createData function simply writes into the caller's stack frame.

This is known as the named return value optimisation. Also see the c++ faq on this topic.

commercial-grade C++ compilers implement return-by-value in a way that lets them eliminate the overhead, at least in simple cases

...

When yourCode() calls rbv(), the compiler secretly passes a pointer to the location where rbv() is supposed to construct the "returned" object.

You can demonstrate that this has been done by adding a destructor with a printf to your struct. The destructor should only be called once if this return-by-value optimisation is in operation, otherwise twice.

Also you can check the assembly to see that this happens:

Data createData()  {     Data data;     // initialize data values...     data.values[5] = 6;     return data; }

here's the assembly:

__Z10createDatav: LFB2:         pushl   %ebp LCFI0:         movl    %esp, %ebp LCFI1:         subl    $1032, %esp LCFI2:         movl    8(%ebp), %eax         movl    $6, 20(%eax)         leave         ret     $4 LFE2:

Curiously, it allocated enough space on the stack for the data item subl $1032, %esp, but note that it takes the first argument on the stack 8(%ebp) as the base address of the object, and then initialises element 6 of that item. Since we didn't specify any arguments to createData, this is curious until you realise this is the secret hidden pointer to the parent's version of Data.

answered Oct 17 '22 07:10

Alex Brown

But for a large structure, it has to be on the ~~heap~~ stack.

Indeed so! A large structure declared as a local variable is allocated on the stack. Glad to have that cleared up.

As for avoiding copying, as others have noted:

Most calling conventions deal with "function returning struct" by passing an additional parameter that points the location in the caller's stack frame in which the struct should be placed. This is definitely a matter for the calling convention and not the language.
With this calling convention, it becomes possible for even a relatively simple compiler to notice when a code path is definitely going to return a struct, and for it to fix assignments to that struct's members so that they go directly into the caller's frame and don't have to be copied. The key is for the compiler to notice that all terminating code paths through the function return the same struct variable. If that's the case, the compiler can safely use the space in the caller's frame, eliminating the need for a copy at the point of return.

answered Oct 17 '22 07:10

Norman Ramsey

Related questions
                            
                                What c lib to use when I need to parse a simple config file under linux? [closed]
                            
                                Why do I have to explicitly link with libm? [duplicate]
                            
                                How to use list from sys/queue.h?
                            
                                Replacing extrordinarily slow pow() function
                            
                                What does "@(#)" in comments mean?
                            
                                What is the difference between functions in math and functions in programming?
                            
                                Extracting precise frequencies from FFT Bins using phase change between frames
                            
                                Why does gcc -Wall give warning about zero-length format string?
                            
                                Inter-operability of Swift arrays with C?
                            
                                Void ** a generic pointer?
                            
                                how is select() alerted to an fd becoming "ready"?
                            
                                UTF-8 in Windows
                            
                                Unit testing patterns for microcontroller C code
                            
                                CMake cross-compiling: C flags from toolchain file ignored
                            
                                Do I have the guarantee that sizeof(type) == sizeof(unsigned type)?
                            
                                Why does adding 0 to the end of float literal change how it rounds (possible GCC bug)?
                            
                                UDP checksum calculation
                            
                                Signal handling in pthreads
                            
                                How `realloc` work actually in the background?
                            
                                Jump Table Switch Case question

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With