I am asking this question in the context of the C language, though it applies really to any language supporting pointers or pass-by-reference functionality. I come from a Java background, but have written enough low-level code (C and C++) to have observed this interesting phenomenon. Supposing we have some object X (not using "object" here in the strictest OOP sense of the word) that we want to fill with information by way of some other function, it seems there are two approaches to doing so: <ol> <li> Returning an instance of that object's type and assigning it, e.g. if X has type T, then we would have: <code>T func(){...}</code> <code>X = func();</code> </li> <li> Passing in a pointer / reference to the object and modifying it inside the function, and returning either <code>void</code> or some other value (in C, for instance, a lot of functions return an <code>int</code> corresponding to the success/failure of the operation). An example of this here is: <code>int func(T* x){...x = 1;...}</code> <code>func(&X);</code> </li> </ol> My question is: in what situations makes one method better than the other? Are they equivalent approaches to accomplishing the same outcome? What are the restrictions of each? Thanks!

There is a reason that you should always consider using the second method, rather than the first. If you look at the return values for the entirety of the C standard library, you'll notice that there's almost always an element of error handling involved in them. For example, you have to check the return value of the following functions before you assume they've succeeded: <ul> <li> <code>calloc</code>, <code>malloc</code> and <code>realloc</code> </li> <li><code>getchar</code></li> <li> <code>fopen</code> </li> <li> <code>scanf</code> and family</li> <li><code>strtok</code></li> </ul> There are other non-standard functions that follow this pattern: <ul> <li> <code>pthread_create</code>, etc.</li> <li> <code>socket</code>, <code>connect</code>, etc.</li> <li> <code>open</code>, <code>read</code>, <code>write</code>, etc.</li> </ul> Generally speaking, a return value conveys a number of items successfully read/written/converted or a flat-out boolean success/fail value, and in practice you'll almost always need such a return value, unless you're going to <code>exit(EXIT_FAILURE);</code> at any errors (in which case I would rather not use your modules, because they give me no opportunity to clean up within my own code). There are functions that don't use this pattern in the standard C library, because they use no resources (e.g. allocations or files) and so there's no chance of any error. If your function is a basic translation function (e.g. like <code>toupper</code>, <code>tolower</code> and friends which translate single character values), for example, then you don't need a return value for error handling because there are no errors. I think you'll find this scenario quite rare indeed, but if that is your scenario, by all means use the first option! In summary, you should always highly consider using option 2, reserving the return value for a similar use, for the sake of consistent with the rest of the world, and because you might later decide that you need the return value for communicating errors or number of items processed.

Two approaches to writing functions

Tags:

c

function

pointers

I am asking this question in the context of the C language, though it applies really to any language supporting pointers or pass-by-reference functionality.

I come from a Java background, but have written enough low-level code (C and C++) to have observed this interesting phenomenon. Supposing we have some object X (not using "object" here in the strictest OOP sense of the word) that we want to fill with information by way of some other function, it seems there are two approaches to doing so:

Returning an instance of that object's type and assigning it, e.g. if X has type T, then we would have:
T func(){...}

X = func();
Passing in a pointer / reference to the object and modifying it inside the function, and returning either void or some other value (in C, for instance, a lot of functions return an int corresponding to the success/failure of the operation). An example of this here is:

int func(T* x){...x = 1;...}

func(&X);

My question is: in what situations makes one method better than the other? Are they equivalent approaches to accomplishing the same outcome? What are the restrictions of each?

Thanks!

987

asked Aug 15 '15 04:08

Muhammad Khan

2 Answers

There is a reason that you should always consider using the second method, rather than the first. If you look at the return values for the entirety of the C standard library, you'll notice that there's almost always an element of error handling involved in them. For example, you have to check the return value of the following functions before you assume they've succeeded:

calloc, malloc and realloc
getchar
fopen
scanf and family
strtok

There are other non-standard functions that follow this pattern:

pthread_create, etc.
socket, connect, etc.
open, read, write, etc.

Generally speaking, a return value conveys a number of items successfully read/written/converted or a flat-out boolean success/fail value, and in practice you'll almost always need such a return value, unless you're going to exit(EXIT_FAILURE); at any errors (in which case I would rather not use your modules, because they give me no opportunity to clean up within my own code).

There are functions that don't use this pattern in the standard C library, because they use no resources (e.g. allocations or files) and so there's no chance of any error. If your function is a basic translation function (e.g. like toupper, tolower and friends which translate single character values), for example, then you don't need a return value for error handling because there are no errors. I think you'll find this scenario quite rare indeed, but if that is your scenario, by all means use the first option!

In summary, you should always highly consider using option 2, reserving the return value for a similar use, for the sake of consistent with the rest of the world, and because you might later decide that you need the return value for communicating errors or number of items processed.

137

answered Oct 04 '22 04:10

autistic

Method (1) passes the object by value, which requires that the object be copied. It's copied when you pass it in and copied again when it's returned. Method (2) passes only a pointer. When you're passing a primitive, (1) is just fine, but when you're passing an object, a struct, or an array, that's just wasted space and time.

In Java and many other languages, objects are always passed by reference. Behind the scenes, only a pointer is copied. This means that even though the syntax looks like (1), it actually works like (2).

answered Oct 04 '22 03:10

Thom Smith

Related questions
                            
                                ADC single conversion on STM32
                            
                                How Stack or memory is allocated for threads under the same process in Linux
                            
                                Linux - why is the program break pointer (brk/sbrk) different each time a program is run?
                            
                                split char string with multi-character delimiter in C
                            
                                What does ccache mean by "called for link"
                            
                                Programmatically verify a X509 certificate and private key match
                            
                                how to force recompile when changing Makefile flags?
                            
                                Are there any other arguments that main() can accept?
                            
                                How do images work in opencl kernel?
                            
                                Using Go on existing C project
                            
                                How to divide a decimal MIDI pitch-bend value into 2 separated 7 bit values correctly?
                            
                                Use a dope vector to access arbitrary axial slices of a multidimensional array?
                            
                                With MACH-O is there a way to register a function that will run before main?
                            
                                Is it possible to increase the refresh speed of srand(time(NULL)) in C?
                            
                                GCC does not warn about conversion and loss of data
                            
                                Union as an argument to a function in C
                            
                                Standard Input C: Incorrect string if $ is present
                            
                                cache locality for a binary tree
                            
                                Deletion Using memcpy in an array
                            
                                Why does printf literally print (null) and what exactly happens?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With