How should I handle 'helper' functions in an R package?

Background

I have written an R package, and now a collaborator (a recent CS grad who is new to R) is editing and refactoring the code. In the process, he is dividing my functions into smaller, more generic functions.

What he is doing makes sense, but when I started with package.skeleton(), I had one file per function. Now he has added functions that the primary function depends on, but that may have limited use outside that function.

He suggests that all the functions go into a single file, but I am against that because it is easier to do version control when we work on different files.

I have since started using roxygen to document each function inline, in its source file.

Question

What is the recommended way to handle functions? Clearly the helper functions should stay with the main function, but to what extent do I need to document helper functions?

The @export suggestion in the comments is helpful, but I am curious to know how others organize their code.

659

asked Mar 09 '11 17:03

David LeBauer

1 Answer

I split up my functions under two conditions:

when it improves the readability of the main function's code, and/or
when it avoids copy-pasting code, e.g. if the same code is used a couple of times within the same function.

I do include the so-called helper functions in the file of the main function, but only as long as those helper functions are not used in any other function. Actually, I consider them nested within the main function. I do understand your argument for version control, but changing the helper function comes down to changing the performance of the main function, so I see no problem in keeping them in the same file.

Some helper functions are used by several other functions, and those I save in their own file. Often I do export those functions, as they might be of interest to the user. Compare this to e.g. lm and the underlying lm.fit, where advanced users can make good use of lm.fit to speed up their code.
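To make the lm / lm.fit comparison concrete, here is a minimal sketch using the built-in mtcars data. The variable names (x, y, fit.low, fit.high) are only illustrative; the point is that the exported lower-level fitter takes a ready-made design matrix and skips the formula handling that lm does.

```r
# Design matrix built by hand: an intercept column plus one predictor.
x <- cbind(Intercept = 1, wt = mtcars$wt)
y <- mtcars$mpg

fit.low  <- lm.fit(x, y)                 # low-level fitter, no formula handling
fit.high <- lm(mpg ~ wt, data = mtcars)  # user-facing wrapper

# Both routes should give the same coefficients.
same.coef <- all.equal(unname(fit.low$coefficients), unname(coef(fit.high)))
print(same.coef)
```

Whether skipping the wrapper is worth it depends on how often the fit is repeated; for a single model the difference is unlikely to matter.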

I use a naming convention found in quite a few packages (and derived from Linux, where dot-prefixed files are hidden): I prefix every "hidden" function with a dot. That gives:

    .helper.function <- function(x, ...) {
        # ... some code ...
    }

    main.function <- function(x, ...) {
        # ... some code, including .helper.function(y, ...)
    }
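One practical effect of the dot prefix can be checked directly: ls() leaves out names that start with a dot unless all.names = TRUE is given. The sketch below uses a throwaway environment, and the function bodies are made up purely for illustration.

```r
# A scratch environment standing in for a package's code.
env <- new.env()

# Hypothetical helper: drop missing values.
assign(".helper.function", function(x) x[!is.na(x)], envir = env)

# Hypothetical main function that relies on the helper.
assign("main.function", function(x) mean(env$.helper.function(x)), envir = env)

visible <- ls(env)                    # dot-prefixed names are skipped
all.objs <- ls(env, all.names = TRUE) # includes the helper as well
print(visible)
print(all.objs)
```

Note that the dot only affects how the name is listed. Whether a function is exported from a package is still decided by the NAMESPACE file.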

I explicitly @export all functions that need exporting, never the helper functions. It's not always easy to judge whether a function might be of interest to an end user, but in most cases it's pretty clear.

To give an example: a few lines of code that cut off rows containing NA I consider a helper function. A more complex function that converts the dataset to the correct format I export and document.
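A minimal sketch of how that split might look with roxygen2 comments. The function names, the id/value column layout, and the sorting step are all hypothetical; @noRd is one way to keep a helper's comments in the source without generating a help page, and only the main function carries @export.

```r
#' Drop rows that contain any NA (internal helper, not exported)
#'
#' @param df A data frame.
#' @return `df` without the incomplete rows.
#' @noRd
.drop.na.rows <- function(df) {
  df[stats::complete.cases(df), , drop = FALSE]
}

#' Prepare a dataset for analysis
#'
#' @param df A data frame with columns `id` and `value` (assumed layout).
#' @return The complete rows of `df`, sorted by `id`.
#' @export
prepare.data <- function(df) {
  clean <- .drop.na.rows(df)
  clean[order(clean$id), , drop = FALSE]
}

# Usage: the row with the missing value is dropped, the rest is sorted by id.
raw <- data.frame(id = c(3, 1, 2), value = c(10, NA, 30))
out <- prepare.data(raw)
print(out)
```

The helper still gets a short description for whoever reads the source, but only prepare.data ends up in the package's public interface.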

YMMV

125

answered Sep 22 '22 05:09

Joris Meys

Tags:

package

r

coding-style

roxygen