All,
I am starting to write object-oriented R code for the first time and anticipate having multiple R files with dependencies in between. I'm new to R and have not yet wrote anything outside of a single massive script to test ideas. Are there resources online that give tips on how one ought to organize code? Short of descriptions on how to build packages, I'm failing to find such guidance. At this point, I just want to organize the code in such a way that it makes loading and interacting with the collection of routines as straightforward as possible.
Appreciate any guidance you can provide.
Chris
R's base and recommended packages are compiled when R is installed, and your scripts and functions are automatically compiled "just-in-time" as you execute them. At the useR!
This question is very closely related to: "How to organize large R programs?"
You should consider creating an R package. You can use the package.skeleton
function to start with given a set of R files. I also strongly recommend using roxygen
to document the package at the beginning, because it's much more difficult to do it after the fact.
Read "Writing R Extensions". The online book "Statistics with R" has a section on this subject. Also take a look at Creating R Packages: A Tutorial by Friedrich Leisch. Lastly, if you're in NY, come to the upcoming NY use-R group meeting on "Authoring R Packages: a gentle introduction with examples".
Just to rehash some suggestions about good practices:
R CMD check
which is very helpful at catching bugs; separately you can look at using the codetools
package.Edit:
Regarding how do make incremental changes without rebuilding and installing the full package: I find the easiest thing to do is to make changes in your relevant R file and then use the source
command to load those changes. Once you load your library into an R session, it will always be lower in the environment (and lower in priority) than the .GlobalEnv, so any changes that you source or load in directly will be used first (use the search
command to see this). That way you can have your package underlying and you are overwriting changes as you're testing them in the environment.
Alternatively, you can use an IDE like StatET or ESS. They make loading individual lines or functions out of an R package very easy. StatET is particularly well designed to handle managing packages in a directory-like structure.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With