In the documentation, R suggests that raw data files (not Rdata nor Rda) should be placed in inst/extdata/
From the first paragraph in: http://cran.r-project.org/doc/manuals/R-exts.html#Data-in-packages
The data subdirectory is for data files, either to be made available via lazy-loading or for loading using data(). (The choice is made by the ‘LazyData’ field in the DESCRIPTION file: the default is not to do so.) It should not be used for other data files needed by the package, and the convention has grown up to use directory inst/extdata for such files.
So, I have moved all of my raw data into this folder, but when I build and reload the package and then try to access the data in a function with (for example):
read.csv(file=paste(path.package("my_package"),"/inst/extdata/my_raw_data.csv",sep=""))
# .path.package is now path.package in R 3.0+
I get the "cannot open file" error.
However, it does look like there is a folder called /extdata
in the package directory with the files in it (post-build and install). What's happening to the /inst
folder?
Does everything in the /inst folder get pushed into the /
of the package?
When the package is installed, all files (and folders) in inst/ are moved up one level to the top-level directory (so they can’t have names like R/ or DESCRIPTION ). To refer to files in inst/extdata (whether installed or not), use system.file (). For example, the readr package uses inst/extdata to store delimited files for use in examples:
The most common location for package data is (surprise!) data/. Each file in this directory should be a .RData file created by save () containing a single object (with the same name as the file). The easiest way to adhere to these rules is to use usethis::use_data ():
They’re only available inside your package. If you want to show examples of loading/parsing raw data, put the original files in inst/extdata. When the package is installed, all files (and folders) in inst/ are moved up one level to the top-level directory (so they can’t have names like R/ or DESCRIPTION ).
If you want to store parsed data, but not make it available to the user, put it in R/sysdata.rda. This is the best place to put data that your functions need. If you want to store raw data, put it in inst/extdata.
More useful than using file.path
would be to use system.file
. Once your package is installed, you can grab your file like so:
fpath <- system.file("extdata", "my_raw_data.csv", package="my_package")
fpath
will now have the absolute path on your HD to the file.
You were both very close and essentially had this. A formal reference from 'Writing R Extensions' is:
1.1.3 Package subdirectories
[...]
The contents of the
inst
subdirectory will be copied recursively to the installation directory. Subdirectories ofinst
should not interfere with those used by R (currently,R
,data
,demo
,exec
,libs
,man
,help
,html
andMeta
, and earlier versions usedlatex
,R-ex
). The copying of theinst
happens aftersrc
is built so itsMakefile
can create files to be installed. Prior to R 2.12.2, the files were installed on POSIX platforms with the permissions in the package sources, so care should be taken to ensure these are not too restrictive:R CMD build
will make suitable adjustments. To exclude files from being installed, one can specify a list of exclude patterns in file.Rinstignore
in the top-level source directory. These patterns should be Perl-like regular expressions (see the help forregexp
in R for the precise details), one per line, to be matched(10) against the file and directory paths, e.g.doc/.*[.]png$
will exclude all PNG files ininst/doc
based on the (lower-case) extension.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With