What is the preferred method for sharing compiled C code in an R package and running it from another?

Tags: c, package, r

Assume I have two packages in R, the first named foo, the second named bar. I want to include a C function in foo and share that functionality with bar in a way that is platform independent and consistent with the CRAN policies.

What is the preferred method of doing this, and how should I go about using function registration and dynamic libraries?

The reason for my question is that, even though I read through all the documentation I could find, nothing occurred to me as the obvious thing to do, and I am unsure what the most sustainable course of action is.

Example:

Assume that in one package foo, I define a C function addinc which adds two numbers.

#include <R.h>
#include <Rinternals.h>

SEXP addinc(SEXP x_, SEXP y_) {
  double x = asReal(x_);
  double y = asReal(y_);

  double sum = x + y;

  return ScalarReal(sum);
}

In the same package, I can try calling addinc in an R function named addinr via the .Call interface.

addinr <- function(x,y){
  .Call("addinc", x, y, PACKAGE="foo")
}

However, when building, checking and installing the package, running addinr returns the error below, presumably because the function is not yet registered within R.

library(foo)
addinr(1,2)

Error in .Call("addinc", x, y, PACKAGE = "foo") :
"addinc" not available for .Call() for package "foo"

It seems to me that the easiest way to solve this is to build a dynamic library for the compiled code by adding useDynLib(foo) to foo's NAMESPACE file. This appears to solve the problem because I can now call addinr() without problems. Moreover, I can run .Call("addinc", ..., PACKAGE="foo") directly from within R.

My real question, however, arises when a second package, say bar, is supposed to use foo's addinc. For example, assume that bar defines a function multiplyinr as follows.

multiplyinr <- function(x,y){
  ans <- 0
  for(i in 1:y) ans <- .Call("addinc", ans, x, PACKAGE="foo")
  ans
}

This, in fact, works completely fine, and I can call multiplyinr within R. However, when building and checking bar, I receive a Note complaining about the fact that bar is calling foreign language functions from a different package.

Foreign function call to a different package:
.Call("addinc", ..., PACKAGE = "foo")
See chapter ‘System and foreign language interfaces’ in the ‘Writing R Extensions’ manual.

According to this question, the package bar would not be suitable for submission to CRAN because using .Call() in this way is not considered "portable" as explained in the Writing R Extensions manual.

In conclusion, the simple solution of having foo include a useDynLib(foo) in its NAMESPACE file does not quite seem to cut it. Therefore my question: What is the preferred method to share a C function with other packages?

Moreover:

Is using useDynLib() truly dangerous or inconsistent with the CRAN policies? What is the purpose of declaring useDynLib() in the NAMESPACE file as an alternative to registering and building the shared library manually?

Would manually registering the C function and building the shared library change anything (i.e., using R_RegisterCCallable() or R_registerRoutines())?

asked Jan 30 '16 15:01

SimonG

1 Answer

The general idea is that the 'native symbol info' objects created by using e.g. useDynLib(<pkg>, <symbol>) are not part of the public API of a package, and so client packages should not be calling them directly (it's assumed they could be changed in future revisions of the package).

There are two ways to 'export' a compiled routine for use by client packages:

1. Just export an R wrapper function in foo that calls the native routine directly, or
2. Use the R_RegisterCCallable() / R_GetCCallable() pair of functions to get a pointer to the function you want. (Package foo would call R_RegisterCCallable() to make some function available; client package bar would call R_GetCCallable() to get a pointer to that function.)

In other words, if a package author 'registers' their C functions with R_RegisterCCallable(), they're declaring those functions to be part of the public C API of their package and allowing client packages to call them through this interface.
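
To make the second option more concrete, here is a toy model in plain C of what the register / get pair does conceptually: the provider stores a function pointer under a (package, name) key, and the client looks it up at run time and casts it back to the signature it expects. This is only a sketch, not R's implementation. The names toy_register_ccallable, toy_get_ccallable, toy_fn, foo_init and bar_multiply, the fixed-size table, and the double-based addinc are all assumptions made for illustration. In a real package, foo would instead call R_RegisterCCallable("foo", "addinc", (DL_FUNC) addinc) and bar would call R_GetCCallable("foo", "addinc"), both declared in R_ext/Rdynload.h, with addinc taking and returning SEXP.

```c
#include <stddef.h>
#include <string.h>

/* Generic function-pointer type; plays the role of R's DL_FUNC (hypothetical name). */
typedef void (*toy_fn)(void);

/* One registry entry: which package publishes which routine. */
struct toy_entry {
    const char *package;
    const char *name;
    toy_fn      fptr;
};

#define TOY_MAX_ENTRIES 16
static struct toy_entry toy_table[TOY_MAX_ENTRIES];
static size_t toy_count = 0;

/* Provider side: publish a routine under (package, name).
   Returns 0 on success, -1 if the table is full. */
int toy_register_ccallable(const char *package, const char *name, toy_fn fptr)
{
    if (toy_count >= TOY_MAX_ENTRIES) return -1;
    toy_table[toy_count].package = package;
    toy_table[toy_count].name    = name;
    toy_table[toy_count].fptr    = fptr;
    toy_count++;
    return 0;
}

/* Client side: look a routine up by (package, name); NULL if it was never published. */
toy_fn toy_get_ccallable(const char *package, const char *name)
{
    for (size_t i = 0; i < toy_count; i++) {
        if (strcmp(toy_table[i].package, package) == 0 &&
            strcmp(toy_table[i].name, name) == 0)
            return toy_table[i].fptr;
    }
    return NULL;
}

/* "foo": owns the routine and explicitly chooses to publish it. */
static double addinc(double x, double y) { return x + y; }

void foo_init(void)
{
    toy_register_ccallable("foo", "addinc", (toy_fn) addinc);
}

/* "bar": knows only the published name and the expected signature. */
typedef double (*addinc_fn)(double, double);

/* Multiply x by a non-negative count through repeated calls to foo's addinc.
   Stores the result in *out and returns 0, or returns -1 if addinc is unavailable. */
int bar_multiply(double x, int times, double *out)
{
    addinc_fn add = (addinc_fn) toy_get_ccallable("foo", "addinc");
    if (add == NULL) return -1;

    double ans = 0.0;
    for (int i = 0; i < times; i++) ans = add(ans, x);
    *out = ans;
    return 0;
}
```

With the real API, foo would typically make the registration call from its R_init_foo() routine, and bar would need to make sure foo is loaded (for example by importing it) before asking for the pointer. The point of the design is the same as in the toy: bar never touches foo's native symbol objects, only what foo has chosen to publish.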

answered Nov 05 '22 21:11

Kevin Ushey
