Is it possible to compile (C++) code for the GPU with nvcc into a shared object (.so file) and load it dynamically from a C++ program (in this case, Cern's ROOT, which is essentially a C++ interpreter ("CINT")). A simple example that I would like to run is: <pre class="prettyprint"><code>extern "C" void TestCompiled() { printf("test\n"); exit(0); } </code></pre> This code was compiled with <code>nvcc --compiler-options '-fPIC' -o TestCompiled_C.so --shared TestCompiled.cu</code>. Loading the shared object into ROOT with: <pre class="prettyprint"><code>{ // Test.C program int error, check; check = gROOT->LoadMacro("TestCompiled_C.so", &error); cout << "check " << check << " " << " error: " << error << endl; TestCompiled(); // run macro exit(0); } </code></pre> loads the library OK, but does not find <code>TestCompiled()</code>: <pre class="prettyprint"><code>$ root -b -l Test.C root [0] Processing Test.C... check 0 error: 0 Error: Function Hello() is not defined in current scope Test.C:11: *** Interpreter error recovered *** </code></pre> Doing the same by compiling the first test script with ROOT (without the <code>extern</code> line, compiling with <code>root TestCompiled.C++</code>) works… What can I try in order to make the C++ program find the test function when nvcc does the compilation?

I'm copying, for reference, the salient points of the answer from the RootTalk forum that solved the problem: A key point is that the C interpreter of ROOT (CINT) requires a "CINT dictionary" for the externally compiled function. (There is no problem when compiling through ROOT, because ACLiC creates this dictionary when it pre-compiles the macro [<code>root TestCompiled.C++</code>]). So, an interface <code>TestCompiled.h++</code> must be created: <pre class="prettyprint"><code>#ifdef __cplusplus extern "C" { #endif void TestCompiled(void); #ifdef __cplusplus } /* end of extern "C" */ #endif </code></pre> The interface must then be loaded inside ROOT along with the shared object: <pre class="prettyprint"><code>{ // Test.C ROOT/CINT unnamed macro (interpreted) Int_t check, error; check = gROOT->LoadMacro("TestCompiled_C.so", &error); std::cout << "_C.so check " << check << " error " << error << std::endl; check = gROOT->LoadMacro("TestCompiled.h++", &error); std::cout << "_h.so check " << check << " error " << error << std::endl; TestCompiled(); // execute the compiled function } </code></pre> ROOT can now use the externally compiled program: <code>root -b -l -n -q Test.C</code> works. This can be tested with, e.g., g++ on the following <code>TestCompiled.C</code>: <pre class="prettyprint"><code>#include <cstdio> extern "C" void TestCompiled(void) { printf("test\n"); } </code></pre> compiled with <pre class="prettyprint"><code>g++ -fPIC -shared -o TestCompiled_C.so TestCompiled.C </code></pre>

Compiling part of a C++ program for GPU

Tags:

c++

dynamic-linking

nvcc

Is it possible to compile (C++) code for the GPU with nvcc into a shared object (.so file) and load it dynamically from a C++ program (in this case, Cern's ROOT, which is essentially a C++ interpreter ("CINT")).

A simple example that I would like to run is:

extern "C"
void TestCompiled() {
  printf("test\n");
  exit(0); 
}

This code was compiled with nvcc --compiler-options '-fPIC' -o TestCompiled_C.so --shared TestCompiled.cu. Loading the shared object into ROOT with:

{ // Test.C program
  int error, check;
  check = gROOT->LoadMacro("TestCompiled_C.so", &error);
  cout << "check " << check << " " << " error: " << error << endl;
  TestCompiled();  // run macro
  exit(0); 
}

loads the library OK, but does not find TestCompiled():

$ root -b -l Test.C
root [0] 
Processing Test.C...
check 0  error: 0
Error: Function Hello() is not defined in current scope  Test.C:11:
*** Interpreter error recovered ***

Doing the same by compiling the first test script with ROOT (without the extern line, compiling with root TestCompiled.C++) works… What can I try in order to make the C++ program find the test function when nvcc does the compilation?

244

asked May 22 '14 13:05

Eric O Lebigot

2 Answers

I am assuming that the shared object file being output is like any other shared library, such as one created with GCC using the shared option. In this case, to load the object dynamically, you will need to use the dlopen function to get a handle to the shared object. Then, you can use the dlsym function to look for a symbol in the file.

void *object_handle = dlopen("TestCompiled_C.so", RTLD_NOW);
if (object_handle == NULL)
{
  printf("%s\n", dlerror());
  // Exit or return error code
}
void *test_compiled_ptr = dlsym(object_handle, "TestCompiled");
if (!test_compiled)
{
  printf("%s\n", dlerror());
  // Exit or return error code
}

void (*test_compiled)() = (void (*)()) test_compiled_ptr;
test_compiled();

You will need to include dlfcn.h and link with -ldl when you compile.

The difference between this and what you are doing now is that you are loading the library statically rather that dynamically. Even though shared objects are "dynamically linked libraries," as they are called in the windows world, doing it the way you are now is loading all of the symbols in the object when the program is launched. To dynamically load certain symbols at runtime, you need to do it this way.

107

answered Sep 25 '22 20:09

ImOnALampshade

I'm copying, for reference, the salient points of the answer from the RootTalk forum that solved the problem:

A key point is that the C interpreter of ROOT (CINT) requires a "CINT dictionary" for the externally compiled function. (There is no problem when compiling through ROOT, because ACLiC creates this dictionary when it pre-compiles the macro [root TestCompiled.C++]).

So, an interface TestCompiled.h++ must be created:

#ifdef __cplusplus
extern "C" {
#endif

  void TestCompiled(void);

#ifdef __cplusplus
} /* end of extern "C" */
#endif

The interface must then be loaded inside ROOT along with the shared object:

{ // Test.C ROOT/CINT unnamed macro (interpreted)
  Int_t check, error;
  check = gROOT->LoadMacro("TestCompiled_C.so", &error);
  std::cout << "_C.so check " << check << " error " << error << std::endl;
  check = gROOT->LoadMacro("TestCompiled.h++", &error);
  std::cout << "_h.so check " << check << " error " << error << std::endl;
  TestCompiled(); // execute the compiled function
}

ROOT can now use the externally compiled program: root -b -l -n -q Test.C works.

This can be tested with, e.g., g++ on the following TestCompiled.C:

#include <cstdio>
extern "C" void TestCompiled(void) { printf("test\n"); }

compiled with

g++ -fPIC -shared -o TestCompiled_C.so TestCompiled.C

answered Sep 23 '22 20:09

Eric O Lebigot

Related questions
                            
                                Curl not timing out properly
                            
                                Discrepancy between istream's operator>> (double& val) between libc++ and libstdc++
                            
                                Write Program to Make CPU Usage About 50%
                            
                                Pointer arithmetic and integral promotion
                            
                                May a member function template specialization have a different access level than the main template?
                            
                                How to deal with precompiled headers randomly becoming corrupted on a cancelled build?
                            
                                What is the closest complete native library to three.js?
                            
                                Replicate Visual Studio 2013 custom GUI in winapi
                            
                                Size of polymorphic class derived virtually
                            
                                C/C++ linkage convention
                            
                                How to open Visual Studio Express 2013 for Windows Desktop (C++)? [closed]
                            
                                Qt: resize a QGraphicsItem (boundingRect()) into a QGraphicsScene with the mouse
                            
                                OpenMP recursive tasks
                            
                                Constructing a non-copyable, non-movable type into a function parameter without invoking initializer_list constructor
                            
                                Lua C API stack visualizer/viewer in Visual Studio 2013
                            
                                Why does std::sort compare the element to itself
                            
                                Cython syntax for declaring class hierarchies that have aliases
                            
                                Using SDL for a web application
                            
                                Is \0 ("\\0" in a C-style regex string) a valid escape sequence in C++ regular expressions?
                            
                                Explicit template instantiation with variadic templates

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Compiling part of a C++ program for GPU

Tags:

c++

dynamic-linking

nvcc

Eric O Lebigot

People also ask

2 Answers

ImOnALampshade

Eric O Lebigot

Recent Activity

Donate For Us