Passing a numpy array to C++

Tags:

I have some code writen in Python for which the output is a numpy array, and now I want to send that output to C++ code, where the heavy part of the calculations will be performed.

I have tried using cython's public cdef, but I am running on some issues. I would appreciate your help! Here goes my code:

pymodule.pyx:

Click to copy

from pythonmodule import result # result is my numpy array
import numpy as np
cimport numpy as np
cimport cython

@cython.boundscheck(False)
@cython.wraparound(False)
cdef public void cfunc():
    print 'I am in here!!!'
    cdef np.ndarray[np.float64_t, ndim=2, mode='c'] res = result
    print res

Once this is cythonized, I call:

pymain.c:

Click to copy

#include <Python.h>
#include <numpy/arrayobject.h>
#include "pymodule.h"

int main() {
  Py_Initialize();
  initpymodule();
  test(2);
  Py_Finalize();
}

int test(int a)
{
    Py_Initialize();
    initpymodule();
    cfunc();
    return 0;
}

I am getting a NameError for the result variable at C++. I have tried defining it with pointers and calling it indirectly from other functions, but the array remains invisible. I am pretty sure the answer is quite simple, but I just do not get it. Thanks for your help!

648

asked Jun 04 '16 08:06

user3225486

1 Answers

Short Answer

The NameError was cause by the fact that Python couldn't find the module, the working directory isn't automatically added to your PYTHONPATH. Using setenv with setenv("PYTHONPATH", ".", 1); in your C/C++ code fixes this.

Longer Answer

There's an easy way to do this, apparently. With a python module pythonmodule.py containing an already created array:

Click to copy

import numpy as np

result = np.arange(20, dtype=np.float).reshape((2, 10))

You can structure your pymodule.pyx to export that array by using the public keyword. By adding some auxiliary functions, you'll generally won't need to touch neither the Python, nor the Numpy C-API:

Click to copy

from pythonmodule import result
from libc.stdlib cimport malloc
import numpy as np
cimport numpy as np


cdef public np.ndarray getNPArray():
    """ Return array from pythonmodule. """
    return <np.ndarray>result

cdef public int getShape(np.ndarray arr, int shape):
    """ Return Shape of the Array based on shape par value. """
    return <int>arr.shape[1] if shape else <int>arr.shape[0]

cdef public void copyData(float *** dst, np.ndarray src):
    """ Copy data from src numpy array to dst. """
    cdef float **tmp
    cdef int i, j, m = src.shape[0], n=src.shape[1];

    # Allocate initial pointer 
    tmp = <float **>malloc(m * sizeof(float *))
    if not tmp:
        raise MemoryError()

    # Allocate rows
    for j in range(m):
        tmp[j] = <float *>malloc(n * sizeof(float))
        if not tmp[j]:
            raise MemoryError()

    # Copy numpy Array
    for i in range(m):
        for j in range(n):
            tmp[i][j] = src[i, j]

    # Assign pointer to dst
    dst[0] = tmp

Function getNPArray and getShape return the array and its shape, respectively. copyData was added in order to just extract the ndarray.data and copy it so you can then finalize Python and work without having the interpreter initialized.

A sample program (in C, C++ should look identical) would look like this:

Click to copy

#include <Python.h>
#include "numpy/arrayobject.h"
#include "pyxmod.h"
#include <stdio.h>

void printArray(float **arr, int m, int n);
void getArray(float ***arr, int * m, int * n);

int main(int argc, char **argv){
    // Holds data and shapes.
    float **data = NULL;
    int m, n;

    // Gets array and then prints it.
    getArray(&data, &m, &n);
    printArray(data, m, n);

    return 0;
}

void getArray(float ***data, int * m, int * n){
    // setenv is important, makes python find 
    // modules in working directory
    setenv("PYTHONPATH", ".", 1);

    // Initialize interpreter and module
    Py_Initialize();
    initpyxmod();

    // Use Cython functions.
    PyArrayObject *arr = getNPArray();
    *m = getShape(arr, 0);
    *n = getShape(arr, 1);

    copyData(data, arr);

    if (data == NULL){  //really redundant.
        fprintf(stderr, "Data is NULL\n");
        return ;
    }

    Py_DECREF(arr);
    Py_Finalize();
}

void printArray(float **arr, int m, int n){
    int i, j;
    for(i=0; i < m; i++){
        for(j=0; j < n; j++)
            printf("%f ", arr[i][j]);

        printf("\n");
    }
}

Always remember to set:

Click to copy

setenv("PYTHONPATH", ".", 1);

before you call Py_Initialize so Python can find modules in the working directory.

The rest is pretty straight-forward. It might need some additional error-checking and definitely needs a function to free the allocated memmory.

Alternate Way w/o Cython:

Doing it the way you are attempting is way hassle than it's worth, you would probably be better off using numpy.save to save your array in a npy binary file and then use some C++ library that reads that file for you.

answered Oct 15 '22 00:10

Dimitris Fasarakis Hilliard

Related questions
                            
                                Segmentation fault and crashing when trying to import opencv
                            
                                Fitting a Poisson distribution to data in statsmodels
                            
                                Python ftplib Optimal Block Size?
                            
                                "localhost" vs "127.0.0.1" performance
                            
                                DataFrame.interpolate() extrapolates over trailing missing data
                            
                                Debug C-library from Python (ctypes)
                            
                                Can tests with pytest fixtures be run interactively?
                            
                                How to install a dependency from a submodule in Python?
                            
                                Modify function in decorator
                            
                                Is there a tool to automatically calculate Big-O complexity for a function [duplicate]
                            
                                Scrapy spider memory leak
                            
                                Pythonic and efficient way to do an elementwise "in" using numpy
                            
                                Why 2700 records (320KB each) should take 30 seconds to be fetched?
                            
                                Python 3.5 type hinting dynamically generated instance attributes
                            
                                What exactly does 'use_idf' do when creating a TfidfTransformer in sklearn?
                            
                                When and why socket.send() returns 0 in python?
                            
                                Python import fails on travisCI but not locally
                            
                                Why do I get a Keras LSTM RNN input_shape error?
                            
                                Is it OK to print to stdout or stderr in Django data migrations? If so, how?
                            
                                How to find the nearest neighbors for latitude and longitude point on python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Passing a numpy array to C++

Tags:

c++

python

arrays

numpy

cython

user3225486

People also ask

1 Answers

Short Answer

Longer Answer

Alternate Way w/o Cython:

Dimitris Fasarakis Hilliard

Recent Activity

Donate For Us