Calling MATLAB from Python is bound to incur some performance penalty that I could avoid by rewriting (a lot of) code in Python. That isn't a realistic option for me, but it bothers me that a huge loss of efficiency lies in the simple conversion from a NumPy array to a MATLAB double.
I'm talking about the following conversion from data1 to data1m, where
data1 = np.random.uniform(low=0.0, high=30000.0, size=(1000000,))
data1m = matlab.double(list(data1))
Here matlab.double comes from MathWorks' own MATLAB Engine API for Python. The second line of code takes 20 s on my system, which just seems like far too much for a conversion that does nothing more than make the numbers 'edible' for MATLAB.
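For reference, a minimal timing harness to reproduce the measurement (it assumes the MATLAB Engine API for Python is installed and importable; the harness itself is just illustrative):

import time
import numpy as np
import matlab

data1 = np.random.uniform(low=0.0, high=30000.0, size=(1000000,))

t0 = time.perf_counter()
data1m = matlab.double(list(data1))  # element-by-element conversion
print(f"matlab.double(list(...)) took {time.perf_counter() - t0:.1f} s")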
So basically I'm looking for a trick opposite to the one given here, which works for converting MATLAB output back to Python.
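(For reference, the linked trick for the MATLAB-to-Python direction typically looks like the sketch below. It relies on the mlarray internals _data, a flat array.array in column-major order, and size, the dimensions tuple; these are private attributes of the matlab package, so treat this as an assumption that may break between MATLAB releases.)

import numpy as np

def matlab_to_numpy(m):
    # m._data is the flat internal buffer, stored column-major ('F' order);
    # m.size holds the MATLAB dimensions as a tuple
    return np.array(m._data).reshape(m.size, order='F')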
Passing numpy arrays efficiently
Take a look at the file mlarray_sequence.py in the folder PYTHONPATH\Lib\site-packages\matlab\_internal. There you will find the construction of the MATLAB array object. The performance problem comes from copying data with loops within the generic_flattening function.
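You can see the cost of that loop-based copying in isolation. The comparison below is not the engine's exact code path, just a sketch of pushing values through Python objects one by one versus letting NumPy flatten the data in C:

import array
import timeit
import numpy as np

a = np.random.uniform(size=1000000)

# roughly what the stock code does: materialize Python floats, then iterate
t_loop = timeit.timeit(lambda: array.array('d', list(a)), number=3)

# what the patch below does: flatten in C, then initialize in one pass
t_ravel = timeit.timeit(lambda: array.array('d', np.ravel(a, order='F')), number=3)

print(t_loop / 3, t_ravel / 3)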
To avoid this behavior we will edit the file a bit. The fix works for both complex and non-complex datatypes. Make a backup of the original file in case something goes wrong.
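For example (the path is a placeholder; substitute your actual site-packages location):

import shutil

# placeholder path: replace PYTHONPATH with your Python installation directory
src = r"PYTHONPATH\Lib\site-packages\matlab\_internal\mlarray_sequence.py"
shutil.copy2(src, src + ".bak")  # keeps an untouched copy to restore from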
Add import numpy as np to the other imports at the beginning of the file.
In line 38 you should find:

init_dims = _get_size(initializer)

Replace this with:

try:
    # NumPy arrays expose their dimensions directly
    init_dims = initializer.shape
except AttributeError:
    # fall back to the original size detection for lists and other sequences
    init_dims = _get_size(initializer)
In line 48 you should find:

if is_complex:
    complex_array = flat(self, initializer, init_dims, typecode)
    self._real = complex_array['real']
    self._imag = complex_array['imag']
else:
    self._data = flat(self, initializer, init_dims, typecode)
Replace this with:

if is_complex:
    try:
        # fast path: flatten the NumPy array in column-major (MATLAB) order
        self._real = array.array(typecode, np.ravel(initializer, order='F').real)
        self._imag = array.array(typecode, np.ravel(initializer, order='F').imag)
    except Exception:
        # fall back to the original loop-based flattening for non-NumPy input
        complex_array = flat(self, initializer, init_dims, typecode)
        self._real = complex_array['real']
        self._imag = complex_array['imag']
else:
    try:
        self._data = array.array(typecode, np.ravel(initializer, order='F'))
    except Exception:
        self._data = flat(self, initializer, init_dims, typecode)
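The order='F' matters because MATLAB stores arrays column-major, while NumPy defaults to row-major. A quick check of the difference:

import numpy as np

m = np.array([[1, 2, 3],
              [4, 5, 6]])

print(np.ravel(m, order='C'))  # [1 2 3 4 5 6] -- NumPy's default row-major order
print(np.ravel(m, order='F'))  # [1 4 2 5 3 6] -- column-major, as MATLAB expects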
Now you can pass a numpy array directly to the MATLAB array creation method.
data1 = np.random.uniform(low=0.0, high=30000.0, size=(1000000,))

# faster
data1m = matlab.double(data1)
# or the slower method
data1m = matlab.double(data1.tolist())

data2 = np.random.uniform(low=0.0, high=30000.0, size=(1000000,)).astype(np.complex128)

# faster
data2m = matlab.double(data2, is_complex=True)
# or the slower method
data2m = matlab.double(data2.tolist(), is_complex=True)
The performance of MATLAB array creation increases by a factor of 15, and the interface is now easier to use.
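To check the speedup on your own machine, a simple benchmark along these lines should do (the numbers will vary, and matlab.double(data1) only works once the patch is applied):

import timeit
import numpy as np
import matlab

data1 = np.random.uniform(low=0.0, high=30000.0, size=(1000000,))

t_fast = timeit.timeit(lambda: matlab.double(data1), number=3)
t_slow = timeit.timeit(lambda: matlab.double(data1.tolist()), number=3)
print(f"patched path: {t_fast / 3:.3f} s/call, list path: {t_slow / 3:.3f} s/call")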