I'd like to improve the performance of convolution in Python, and I'm hoping for some insight on how best to go about it.
I am currently using scipy to perform the convolution, with code somewhat like the snippet below:
import numpy
import scipy.signal
import timeit

a = numpy.arange(1000000).reshape(1000, 1000)
filt = numpy.array([[1, 1, 1], [1, -8, 1], [1, 1, 1]])

def convolve():
    global a, filt
    scipy.signal.convolve2d(a, filt, mode="same")

t = timeit.Timer("convolve()", "from __main__ import convolve")
print("%.2f sec/pass" % (10 * t.timeit(number=10) / 100))
I am processing grayscale image data (integer values between 0 and 255), and I currently get about a quarter of a second per convolution. My thinking was to do one of the following:
- Use corepy, preferably with some optimizations
- Recompile NumPy with icc & MKL
- Use python-cuda
I was wondering if anyone has experience with any of these approaches (what sort of gain would be typical, and whether it is worth the time), or if anyone knows of a better library for performing convolution with NumPy.
Thanks!
EDIT:
Speed-up of about 10x by rewriting the Python loop in C, compared with using NumPy.
Cython can also give drastic speed increases at runtime by explicitly declaring the data types of variables and arrays: with an explicitly typed "ndarray" (or typed memoryview), loops over NumPy arrays that would otherwise run in the interpreter are compiled to C, and array processing has been reported to get as much as 1250x faster.
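As an illustration only, here is a minimal sketch of what such explicitly typed code might look like, written in Cython's "pure Python" mode so the file stays valid Python syntax; the function name laplacian8 and the typed-memoryview annotations are my own choices rather than anything from a specific tutorial, and the module has to be compiled (e.g. with cythonize) before the type declarations pay off:
import cython

@cython.boundscheck(False)   # skip bounds checks inside the loops when compiled
@cython.wraparound(False)    # disallow negative indexing so indexing becomes plain pointer math
def laplacian8(a: cython.double[:, :], out: cython.double[:, :]) -> None:
    # 3x3 "sum of the 8 neighbours minus 8*centre" filter over the interior pixels
    i: cython.Py_ssize_t
    j: cython.Py_ssize_t
    for i in range(1, a.shape[0] - 1):
        for j in range(1, a.shape[1] - 1):
            out[i, j] = (a[i - 1, j - 1] + a[i - 1, j] + a[i - 1, j + 1]
                         + a[i, j - 1] - 8.0 * a[i, j] + a[i, j + 1]
                         + a[i + 1, j - 1] + a[i + 1, j] + a[i + 1, j + 1])
Uncompiled, the decorators and annotations are essentially no-ops and it runs as ordinary (slow) Python; compiled, the two inner loops turn into plain C loops over the buffers.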
Using CuPy is a great way to accelerate NumPy-style array and matrix operations on the GPU by many times. It's important to note that the speed-ups you'll get are highly dependent on the size of the array you're working with.
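A minimal sketch of what that might look like for this particular filter, assuming a CUDA-capable GPU and a CuPy build that ships cupyx.scipy.ndimage (the dtype and mode choices here are just illustrative):
import numpy
import cupy
from cupyx.scipy import ndimage as cp_ndimage

a = numpy.arange(1000000, dtype=numpy.float32).reshape(1000, 1000)
filt = numpy.array([[1, 1, 1], [1, -8, 1], [1, 1, 1]], dtype=numpy.float32)

a_gpu = cupy.asarray(a)        # copy the image to GPU memory
filt_gpu = cupy.asarray(filt)
out_gpu = cp_ndimage.convolve(a_gpu, filt_gpu, mode="constant")  # runs on the GPU
out = cupy.asnumpy(out_gpu)    # copy the result back to the host
For a single 1000x1000 image the host-to-GPU copies can easily dominate the runtime, so the win is largest when the data stays on the GPU across many operations.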
Numba is claimed to be the fastest of these, around 10 times faster than NumPy for this kind of loop-heavy code. Julia is claimed by its developers to be a very fast language.
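For reference, a minimal Numba sketch for this 3x3 kernel might look like the following; the function name and loop structure are my own, and the first call includes JIT compilation time, so it should be called once before timing it:
import numpy
from numba import njit

@njit(cache=True)
def convolve3x3(a, filt):
    # naive "same"-size 3x3 convolution; Numba compiles these loops to machine code
    rows, cols = a.shape
    out = numpy.zeros((rows, cols), dtype=numpy.float64)
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            acc = 0.0
            for di in range(-1, 2):
                for dj in range(-1, 2):
                    # flip the kernel indices for a true convolution
                    # (irrelevant here, since this kernel is symmetric)
                    acc += a[i + di, j + dj] * filt[1 - di, 1 - dj]
            out[i, j] = acc
    return out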
Takeaway: NumPy provides highly-optimized functions for performing mathematical operations on arrays of numbers.
The code in scipy for doing 2d convolutions is a bit messy and unoptimized. See http://svn.scipy.org/svn/scipy/trunk/scipy/signal/firfilter.c if you want a glimpse into the low-level functioning of scipy.
If all you want is to convolve with a small, constant kernel like the one you showed, a function like this might work:
def specialconvolve(a):
    # sorry, you must pad the input yourself
    # sum each pixel with its vertical neighbours (the rows above and below)
    rowconvol = a[1:-1, :] + a[:-2, :] + a[2:, :]
    # then sum horizontally, and subtract 9*centre to turn the 3x3 box sum
    # into "sum of the 8 neighbours minus 8*centre"
    colconvol = rowconvol[:, 1:-1] + rowconvol[:, :-2] + rowconvol[:, 2:] - 9*a[1:-1, 1:-1]
    return colconvol
This function takes advantage of the separability of the kernel, as DarenW suggested above, as well as the more optimized NumPy arithmetic routines. It's over 1000 times faster than the convolve2d function by my measurements.
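As a quick sanity check of my own (not from the original measurement), zero-padding the input by one pixel with numpy.pad makes the result line up with convolve2d's default zero-filled "same" output:
import numpy
import scipy.signal

a = numpy.arange(1000000, dtype=float).reshape(1000, 1000)
filt = numpy.array([[1, 1, 1], [1, -8, 1], [1, 1, 1]])

padded = numpy.pad(a, 1, mode="constant")             # one pixel of zeros on every side
fast = specialconvolve(padded)                        # same shape as a
slow = scipy.signal.convolve2d(a, filt, mode="same")
print(numpy.allclose(fast, slow))                     # should print True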