I've been doing some performance testing to improve the performance of a pet project I'm writing. It's a very number-crunching-intensive application, so I've been playing with Numpy as a way of improving computational performance.
However, the results from the following performance tests were quite surprising.
Test Source Code (Updated with test cases for hoisting and batch submission)
import timeit
numpySetup = """
import numpy
left = numpy.array([1.0,0.0,0.0])
right = numpy.array([0.0,1.0,0.0])
"""
hoistSetup = numpySetup + 'hoist = numpy.cross\n'
pythonSetup = """
left = [1.0,0.0,0.0]
right = [0.0,1.0,0.0]
"""
numpyBatchSetup = """
import numpy
l = numpy.array([1.0,0.0,0.0])
left = numpy.array([l]*10000)
r = numpy.array([0.0,1.0,0.0])
right = numpy.array([r]*10000)
"""
pythonCrossCode = """
x = ((left[1] * right[2]) - (left[2] * right[1]))
y = ((left[2] * right[0]) - (left[0] * right[2]))
z = ((left[0] * right[1]) - (left[1] * right[0]))
"""
pythonCross = timeit.Timer(pythonCrossCode, pythonSetup)
numpyCross = timeit.Timer('numpy.cross(left, right)', numpySetup)
hybridCross = timeit.Timer(pythonCrossCode, numpySetup)
hoistCross = timeit.Timer('hoist(left, right)', hoistSetup)
batchCross = timeit.Timer('numpy.cross(left, right)', numpyBatchSetup)
print 'Python Cross Product : %4.6f ' % pythonCross.timeit(1000000)
print 'Numpy Cross Product : %4.6f ' % numpyCross.timeit(1000000)
print 'Hybrid Cross Product : %4.6f ' % hybridCross.timeit(1000000)
print 'Hoist Cross Product : %4.6f ' % hoistCross.timeit(1000000)
# 100 batches of 10000 each is equivalent to 1000000
print 'Batch Cross Product : %4.6f ' % batchCross.timeit(100)
Original Results
Python Cross Product : 0.754945
Numpy Cross Product : 20.752983
Hybrid Cross Product : 4.467417
Final Results
Python Cross Product : 0.894334
Numpy Cross Product : 21.099040
Hybrid Cross Product : 4.467194
Hoist Cross Product : 20.896225
Batch Cross Product : 0.262964
Needless to say, this wasn't the result I expected. The pure Python version runs almost 30x faster than Numpy. In my other tests Numpy's performance has been better than the pure Python equivalent (which was the expected result).
So, I've got two related questions: why does numpy.cross perform so badly here, and what can I do to get the expected performance out of Numpy?
By explicitly declaring the "ndarray" data type, your array processing can be up to 1250x faster. This is the Cython approach to speeding up NumPy array processing: once the data types of variables are explicitly specified, Cython can compile the code to C and give drastic speed increases at runtime.
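As a rough illustration of that approach (my own sketch with a hypothetical module name, not the tutorial's code), the typed buffer declarations are what let Cython index the arrays at C speed:

# fast_cross.pyx -- hypothetical module; compile with Cython before importing
import numpy as np
cimport numpy as np

def cross3(np.ndarray[np.float64_t, ndim=1] a,
           np.ndarray[np.float64_t, ndim=1] b):
    # Typed ndarray arguments let Cython index elements directly in C,
    # skipping the per-access Python object machinery.
    cdef np.ndarray[np.float64_t, ndim=1] out = np.empty(3, dtype=np.float64)
    out[0] = a[1] * b[2] - a[2] * b[1]
    out[1] = a[2] * b[0] - a[0] * b[2]
    out[2] = a[0] * b[1] - a[1] * b[0]
    return out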
For larger input data, the Numba version of a function is much faster than the Numpy version, even taking the compilation time into account. In fact, the ratio of the Numpy and Numba run times depends on both the data size and the number of loops — or, more generally, on the nature of the function to be compiled.
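A minimal sketch of the Numba version (assuming Numba is installed; the function name is mine): the @njit decorator compiles the function to machine code on its first call, which is the one-time compile cost mentioned above.

import numpy as np
from numba import njit

@njit
def cross3(a, b):
    # Compiled to machine code on the first call; later calls bypass
    # the interpreter entirely.
    out = np.empty(3)
    out[0] = a[1] * b[2] - a[2] * b[1]
    out[1] = a[2] * b[0] - a[0] * b[2]
    out[2] = a[0] * b[1] - a[1] * b[0]
    return out

# The first call pays the compile time; subsequent calls are fast.
cross3(np.array([1.0, 0.0, 0.0]), np.array([0.0, 1.0, 0.0]))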
The reason why NumPy is fast when used right is that its arrays are extremely efficient. They are like C arrays instead of Python lists.
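To make that concrete (a sketch of mine, not part of the answer): a single numpy expression runs one C-level loop over a contiguous buffer, whereas the list version executes an interpreter step for every element.

import numpy

a = numpy.arange(1000000, dtype=numpy.float64)
b = numpy.arange(1000000, dtype=numpy.float64)

c = a * b  # one C loop over contiguous float64 memory
# The list equivalent boxes every float and dispatches each '*' in Python:
# c = [x * y for x, y in zip(list_a, list_b)]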
Try this with larger arrays. I think that here, just the cost of calling numpy's methods outweighs the few simple list accesses required by the Python version. If you deal with larger arrays, I think you'll see large wins for numpy.
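In the same spirit, a hand-vectorized batch cross product (my own sketch) pays numpy's call overhead a fixed handful of times per batch instead of once per vector, which is essentially what the batch test above exploits.

import numpy

def batch_cross(left, right):
    # left, right: (N, 3) arrays; returns the (N, 3) row-wise cross products.
    # Each line below is a few C-level ufunc calls over whole columns.
    x = left[:, 1] * right[:, 2] - left[:, 2] * right[:, 1]
    y = left[:, 2] * right[:, 0] - left[:, 0] * right[:, 2]
    z = left[:, 0] * right[:, 1] - left[:, 1] * right[:, 0]
    return numpy.column_stack((x, y, z))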