The code below best illustrates my problem:
The console output (NB it takes ~8 minutes to run even the first test) shows the 512x512x512 int16 array allocations consuming no more than expected (256 MByte each), and according to "top" the process generally stays below 600 MByte, as expected.
However, while the vectorized version of the function is being called, the process expands to an enormous size (over 7 GByte!). Even the most obvious explanation I can think of - that vectorize is converting the inputs and outputs to float64 internally - could only account for a couple of gigabytes, and in any case the vectorized function returns an int16 and the returned array is certainly int16. Is there some way to avoid this happening? Am I using or understanding vectorize's otypes argument wrongly?
import numpy as np
import subprocess

def logmem():
    # Report free system memory at this point in the run
    subprocess.call('cat /proc/meminfo | grep MemFree', shell=True)

def fn(x):
    return np.int16(x*x)

def test_plain(v):
    print "Explicit looping:"
    logmem()
    r = np.zeros(v.shape, dtype=np.int16)
    for z in xrange(v.shape[0]):
        for y in xrange(v.shape[1]):
            for x in xrange(v.shape[2]):
                r[z, y, x] = fn(v[z, y, x])
    print type(r[0, 0, 0])
    logmem()
    return r

vecfn = np.vectorize(fn, otypes=[np.int16])

def test_vectorize(v):
    print "Vectorize:"
    logmem()
    r = vecfn(v)
    print type(r[0, 0, 0])
    logmem()
    return r

logmem()
s = (512, 512, 512)
v = np.ones(s, dtype=np.int16)   # 256 MByte of int16
logmem()
test_plain(v)
test_vectorize(v)
v = None
logmem()
I'm using whichever versions of Python/numpy are current on an amd64 Debian Squeeze system (Python 2.6.6, numpy 1.4.1).
Some general background before the answers: vectorized NumPy operations are usually much faster than explicit Python for-loops because they dispatch to pre-compiled, low-level code instead of going through the dynamically typed Python interpreter element by element. This is why vectorization is the standard approach in numerical code and most machine learning workloads, and why Pandas, which builds on NumPy, gets similarly fast bulk operations. The sketch below illustrates the size of the difference.
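A minimal timing sketch of that gap (the array size and the use of np.dot here are illustrative choices, not from the thread):

import time
import numpy as np

v = np.random.rand(10**7)

# Interpreted Python loop: one bytecode dispatch and boxing per element.
t0 = time.time()
s = 0.0
for x in v:
    s += x * x
t_loop = time.time() - t0

# Vectorized equivalent: a single pre-compiled loop over the whole array.
t0 = time.time()
s2 = np.dot(v, v)
t_vec = time.time() - t0

print("loop: %.3fs  vectorized: %.3fs" % (t_loop, t_vec))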
It is a basic problem of vectorisation that all intermediate values are also vectors. While this is a convenient way to get a decent speed enhancement, it can be very inefficient with memory usage, and will be constantly thrashing your CPU cache. To overcome this problem, you need an approach whose explicit loops run at compiled speed, not at Python speed. The best ways to do this are to use Cython, Fortran code wrapped with f2py, or numexpr. You can find a comparison of these approaches here, although it focuses more on speed than memory usage.
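To make the numexpr route concrete, here is a minimal sketch for the question's fn (numexpr and its evaluate() call are real, but this particular usage is my illustration, not from the answer; it assumes numexpr is installed):

import numpy as np
import numexpr as ne

v = np.ones((512, 512, 512), dtype=np.int16)

# numexpr compiles the expression and evaluates it in small,
# cache-sized chunks, so it never materialises a full-size
# temporary or an object array.
r = ne.evaluate('v * v')

# numexpr may promote small integer types, so cast back
# if the exact dtype matters:
r = r.astype(np.int16)

For a function as simple as this one, plain NumPy already suffices: v * v on an int16 array stays int16 and allocates only the one result array; np.vectorize is only worth reaching for when the function cannot be expressed in array operations at all.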
You can read the source code of vectorize(). It converts the input array's dtype to object and calls np.frompyfunc() to create a ufunc from your Python function; that ufunc returns an object array, and finally vectorize() converts the object array to an int16 array.
An array whose dtype is object uses a great deal of memory, because every element is a separately allocated Python object (an 8-byte pointer in the array plus the boxed value it points to).
Calling a Python function to do element-wise calculation is also slow, even when it has been converted to a ufunc by frompyfunc().
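A small sketch of that mechanism (using a 512x512 array rather than the question's full 512^3 one, so it runs quickly):

import numpy as np

def fn(x):
    return np.int16(x * x)

v = np.ones((512, 512), dtype=np.int16)

# This is essentially what vectorize() builds internally.
uf = np.frompyfunc(fn, 1, 1)
obj = uf(v)

print(obj.dtype)    # object: one boxed Python value per element
print(v.nbytes)     # 2 bytes per element
print(obj.nbytes)   # 8 bytes per element just for the pointers,
                    # plus the boxed scalar objects they refer to

r = obj.astype(np.int16)   # the final cast that vectorize() performs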