I need to multiply two big matrices and sort their columns.
import numpy

a = numpy.random.rand(1000000, 100)
b = numpy.random.rand(300000, 100)
c = numpy.dot(b, a.T)
sorted_idx = [numpy.argsort(j)[:10] for j in c.T]  # indices of the 10 smallest entries per column of c
This process takes a lot of time and memory. Is there a way to speed it up? If not, how can I calculate the RAM needed for this operation? I currently have an EC2 box with 4 GB of RAM and no swap.
I was wondering if this operation can be done in a serialized fashion so that I don't have to store everything in memory.
Multiplying larger matrices: for the entry in the ith row and jth column of the product matrix, multiply each entry in the ith row of the first matrix by the corresponding entry in the jth column of the second matrix and add the results. Note that for 2-D arrays, np.matmul (or the @ operator) can be used in place of np.dot, as explained in the NumPy docs.
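As a small illustration of that definition (a sketch I've added; the 4x3 and 3x5 shapes are arbitrary), np.dot, np.matmul and the @ operator all compute the same row-by-column product for 2-D arrays:

import numpy as np

x = np.random.rand(4, 3)
y = np.random.rand(3, 5)

# Entry (i, j) of the product is the dot product of row i of x with column j of y.
p1 = np.dot(x, y)
p2 = np.matmul(x, y)
p3 = x @ y

assert np.allclose(p1, p2) and np.allclose(p1, p3)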
One thing that you can do to speed things up is to build NumPy against an optimized BLAS library, e.g. ATLAS, GotoBLAS, or Intel's proprietary MKL.
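To check which BLAS/LAPACK your NumPy build is actually linked against, you can print its build configuration (a quick check I'm adding here, not part of the original answer):

import numpy as np

# Prints the BLAS/LAPACK libraries NumPy was built with
# (e.g. MKL, OpenBLAS, ATLAS, or the unoptimized reference BLAS).
np.show_config()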
To calculate the memory needed, you need to monitor Python's Resident Set Size ("RSS"). The following commands were run on a UNIX system (FreeBSD to be precise, on a 64-bit machine).
> ipython
In [1]: import numpy as np
In [2]: a = np.random.rand(1000, 1000)
In [3]: a.dtype
Out[3]: dtype('float64')
In [4]: del(a)
To get the RSS I ran:
ps -xao comm,rss | grep python
[Edit: See the ps manual page for a complete explanation of the options, but basically these ps options make it show only the command and resident set size of all processes. The equivalent format for Linux's ps would be ps -xao c,r, I believe.]
The results are:
with a allocated: 42200 kiB
after del(a): 34364 kiB
Calculating the size:
In [4]: (42200 - 34364) * 1024
Out[4]: 8024064
In [5]: 8024064/(1000*1000)
Out[5]: 8.024064
As you can see, the calculated size matches the 8 bytes per element of the default datatype float64 quite well. The difference is internal overhead.
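You can also get the exact payload size of an array directly from NumPy, without measuring the process at all (a small addition of mine, not from the original answer):

import numpy as np

a = np.random.rand(1000, 1000)
print(a.nbytes)            # 8000000 bytes: 1000 * 1000 * 8 for float64
print(a.nbytes / 1024**2)  # ~7.63 MiB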
The size of your original arrays in MiB will be approximately:
In [11]: 8*1000000*100/1024**2
Out[11]: 762.939453125
In [12]: 8*300000*100/1024**2
Out[12]: 228.8818359375
That's not too bad. However, the dot product will be way too large:
In [19]: 8*1000000*300000/1024**3
Out[19]: 2235.1741790771484
That's 2235 GiB!
What you can do is split up the problem and perform the dot operation in pieces (see the sketch after this list):
- Load b as an ndarray.
- Load each row of a as an ndarray in turn.
- Multiply the row by b and write the result to a file.
- del() the row and load the next row.
This will not make it faster, but it would make it use less memory!
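A minimal sketch of that chunked approach, processing a in blocks of rows so that only a 300000-by-chunk_size slice of the product is ever in memory (chunk_size is a tuning knob I've introduced, not something from the original answer):

import numpy as np

a = np.random.rand(1000000, 100)   # could also be np.load('a.npy', mmap_mode='r')
b = np.random.rand(300000, 100)

chunk_size = 100                   # rows of a processed at once; tune to fit your RAM
top10 = []                         # top-10 indices for each column of c = dot(b, a.T)

for start in range(0, a.shape[0], chunk_size):
    rows = a[start:start + chunk_size]        # (chunk_size, 100)
    block = np.dot(b, rows.T)                 # (300000, chunk_size) slice of c
    # Each column of the block is one column of c; keep the indices of its 10 smallest values.
    top10.extend(np.argsort(block[:, j])[:10] for j in range(block.shape[1]))
    del rows, block

If a and b already live on disk as .npy files, loading them with np.load(..., mmap_mode='r') keeps them out of RAM as well.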
Edit: In this case I would suggest writing the output file in binary format (e.g. using struct or ndarray.tofile). That would make it much easier to read a column from the file with e.g. a numpy.memmap.
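A rough sketch of that write-then-map pattern, with deliberately small placeholder shapes and a filename (c.bin) of my own choosing: each computed block of rows of c.T is appended to a raw binary file with tofile, and the whole file is later opened with numpy.memmap so individual columns of c can be read without loading everything.

import numpy as np

n_a, n_b, d = 1000, 500, 100       # placeholder sizes; the real ones are far larger
a = np.random.rand(n_a, d)
b = np.random.rand(n_b, d)

chunk_size = 100
with open("c.bin", "wb") as f:
    for start in range(0, n_a, chunk_size):
        block = np.dot(a[start:start + chunk_size], b.T)  # (chunk_size, n_b) = rows of c.T
        block.tofile(f)                                    # append raw float64 bytes

# Later: map the file as c.T without reading it all into memory.
ct = np.memmap("c.bin", dtype=np.float64, mode="r", shape=(n_a, n_b))
col0 = ct[0]                        # row 0 of c.T == column 0 of c
top10_col0 = np.argsort(col0)[:10]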
DrV and Roland Smith gave good answers; they should be listened to. My answer does nothing more than present one further option: making your data sparse, which can be a complete game-changer.
Sparsity can be extremely powerful. It would transform your O(100 * 300000 * 1000000) operation into an O(k) operation, where k is the number of non-zero elements (sparsity only means that the matrix is largely zero). I know sparsity was mentioned by DrV and disregarded as not applicable, but I would guess it is.
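If the data really is largely zero, here is a sketch of what the product looks like with scipy.sparse; the reduced sizes and the 99.9%-zeros density are assumptions purely for illustration:

import numpy as np
from scipy import sparse

# Assumption for illustration: 99.9% of the entries are zero.
a = sparse.random(100000, 100, density=0.001, format="csr")
b = sparse.random(30000, 100, density=0.001, format="csr")

c = b.dot(a.T)            # sparse-sparse product; cost scales with the non-zeros
print(c.shape, c.nnz)     # nnz = number of stored non-zero entries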
All that needs to be done is to find a sparse representation for computing this transform (and interpreting the results is another ball game). Easy (and fast) methods include the Fourier transform or wavelet transform (both rely on similarity between matrix elements) but this problem is generalizable through several different algorithms.
From my experience with problems like this, it smells like a relatively common problem that is typically solved through some clever trick. In a field like machine learning, where these types of problems are classified as "simple", that is often the case.