I'm working on an application in Clojure that needs to multiply large matrices, and I'm running into serious performance problems compared to an identical NumPy version. NumPy can multiply a 1,000,000x23 matrix by its transpose in under a second, while the equivalent Clojure code takes over six minutes. (I can print out the resulting matrix from NumPy, so it's definitely evaluating everything.)
Am I doing something terribly wrong in this Clojure code? Is there some trick of NumPy's that I can try to mimic?
Here's the Python:
import numpy as np
import time

def test_my_mult(n):
    A = np.random.rand(n*23).reshape(n, 23)
    At = A.T
    t0 = time.time()
    res = np.dot(A.T, A)
    print(time.time() - t0)
    print(np.shape(res))
    return res

# Example (returns a 23x23 matrix):
# >>> results = test_my_mult(1000000)
# 0.906938076019
# (23, 23)
And the Clojure:
(defn feature-vec [n]
  (map (partial cons 1)
       (for [x (range n)]
         (take 22 (repeatedly rand)))))

(defn dot-product [x y]
  (reduce + (map * x y)))

(defn transpose
  "returns the transposition of a `coll` of vectors"
  [coll]
  (apply map vector coll))

(defn matrix-mult [mat1 mat2]
  (let [row-mult (fn [mat row]
                   (map (partial dot-product row)
                        (transpose mat)))]
    (map (partial row-mult mat2) mat1)))

(defn test-my-mult [n afn]
  (let [xs  (feature-vec n)
        xst (transpose xs)]
    (time (dorun (afn xst xs)))))

;; Example (yields a 23x23 matrix):
;; (test-my-mult 1000 i/mmult)        => "Elapsed time: 32.626 msecs"
;; (test-my-mult 10000 i/mmult)       => "Elapsed time: 628.841 msecs"
;; (test-my-mult 1000 matrix-mult)    => "Elapsed time: 14.748 msecs"
;; (test-my-mult 10000 matrix-mult)   => "Elapsed time: 434.128 msecs"
;; (test-my-mult 1000000 matrix-mult) => "Elapsed time: 375751.999 msecs"

;; Test from wikipedia
;; (def A [[14 9 3] [2 11 15] [0 12 17] [5 2 3]])
;; (def B [[12 25] [9 10] [8 5]])
;; user> (matrix-mult A B)
;; ((273 455) (243 235) (244 205) (102 160))
UPDATE: I implemented the same benchmark using the JBLAS library and found massive, massive speed improvements. Thanks to everyone for their input! Time to wrap this sucker in Clojure. Here's the new code:
(import '[org.jblas FloatMatrix])

(defn feature-vec [n]
  (FloatMatrix.
   (into-array (for [x (range n)]
                 (float-array (cons 1 (take 22 (repeatedly rand))))))))

(defn test-mult [n]
  (let [xs  (feature-vec n)
        xst (.transpose xs)]
    (time (let [result (.mmul xst xs)]
            [(.rows result) (.columns result)]))))

;; user> (test-mult 10000)
;; "Elapsed time: 6.99 msecs"
;; [23 23]

;; user> (test-mult 100000)
;; "Elapsed time: 43.88 msecs"
;; [23 23]

;; user> (test-mult 1000000)
;; "Elapsed time: 383.439 msecs"
;; [23 23]

(defn matrix-stream [rows cols]
  (repeatedly #(FloatMatrix/randn rows cols)))

(defn square-benchmark
  "Times the multiplication of a square matrix."
  [n]
  (let [[a b c] (matrix-stream n n)]
    (time (.mmuli a b c))
    nil))

;; forma.matrix.jblas> (square-benchmark 10)
;; "Elapsed time: 0.113 msecs"
;; nil
;; forma.matrix.jblas> (square-benchmark 100)
;; "Elapsed time: 0.548 msecs"
;; nil
;; forma.matrix.jblas> (square-benchmark 1000)
;; "Elapsed time: 107.555 msecs"
;; nil
;; forma.matrix.jblas> (square-benchmark 2000)
;; "Elapsed time: 793.022 msecs"
;; nil
Matrix multiplication in NumPy is reasonably fast without any manual optimization. However, if every second counts, it is possible to improve performance significantly (even without a GPU).
NumPy uses a highly optimized, carefully tuned BLAS routine for matrix multiplication (see also: ATLAS). The specific routine in this case is GEMM (general matrix multiplication).
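For reference, GEMM computes C <- alpha*A*B + beta*C. Here's a minimal sketch of those semantics in Clojure, using the question's vector-of-vectors representation (the gemm name and naive loops are mine, not the BLAS API; a real BLAS does the same arithmetic with cache blocking, vectorization, and threading):

(defn gemm
  "Computes alpha*A*B + beta*C for matrices stored as vectors of row vectors."
  [alpha a b beta c]
  (let [b-cols (apply map vector b)]  ; the columns of B
    (map (fn [a-row c-row]
           (map (fn [b-col c-ij]
                  (+ (* alpha (reduce + (map * a-row b-col)))  ; alpha * (A.B)_ij
                     (* beta c-ij)))                           ; + beta * C_ij
                b-cols
                c-row))
         a c)))

;; (gemm 1.0 [[1 2] [3 4]] [[5 6] [7 8]] 0.0 [[0 0] [0 0]])
;; => ((19.0 22.0) (43.0 50.0))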
NumPy matrices are strictly 2-dimensional, while NumPy arrays (ndarrays) are N-dimensional. Matrix objects are a subclass of ndarray, so they inherit all the attributes and methods of ndarrays.
tl;dr: the numpy.matrix class is being deprecated. Some high-profile libraries still depend on it (the largest one being scipy).
The Python version compiles down to a loop in C, while the Clojure version builds a new intermediate lazy sequence for each call to map in this code. The performance difference you see most likely comes from this difference in data structures.
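To make the difference concrete, here's a sketch of the question's dot product written against primitive double arrays instead of lazy sequences (dot-product-prim is my name for it, and it assumes rows are stored as double-arrays):

;; areduce compiles to a primitive loop over the array, so no
;; intermediate sequence is allocated per element.
(defn dot-product-prim ^double [^doubles x ^doubles y]
  (areduce x i acc 0.0
           (+ acc (* (aget x i) (aget y i)))))

;; (dot-product-prim (double-array [1 2 3]) (double-array [4 5 6]))
;; => 32.0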
To do better than this you could use a library like Incanter, or write your own version as explained in this SO question (see also this one, neanderthal, or nd4j). If you really want to stay with sequences to keep the lazy evaluation properties, then you may get a real boost by looking into transients for the internal matrix calculations, as in the sketch below.
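As a rough illustration of the transient idea (row-mult! is a hypothetical helper, not code from the question or the linked answers):

;; Accumulates one result row in a transient vector, so conj! mutates
;; in place instead of allocating a new persistent vector per element.
(defn row-mult! [row cols]
  (persistent!
   (reduce (fn [acc col]
             (conj! acc (reduce + (map * row col))))
           (transient [])
           cols)))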
EDIT: I forgot to add the first step in tuning Clojure: turn on "warn on reflection".
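For example (a minimal sketch; the n-cols function is hypothetical, and :global-vars is the Leiningen way to set this, so adjust for your build tool):

;; In the REPL, or via :global-vars {*warn-on-reflection* true} in project.clj:
(set! *warn-on-reflection* true)

;; Without a hint this interop call is resolved by reflection at
;; runtime, and the compiler now prints a warning:
(defn n-cols [m] (.columns m))

;; A type hint lets the compiler emit a direct, non-reflective access:
(defn n-cols [^org.jblas.FloatMatrix m] (.columns m))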