I looked at map's source code, which basically keeps creating lazy sequences. I would have thought that iterating over a collection and adding to a transient vector would be faster, but clearly it isn't. What don't I understand about Clojure's performance behavior?
;=> (time (do-with / (range 1 1000) (range 1 1000)))
;"Elapsed time: 23.1808 msecs"
;
; vs
;=> (time (doall (map #(/ %1 %2) (range 1 1000) (range 1 1000))))
;"Elapsed time: 2.604174 msecs"
(defn do-with
  [fn coll1 coll2]
  (let [end (count coll1)]
    (loop [i 0
           res (transient [])]
      (if (= i end)
        (persistent! res)
        (let [x (nth coll1 i)
              y (nth coll2 i)
              r (fn x y)]
          (recur (inc i) (conj! res r)))))))
In order of conjectured impact on relative results:
Your do-with function uses nth to access the individual items in the input collections. nth operates in linear time on ranges, making do-with quadratic. Needless to say, this will kill performance on large collections.
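Since nth on a vector is effectively constant time, one way to see the cost is a hypothetical variant (call it do-with-vec; it is not the rewrite given further down) that pours the inputs into vectors up front, which already restores linear overall complexity:

(defn do-with-vec
  ;; hypothetical variant, not from the answer below: converting the inputs
  ;; to vectors first makes each nth call effectively constant time, so the
  ;; loop as a whole stays linear in the input size
  [f coll1 coll2]
  (let [v1  (vec coll1)
        v2  (vec coll2)
        end (count v1)]
    (loop [i 0
           res (transient [])]
      (if (= i end)
        (persistent! res)
        (recur (inc i) (conj! res (f (nth v1 i) (nth v2 i))))))))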
range produces chunked seqs and map handles those extremely efficiently. (Essentially it produces chunks of up to 32 elements -- here it will in fact be exactly 32 -- by running a tight loop over the internal array of each input chunk in turn, placing results in internal arrays of output chunks.)
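You can check the chunking claim at the REPL with chunked-seq? and chunk-first from clojure.core; the chunk size of 32 is an implementation detail and could differ between Clojure versions:

(chunked-seq? (seq (range 1 1000)))        ;=> true
(count (chunk-first (seq (range 1 1000)))) ;=> 32 on current Clojure versions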
Benchmarking with time doesn't give you steady-state performance. (Which is why one should really use a proper benchmarking library; in the case of Clojure, Criterium is the standard solution.)
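For reference, a minimal Criterium setup along these lines (assuming the criterium dependency is on the classpath) might look like:

(require '[criterium.core :refer [quick-bench]])

;; quick-bench runs warm-up iterations to reach steady state before
;; measuring, unlike a one-off call to time
(quick-bench (doall (map / (range 1 1000) (range 1 1000))))
(quick-bench (do-with / (range 1 1000) (range 1 1000)))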
Incidentally, (map #(/ %1 %2) xs ys) can simply be written as (map / xs ys).
Update:
I've benchmarked the map version, the original do-with and a new do-with version with Criterium, using (range 1 1000) as both inputs in each case (as in the question text), obtaining the following mean execution times:
;;; (range 1 1000)
new do-with 170.383334 µs
(doall (map ...)) 230.756753 µs
original do-with 15.624444 ms
Additionally, I've repeated the benchmark using a vector stored in a Var as input rather than ranges (that is, with (def r (vec (range 1 1000))) at the start and using r as both collection arguments in each benchmark). Unsurprisingly, the original do-with came in first -- nth is very fast on vectors (plus using nth with a vector avoids all the intermediate allocations involved in seq traversal).
;;; (vec (range 1 1000))
original do-with 73.975419 µs
new do-with 87.399952 µs
(doall (map ...)) 153.493128 µs
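The vector benchmark can be reproduced with a setup along these lines, using quick-bench as above (here do-with stands for whichever variant -- original or new -- is under test):

(def r (vec (range 1 1000)))

;; run each variant with the same vector bound to both arguments
(quick-bench (do-with / r r))
(quick-bench (doall (map / r r)))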
Here's the new do-with with linear time complexity:
(defn do-with [f xs ys]
  (loop [xs (seq xs)
         ys (seq ys)
         ret (transient [])]
    (if (and xs ys)
      (recur (next xs)
             (next ys)
             (conj! ret (f (first xs) (first ys))))
      (persistent! ret))))
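A quick sanity check that the rewrite produces the same results as the map version (sequential collections with equal elements compare equal in Clojure, so the vector vs. seq difference doesn't matter here):

(= (do-with / (range 1 1000) (range 1 1000))
   (map / (range 1 1000) (range 1 1000)))
;=> true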