<p>In <em>Mathematica</em> there are a number of functions that return not only the final result or a single match, but all results. Such functions are named <code>*List</code>. Exhibit:</p> <ul> <li>FoldList</li> <li>NestList</li> <li>ReplaceList</li> <li>ComposeList</li> </ul> <h3>Something that I am missing is a MapList function.</h3> <p>For example, I want:</p> <pre class="prettyprint"><code>MapList[f, {1, 2, 3, 4}] </code></pre> <pre class="prettyprint">{{f[1], 2, 3, 4}, {1, f[2], 3, 4}, {1, 2, f[3], 4}, {1, 2, 3, f[4]}}</pre> <p>I want a list element for each application of the function:</p> <pre class="prettyprint"><code>MapList[ f, {h[1, 2], {4, Sin[x]}}, {2} ] // Column </code></pre> <pre class="prettyprint">{h[f[1], 2], {4, Sin[x]}} {h[1, f[2]], {4, Sin[x]}} {h[1, 2], {f[4], Sin[x]}} {h[1, 2], {4, f[Sin[x]]}}</pre> <hr> <h3>One may implement this as:</h3> <pre class="prettyprint"><code>MapList[f_, expr_, level_: 1] := MapAt[f, expr, #] & /@ Position[expr, _, level, Heads -> False] </code></pre> <h3>However, it is quite inefficient. Consider this simple case, and compare these timings:</h3> <pre class="prettyprint"><code>a = Range@1000; #^2 & /@ a // timeAvg MapList[#^2 &, a] // timeAvg ConstantArray[#^2 & /@ a, 1000] // timeAvg 0.00005088 0.01436 0.0003744 </code></pre> <p>This illustrates that on average <code>MapList</code> is about 38X slower than the combined total of mapping the function to every element in the list and creating a 1000x1000 array.</p> <hr> <h3>Therefore, how may MapList be most efficiently implemented?</h3>

<p>I suspect that <code>MapList</code> is nearing the performance limit for any transformation that performs structural modification. The existing target benchmarks are not really fair comparisons. The <code>Map</code> example is creating a simple vector of integers. The <code>ConstantArray</code> example is creating a simple vector of shared references to the same list. <code>MapList</code> shows poorly against these examples because it is creating a vector where each element is a freshly generated, unshared, data structure.</p> <p>I have added two more benchmarks below. In both cases each element of the result is a packed array. The <code>Array</code> case generates new elements by performing <code>Listable</code> addition on <code>a</code>. The <code>Module</code> case generates new elements by replacing a single value in a copy of <code>a</code>. These results are as follows:</p> <pre class="prettyprint"><code>In[8]:= a = Range@1000; #^2 & /@ a // timeAvg MapList[#^2 &, a] // timeAvg ConstantArray[#^2 & /@ a, 1000] // timeAvg Array[a+# &, 1000] // timeAvg Module[{c}, Table[c = a; c[[i]] = c[[i]]^2; c, {i, 1000}]] // timeAvg Out[9]= 0.0005504 Out[10]= 0.0966 Out[11]= 0.003624 Out[12]= 0.0156 Out[13]= 0.02308 </code></pre> <p>Note how the new benchmarks perform much more like <code>MapList</code> and less like the <code>Map</code> or <code>ConstantArray</code> examples. This seems to show that there is not much scope for dramatically improving the performance of <code>MapList</code> without some deep kernel magic. We can shave a bit of time from <code>MapList</code> thus:</p> <pre class="prettyprint"><code>MapListWR4[f_, expr_, level_: {1}] := Module[{positions, replacements} , positions = Position[expr, _, level, Heads -> False] ; replacements = # -> f[Extract[expr, #]] & /@ positions ; ReplacePart[expr, #] & /@ replacements ] </code></pre> <p>Which yields these timings:</p> <pre class="prettyprint"><code>In[15]:= a = Range@1000; #^2 & /@ a // timeAvg MapListWR4[#^2 &, a] // timeAvg ConstantArray[#^2 & /@ a, 1000] // timeAvg Array[a+# &, 1000] // timeAvg Module[{c}, Table[c = a; c[[i]] = c[[i]]^2; c, {i, 1000}]] // timeAvg Out[16]= 0.0005488 Out[17]= 0.04056 Out[18]= 0.003 Out[19]= 0.015 Out[20]= 0.02372 </code></pre> <p>This comes within factor 2 of the <code>Module</code> case and I expect that further micro-optimizations can close the gap yet more. But it is with eager anticipation that I join you awaiting an answer that shows a further tenfold improvement.</p>

A "MapList" function

Something that I am missing is a MapList function.

For example, I want:

MapList[f, {1, 2, 3, 4}]

{{f[1], 2, 3, 4}, {1, f[2], 3, 4}, {1, 2, f[3], 4}, {1, 2, 3, f[4]}}

I want a list element for each application of the function:

MapList[
  f,
  {h[1, 2], {4, Sin[x]}},
  {2}
] // Column

{h[f[1], 2], {4, Sin[x]}}
{h[1, f[2]], {4, Sin[x]}}
{h[1, 2], {f[4], Sin[x]}}
{h[1, 2], {4, f[Sin[x]]}}

One may implement this as:

MapList[f_, expr_, level_: 1] :=
 MapAt[f, expr, #] & /@
  Position[expr, _, level, Heads -> False]

However, it is quite inefficient. Consider this simple case, and compare these timings:

a = Range@1000;
#^2 & /@ a // timeAvg
MapList[#^2 &, a] // timeAvg
ConstantArray[#^2 & /@ a, 1000] // timeAvg

0.00005088

0.01436

0.0003744

This illustrates that on average MapList is about 38X slower than the combined total of mapping the function to every element in the list and creating a 1000x1000 array.

Therefore, how may MapList be most efficiently implemented?

573

asked Oct 30 '11 11:10

Mr.Wizard

1 Answers

I suspect that MapList is nearing the performance limit for any transformation that performs structural modification. The existing target benchmarks are not really fair comparisons. The Map example is creating a simple vector of integers. The ConstantArray example is creating a simple vector of shared references to the same list. MapList shows poorly against these examples because it is creating a vector where each element is a freshly generated, unshared, data structure.

I have added two more benchmarks below. In both cases each element of the result is a packed array. The Array case generates new elements by performing Listable addition on a. The Module case generates new elements by replacing a single value in a copy of a. These results are as follows:

In[8]:= a = Range@1000;
        #^2 & /@ a // timeAvg
        MapList[#^2 &, a] // timeAvg
        ConstantArray[#^2 & /@ a, 1000] // timeAvg
        Array[a+# &, 1000] // timeAvg
        Module[{c}, Table[c = a; c[[i]] = c[[i]]^2; c, {i, 1000}]] // timeAvg

Out[9]=  0.0005504

Out[10]= 0.0966

Out[11]= 0.003624

Out[12]= 0.0156

Out[13]= 0.02308

Note how the new benchmarks perform much more like MapList and less like the Map or ConstantArray examples. This seems to show that there is not much scope for dramatically improving the performance of MapList without some deep kernel magic. We can shave a bit of time from MapList thus:

MapListWR4[f_, expr_, level_: {1}] :=
  Module[{positions, replacements}
  , positions = Position[expr, _, level, Heads -> False]
  ; replacements = # -> f[Extract[expr, #]] & /@ positions
  ; ReplacePart[expr, #] & /@ replacements
  ]

Which yields these timings:

In[15]:= a = Range@1000;
         #^2 & /@ a // timeAvg
         MapListWR4[#^2 &, a] // timeAvg
         ConstantArray[#^2 & /@ a, 1000] // timeAvg
         Array[a+# &, 1000] // timeAvg
         Module[{c}, Table[c = a; c[[i]] = c[[i]]^2; c, {i, 1000}]] // timeAvg

Out[16]= 0.0005488

Out[17]= 0.04056

Out[18]= 0.003

Out[19]= 0.015

Out[20]= 0.02372

This comes within factor 2 of the Module case and I expect that further micro-optimizations can close the gap yet more. But it is with eager anticipation that I join you awaiting an answer that shows a further tenfold improvement.

186

answered Sep 22 '22 02:09

WReach

Related questions
                            
                                Runtime polymorphism overhead in Julia
                            
                                Make MySQL to choose the best index for a query
                            
                                How to convert the values of a Map of List to a single List
                            
                                Is it an optimization to explicitly initialize undefined object members in JavaScript, given knowledge of the innerworkings of V8/spidermonkey/chakra?
                            
                                Is CPU speed limited by the speed of fetching instructions from memory?
                            
                                How to load an iframe without blocking onload or waiting for onload
                            
                                C++ performance std::array vs std::vector
                            
                                Fastest way for boolean matrix computations
                            
                                Files on XP: Is turning off "last access time" safe?
                            
                                Fortran's performance
                            
                                ways to improve the launch speed of C++ application
                            
                                Trees: Linked Lists vs Arrays (Efficiency)
                            
                                How can I access the C# performance counter in the code?
                            
                                Will commenting-out unused code give my page a performance boost in any way, shape, or form?
                            
                                Efficient 2D drawing in Android
                            
                                How can I optimize for IE?
                            
                                Spring AOP slow startup time
                            
                                jQuery chaining faster than separate statements?
                            
                                std::vector faster than plain array?
                            
                                The Most Efficient Implementation of UniqueQueue and UniqueReplacementQueue Collections in .NET

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

A "MapList" function

Tags:

performance

wolfram-mathematica

map

mathematica-7