Efficiently compute ordering permutations in a NumPy array

Tags: performance, python, arrays, numpy

I've got a NumPy array. What is the fastest way to compute all the permutations of orderings?

What I mean is, given the first element in my array, I want a list of all the elements that sequentially follow it. Then given the second element, a list of all the elements that follow it.

So, given my list below: b, c, and d follow a; c and d follow b; and d follows c.

x = np.array(["a", "b", "c", "d"])

So a potential output looks like:

[
    ["a","b"],
    ["a","c"],
    ["a","d"],

    ["b","c"],
    ["b","d"],

    ["c","d"],
]

I will need to do this several million times so I am looking for an efficient solution.

I tried something like:

im = np.vstack([x]*len(x))
a = np.vstack(([im], [im.T])).T
results = a[np.triu_indices(len(x),1)]

but it's actually slower than looping...
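The loop being compared against is not shown in the question. As a reference point, a plain-Python baseline might look like the sketch below; the function name `ordered_pairs_loop` is hypothetical, and this is only one plausible form of "looping", not necessarily the one that was timed.

```python
import numpy as np


def ordered_pairs_loop(x):
    """Baseline sketch: pair each element with every element that follows it."""
    n = len(x)
    # i walks every position; j walks only the positions after i
    return [[x[i], x[j]] for i in range(n) for j in range(i + 1, n)]


x = np.array(["a", "b", "c", "d"])
loop_pairs = ordered_pairs_loop(x)
print(np.array(loop_pairs).tolist())
```

This produces the pairs in the same order as the desired output above, which makes it a convenient correctness check for faster variants.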

asked Dec 06 '14 18:12

JoeDanger

1 Answer

You can combine itertools.combinations and itertools.chain.from_iterable with np.fromiter for this. It avoids an explicit Python-level loop, although it is still not a pure NumPy solution:

>>> import numpy as np
>>> from itertools import combinations, chain
>>> arr = np.fromiter(chain.from_iterable(combinations(x, 2)), dtype=x.dtype)
>>> arr.reshape(arr.size // 2, 2)  # // keeps the row count an int on Python 3
array([['a', 'b'],
       ['a', 'c'],
       ['a', 'd'],
       ..., 
       ['b', 'c'],
       ['b', 'd'],
       ['c', 'd']], 
      dtype='|S1')
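The same approach can be wrapped in a small self-contained script. This is a minimal sketch assuming Python 3 and a fixed-width string dtype such as the one `np.array(["a", "b", "c", "d"])` produces; the function name `ordered_pairs` is hypothetical.

```python
from itertools import chain, combinations

import numpy as np


def ordered_pairs(x):
    """Return a (k, 2) array of every pair (x[i], x[j]) with i < j."""
    # combinations() yields 2-tuples in input order; chain flattens them
    # into one stream that np.fromiter can consume.
    flat = np.fromiter(chain.from_iterable(combinations(x, 2)), dtype=x.dtype)
    # -1 lets NumPy infer the number of rows, so no division is needed.
    return flat.reshape(-1, 2)


x = np.array(["a", "b", "c", "d"])
pairs = ordered_pairs(x)
print(pairs.tolist())
```

Using `reshape(-1, 2)` sidesteps the integer-versus-float division difference between Python 2 and Python 3.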

Timing comparisons:

>>> x = np.array(["a", "b", "c", "d"]*100)
>>> %%timeit
    im = np.vstack([x]*len(x))
    a = np.vstack(([im], [im.T])).T
    results = a[np.triu_indices(len(x),1)]
... 
10 loops, best of 3: 29.2 ms per loop
>>> %%timeit
    arr = np.fromiter(chain.from_iterable(combinations(x, 2)), dtype=x.dtype)
    arr.reshape(arr.size // 2, 2)
... 
100 loops, best of 3: 6.63 ms per loop
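If a pure NumPy variant is wanted, one option (a different technique from the itertools approach above, and not part of the timings shown) is to apply np.triu_indices to index positions and gather the values with fancy indexing. This is a sketch only; the name `ordered_pairs_numpy` is hypothetical, and it has not been benchmarked here, so measure it on your own data before relying on it.

```python
import numpy as np


def ordered_pairs_numpy(x):
    """Pure-NumPy sketch: index x with the strict upper-triangle indices."""
    # i, j enumerate every index pair with i < j, in row-major order
    i, j = np.triu_indices(len(x), k=1)
    # Gather both columns directly instead of building an n x n x 2 value array
    return np.column_stack((x[i], x[j]))


x = np.array(["a", "b", "c", "d"])
numpy_pairs = ordered_pairs_numpy(x)
print(numpy_pairs.tolist())
```

The row order matches what combinations(x, 2) yields, so the two results can be compared element by element.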

answered Nov 14 '22 21:11

Ashwini Chaudhary
