I have two arrays, for example with shape <code>(3,2)</code> and the other with shape <code>(10,7)</code>. I want all combinations of the two arrays such that I end up with a 9 column array. In other words, I want all combinations of each row of the first array with the rows of the second array. How can I do this? I am not using meshgrid correctly as far as I can tell. Based on previous posts, I was under the impression that <pre class="prettyprint"><code>a1 = np.zeros((10,7)) a2 = np.zeros((3,2)) r = np.array(np.meshgrid(a1, a2)).T.reshape(-1, a1.shape[1] + a2.shape[1]) </code></pre> would work, but that gives me dimensions of (84,10).

<h3>Approach #1</h3> With focus on performance here's one approach with <code>array-initialization</code> and <code>element-broadcasting</code> for assignments - <pre class="prettyprint"><code>m1,n1 = a1.shape m2,n2 = a2.shape out = np.zeros((m1,m2,n1+n2),dtype=int) out[:,:,:n1] = a1[:,None,:] out[:,:,n1:] = a2 out.shape = (m1*m2,-1) </code></pre> <h3>Explanation :</h3> The trick lies in the two steps : <pre class="prettyprint"><code>out[:,:,:n1] = a1[:,None,:] out[:,:,n1:] = a2 </code></pre> Step #1 : <pre class="prettyprint"><code>In [227]: np.random.seed(0) In [228]: a1 = np.random.randint(1,9,(3,2)) In [229]: a2 = np.random.randint(1,9,(2,7)) In [230]: m1,n1 = a1.shape ...: m2,n2 = a2.shape ...: out = np.zeros((m1,m2,n1+n2),dtype=int) ...: In [231]: out[:,:,:n1] = a1[:,None,:] In [232]: out[:,:,:n1] Out[232]: array([[[5, 8], [5, 8]], [[6, 1], [6, 1]], [[4, 4], [4, 4]]]) In [233]: a1[:,None,:] Out[233]: array([[[5, 8]], [[6, 1]], [[4, 4]]]) </code></pre> So, basically we are assigning the elements of <code>a1</code> keeping the first axis aligned with the corresponding one of the output, while letting the elements along the second axis of the output array being filled in a broadcasted manner corresponding to the <code>newaxis</code> being added for <code>a1</code> along that axis. This is the crux here and brings about performance because we are not allocating extra memory space, which we would need otherwise with explicit repeating/tiling methods. Step #2 : <pre class="prettyprint"><code>In [237]: out[:,:,n1:] = a2 In [238]: out[:,:,n1:] Out[238]: array([[[4, 8, 2, 4, 6, 3, 5], [8, 7, 1, 1, 5, 3, 2]], [[4, 8, 2, 4, 6, 3, 5], [8, 7, 1, 1, 5, 3, 2]], [[4, 8, 2, 4, 6, 3, 5], [8, 7, 1, 1, 5, 3, 2]]]) In [239]: a2 Out[239]: array([[4, 8, 2, 4, 6, 3, 5], [8, 7, 1, 1, 5, 3, 2]]) </code></pre> Here, we are basically broadcasting that block <code>a2</code> along the first axis of the output array without explicitly making repeated copies. Sample input, output for completeness - <pre class="prettyprint"><code>In [242]: a1 Out[242]: array([[5, 8], [6, 1], [4, 4]]) In [243]: a2 Out[243]: array([[4, 8, 2, 4, 6, 3, 5], [8, 7, 1, 1, 5, 3, 2]]) In [244]: out Out[244]: array([[[5, 8, 4, 8, 2, 4, 6, 3, 5], [5, 8, 8, 7, 1, 1, 5, 3, 2]], [[6, 1, 4, 8, 2, 4, 6, 3, 5], [6, 1, 8, 7, 1, 1, 5, 3, 2]], [[4, 4, 4, 8, 2, 4, 6, 3, 5], [4, 4, 8, 7, 1, 1, 5, 3, 2]]]) </code></pre> <h3>Approach #2</h3> Another with <code>tiling/repeating</code> - <pre class="prettyprint"><code>parte1 = np.repeat(a1[:,None,:],m2,axis=0).reshape(-1,m2) parte2 = np.repeat(a2[None],m1,axis=0).reshape(-1,n2) out = np.c_[parte1, parte2] </code></pre>

Combination of all rows in two numpy arrays

Tags:

python

arrays

numpy

I have two arrays, for example with shape (3,2) and the other with shape (10,7). I want all combinations of the two arrays such that I end up with a 9 column array. In other words, I want all combinations of each row of the first array with the rows of the second array.

How can I do this? I am not using meshgrid correctly as far as I can tell.

Based on previous posts, I was under the impression that

a1 = np.zeros((10,7))
a2 = np.zeros((3,2))
r = np.array(np.meshgrid(a1, a2)).T.reshape(-1, a1.shape[1] + a2.shape[1])

would work, but that gives me dimensions of (84,10).

404

asked Nov 06 '17 18:11

Ian Fiddes

2 Answers

Approach #1

With focus on performance here's one approach with array-initialization and element-broadcasting for assignments -

m1,n1 = a1.shape
m2,n2 = a2.shape
out = np.zeros((m1,m2,n1+n2),dtype=int)
out[:,:,:n1] = a1[:,None,:]
out[:,:,n1:] = a2
out.shape = (m1*m2,-1)

Explanation :

The trick lies in the two steps :

out[:,:,:n1] = a1[:,None,:]
out[:,:,n1:] = a2

Step #1 :

In [227]: np.random.seed(0)

In [228]: a1 = np.random.randint(1,9,(3,2))

In [229]: a2 = np.random.randint(1,9,(2,7))

In [230]: m1,n1 = a1.shape
     ...: m2,n2 = a2.shape
     ...: out = np.zeros((m1,m2,n1+n2),dtype=int)
     ...: 

In [231]: out[:,:,:n1] = a1[:,None,:]

In [232]: out[:,:,:n1]
Out[232]: 
array([[[5, 8],
        [5, 8]],

       [[6, 1],
        [6, 1]],

       [[4, 4],
        [4, 4]]])

In [233]: a1[:,None,:]
Out[233]: 
array([[[5, 8]],

       [[6, 1]],

       [[4, 4]]])

So, basically we are assigning the elements of a1 keeping the first axis aligned with the corresponding one of the output, while letting the elements along the second axis of the output array being filled in a broadcasted manner corresponding to the newaxis being added for a1 along that axis. This is the crux here and brings about performance because we are not allocating extra memory space, which we would need otherwise with explicit repeating/tiling methods.

Step #2 :

In [237]: out[:,:,n1:] = a2

In [238]: out[:,:,n1:]
Out[238]: 
array([[[4, 8, 2, 4, 6, 3, 5],
        [8, 7, 1, 1, 5, 3, 2]],

       [[4, 8, 2, 4, 6, 3, 5],
        [8, 7, 1, 1, 5, 3, 2]],

       [[4, 8, 2, 4, 6, 3, 5],
        [8, 7, 1, 1, 5, 3, 2]]])

In [239]: a2
Out[239]: 
array([[4, 8, 2, 4, 6, 3, 5],
       [8, 7, 1, 1, 5, 3, 2]])

Here, we are basically broadcasting that block a2 along the first axis of the output array without explicitly making repeated copies.

Sample input, output for completeness -

In [242]: a1
Out[242]: 
array([[5, 8],
       [6, 1],
       [4, 4]])

In [243]: a2
Out[243]: 
array([[4, 8, 2, 4, 6, 3, 5],
       [8, 7, 1, 1, 5, 3, 2]])

In [244]: out
Out[244]: 
array([[[5, 8, 4, 8, 2, 4, 6, 3, 5],
        [5, 8, 8, 7, 1, 1, 5, 3, 2]],

       [[6, 1, 4, 8, 2, 4, 6, 3, 5],
        [6, 1, 8, 7, 1, 1, 5, 3, 2]],

       [[4, 4, 4, 8, 2, 4, 6, 3, 5],
        [4, 4, 8, 7, 1, 1, 5, 3, 2]]])

Approach #2

Another with tiling/repeating -

parte1 = np.repeat(a1[:,None,:],m2,axis=0).reshape(-1,m2)
parte2 = np.repeat(a2[None],m1,axis=0).reshape(-1,n2)
out = np.c_[parte1, parte2]

144

answered Oct 12 '22 21:10

Divakar

A solution with np.tile and np.repeat :

a1 = np.arange(20).reshape(5,4)
a2 = np.arange(6).reshape(3,2)

res=hstack((np.tile(a1,(len(a2),1)),np.repeat(a2,len(a1),0)))

# array([[ 0,  1,  2,  3,  0,  1],
#        [ 4,  5,  6,  7,  0,  1],
#        [ 8,  9, 10, 11,  0,  1],
#        [12, 13, 14, 15,  0,  1],
#        [16, 17, 18, 19,  0,  1],
#        [ 0,  1,  2,  3,  2,  3],
#        [ 4,  5,  6,  7,  2,  3],
#        [ 8,  9, 10, 11,  2,  3],
#        [12, 13, 14, 15,  2,  3],
#        [16, 17, 18, 19,  2,  3],
#        [ 0,  1,  2,  3,  4,  5],
#        [ 4,  5,  6,  7,  4,  5],
#        [ 8,  9, 10, 11,  4,  5],
#        [12, 13, 14, 15,  4,  5],
#        [16, 17, 18, 19,  4,  5]])

answered Oct 12 '22 22:10

B. M.

Related questions
                            
                                PyQtGraph opening then closing straight away
                            
                                Python asyncio: Running subprocess_exec on a worker thread
                            
                                PyCharm / PyQt: How to obtain code completion with dynamically loaded ui files
                            
                                Tensorflow - Retrieve each character in a string tensor
                            
                                Unpacking PyInstaller packed files
                            
                                Unable to install gdal in python 3.6.1 in macos Sierra
                            
                                Moving numpy arrays from VBA to Python and back
                            
                                Creating executable from Python script while acquiring small output size
                            
                                Pytest - python testing with asyncio
                            
                                Django ORM, how to use values() and still work with choicefield?
                            
                                Error installing PyICU in python 3.6.2 recently installed package
                            
                                Reading stdin line at a time in Python
                            
                                Python package setup script install binary executable
                            
                                Take every nth block from list
                            
                                Require and option only if a choice is made when using click
                            
                                How to determine if a module name is part of python standard library
                            
                                Python refuses to iterate through lines in a file more than once [duplicate]
                            
                                Slack bot, register click on message button
                            
                                Run synchronous pull in Google Cloud Pub/Sub with the Python client API
                            
                                Python base image vs Ubuntu base image installing python separately in docker

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With