Optimizing assignment into an array from various arrays - NumPy

Tags:

I have four square matrices with dimension 3Nx3N, called A, B, C and D.

I want to combine them in a single matrix. The code with for loops is the following:

import numpy
N = 3
A = numpy.random.random((3*N, 3*N))
B = numpy.random.random((3*N, 3*N))
C = numpy.random.random((3*N, 3*N))
D = numpy.random.random((3*N, 3*N))

final = numpy.zeros((6*N, 6*N))

for i in range(N):
    for j in range(N):
        for k in range(3):
            for l in range(3):
                final[6*i + k][6*j + l] = A[3*i+k][3*j+l]
                final[6*i + k + 3][6*j + l + 3] = B[3*i+k][3*j+l]
                final[6*i + k + 3][6*j + l] = C[3*i+k][3*j+l]
                final[6*i + k][6*j + l + 3] = D[3*i+k][3*j+l]

Is it possible to write the previous for loops in a numpythonic way?

952

asked Jan 24 '17 16:01

nunodsousa

1 Answers

Great problem for practicing array-slicing into multi-dimensional tensors/arrays!

We will initialize the output array as a multi-dimensional 6D array and simply slice it and assign the four arrays being reshaped as 4D arrays. The intention is avoid any stacking/concatenating as those would be expensive specially when working with large arrays by instead working with reshaping of input arrays, which would be merely views.

Here's the implementation -

out = np.zeros((N,2,3,N,2,3),dtype=A.dtype)
out[:,0,:,:,0,:] = A.reshape(N,3,N,3)
out[:,0,:,:,1,:] = D.reshape(N,3,N,3)
out[:,1,:,:,0,:] = C.reshape(N,3,N,3)
out[:,1,:,:,1,:] = B.reshape(N,3,N,3)
out.shape = (6*N,6*N)

Just to explain a bit more, we had :

            |------------------------ Axes for selecting A, B, C, D
np.zeros((N,2,3,N,2,3),dtype=A.dtype)
                  |------------------------- Axes for selecting A, B, C, D

Thus, those two axes (second and fifth) of lengths (2x2) = 4 were used to select between the four inputs.

Runtime test

Approaches -

def original_app(A, B, C, D):
    final = np.zeros((6*N,6*N),dtype=A.dtype)
    for i in range(N):
        for j in range(N):
            for k in range(3):
                for l in range(3):
                    final[6*i + k][6*j + l] = A[3*i+k][3*j+l]
                    final[6*i + k + 3][6*j + l + 3] = B[3*i+k][3*j+l]
                    final[6*i + k + 3][6*j + l] = C[3*i+k][3*j+l]
                    final[6*i + k][6*j + l + 3] = D[3*i+k][3*j+l]
    return final

def slicing_app(A, B, C, D):
    out = np.zeros((N,2,3,N,2,3),dtype=A.dtype)
    out[:,0,:,:,0,:] = A.reshape(N,3,N,3)
    out[:,0,:,:,1,:] = D.reshape(N,3,N,3)
    out[:,1,:,:,0,:] = C.reshape(N,3,N,3)
    out[:,1,:,:,1,:] = B.reshape(N,3,N,3)
    return out.reshape(6*N,6*N)

Timings and verification -

In [147]: # Setup input arrays
     ...: N = 200
     ...: A = np.random.randint(11,99,(3*N,3*N))
     ...: B = np.random.randint(11,99,(3*N,3*N))
     ...: C = np.random.randint(11,99,(3*N,3*N))
     ...: D = np.random.randint(11,99,(3*N,3*N))
     ...: 

In [148]: np.allclose(slicing_app(A, B, C, D), original_app(A, B, C, D))
Out[148]: True

In [149]: %timeit original_app(A, B, C, D)
1 loops, best of 3: 1.63 s per loop

In [150]: %timeit slicing_app(A, B, C, D)
100 loops, best of 3: 9.26 ms per loop

answered Oct 02 '22 18:10

Divakar

Related questions
                            
                                Get Reddit user comments using PRAW causing TypeError: 'SubListing' object is not callable error
                            
                                Is it possible to inject a module into an imported module's globals?
                            
                                Why packages installed with pip install --user option are not 'visible' from shell?
                            
                                python's shortcut for len(list(filter(lambda x: criteria, iterable)))
                            
                                Flask/Django Server and Bokeh Server
                            
                                Pandas: merge dataframes without creating new columns
                            
                                How to start/stop a Python function within a time period (ex. from 10 am to 12:30pm)?
                            
                                Annualized Return in Pandas
                            
                                How to get a element to stick to the bottom-right corner in Tkinter?
                            
                                Searching one Python dataframe / dictionary for fuzzy matches in another dataframe
                            
                                subprocess.Popen shell=True to shell=False
                            
                                Python, Pandas, Numpy: Date_range: passing a np.timedelta as freq. argument
                            
                                Why doesn't this if statement execute? [closed]
                            
                                How to inject values into the middle of TensorFlow graph?
                            
                                Pandas: union duplicate strings
                            
                                Fine tuning pretrained model in keras
                            
                                Python argparse arguments with repeatable parameter pairs
                            
                                What exactly does the -q option of netcat do?
                            
                                .astype("int") or .astype(int)? Any differences between with and without quote/double?
                            
                                Elastic beanstalk require python 3.5

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Optimizing assignment into an array from various arrays - NumPy

Tags:

python

optimization

vectorization

matrix

numpy

nunodsousa

People also ask

1 Answers

Divakar

Recent Activity

Donate For Us