python - repeating numpy array without replicating data

This question has been asked before, but the solution only works for 1D/2D arrays, and I need a more general answer.

How do you create a repeating array without replicating the data? This seems generally useful, since it would let you vectorize Python operations without the memory cost of a full copy.

More specifically, I have a (y,x) array, which I want to tile multiple times to create a (z,y,x) array. I can do this with numpy.tile(array, (nz,1,1)), but I run out of memory. My specific case has x=1500, y=2000, z=700.

asked May 16 '14 13:05

user3644731

1 Answer

One simple trick is to use np.broadcast_arrays to broadcast your 2D array against a z-long vector along a new first dimension:

import numpy as np

M = np.arange(1500*2000).reshape(1500, 2000)
z = np.zeros(700)

# broadcasting over the first dimension
_, M_broadcast = np.broadcast_arrays(z[:, None, None], M[None, ...])

print(M_broadcast.shape, M_broadcast.flags.owndata)
# (700, 1500, 2000) False
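
If your NumPy is 1.10 or newer, np.broadcast_to should give the same kind of view without needing the dummy z vector. This is a minimal sketch on a small stand-in array (the 3x4 shape, nz = 5 and the name M_view are illustrative, not from the question):

```python
import numpy as np

# Small stand-in for the 2D array; the shapes here are illustrative only.
M = np.arange(3 * 4).reshape(3, 4)
nz = 5

# broadcast_to returns a read-only view with a zero stride along the new axis,
# so every z-slice refers to the same underlying buffer as M.
M_view = np.broadcast_to(M, (nz,) + M.shape)

print(M_view.shape)                 # (5, 3, 4)
print(M_view.strides[0])            # 0
print(np.shares_memory(M_view, M))  # True
print(M_view.flags.writeable)       # False
```

The view is read-only, which is usually what you want here: all z-slices alias the same memory, so a write to one slice would show up in every slice.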

To generalize the stride_tricks method given for a 1D array in this answer, you just need to supply the shape and the stride (in bytes) for each dimension of your output array. A stride of 0 along the new first axis makes every z-slice point at the same memory:

M_strided = np.lib.stride_tricks.as_strided(
                M,                              # input array
                (700, M.shape[0], M.shape[1]),  # output dimensions
                (0, M.strides[0], M.strides[1]) # stride length in bytes
            )
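
To convince yourself that no data is copied, you can compare the view with its source. This is a rough sketch on a tiny array (the 2x3 shape, nz = 4 and the variable names are made up for illustration); writeable=False is an optional safety flag that as_strided accepts in NumPy 1.12 and later:

```python
import numpy as np
from numpy.lib.stride_tricks import as_strided

# Tiny stand-in array with 8-byte elements; shapes are illustrative only.
M = np.arange(6, dtype=np.int64).reshape(2, 3)
nz = 4

# Same construction as above, written with keywords and marked read-only.
M_strided = as_strided(
    M,
    shape=(nz,) + M.shape,
    strides=(0,) + M.strides,  # 0 bytes between consecutive z-slices
    writeable=False,
)

print(np.shares_memory(M_strided, M))  # True
print(M.nbytes)                        # 48, the memory actually held
print(M_strided.nbytes)                # 192, logical size only (size * itemsize)

# Size of a real copy for the shapes in the question, assuming 8-byte elements.
full_copy_gib = 700 * 2000 * 1500 * 8 / 2**30
print(round(full_copy_gib, 1))         # 15.6
```

Keep in mind that the saving only applies to the view itself. Any operation that produces a new full-size result, such as M_strided + 1 or M_strided.copy(), still allocates the whole (z, y, x) array.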

answered Sep 20 '22 10:09

ali_m

Tags: python, memory, large-data, numpy
