
Efficient way for appending numpy array

I will keep it simple. I have a loop that appends a new row to a NumPy array. What is the efficient way to do this?

import numpy as np

n = np.zeros([1, 2])
for x in [[2, 3], [4, 5], [7, 6]]:
    n = np.append(n, [x], axis=0)

Now the thing is there is a [0, 0] row stuck at the front, so I have to remove it with

   n = np.delete(n, 0, axis=0)

which seems dumb, so please tell me an efficient way to do this.

   n = np.empty([1, 2])

is even worse: it creates uninitialised values.

asked Jun 26 '14 by user3443615



1 Answer

A bit of technical explanation for the "why lists" part.

Internally, the problem for a list of unknown length is that it needs to fit in memory somehow regardless of its length. There are essentially two different possibilities:

  1. Use a data structure (linked list, some tree structure, etc.) which makes it possible to allocate memory separately for each new element in a list.

  2. Store the data in a contiguous memory area. This area has to be allocated when the list is created, and it has to be larger than what we initially need. If we get more stuff into the list, we need to try to allocate more memory, preferably at the same location. If we cannot do it at the same location, we need to allocate a bigger block and move all data.

The first approach enables all sorts of fancy insertion and deletion options, sorting, etc. However, it is slower in sequential reading and allocates more memory. Python actually uses method #2: lists are stored as "dynamic arrays". For more information on this, please see:

Size of list in memory
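The over-allocation behind method #2 can be observed directly. This sketch (my own illustration, not from the linked post) watches sys.getsizeof while appending: the reported size stays flat for several appends and then jumps, because CPython grows the backing array in chunks so that append is amortized O(1).

```python
import sys

lst = []
sizes = []
for i in range(50):
    sizes.append(sys.getsizeof(lst))
    lst.append(i)

# sizes is non-decreasing and contains runs of repeated values:
# each flat run is spare capacity from the previous over-allocation.
print(sizes[:12])
```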

What this means is that lists are designed to be very efficient with the use of append. There is very little you can do to speed things up if you do not know the size of the list beforehand.


If you know even the maximum size of the list beforehand, you are probably best off allocating a numpy.array using numpy.empty (not numpy.zeros) with the maximum size and then use ndarray.resize to shrink the array once you have filled in all data.

For some reason numpy.array(l) where l is a list is often slow with large lists, whereas copying even large arrays is quite fast (I just tried to create a copy of a 100 000 000 element array; it took less than 0.5 seconds).

This discussion has more benchmarking on different options:

Fastest way to grow a numpy numeric array

I have not benchmarked the numpy.empty + ndarray.resize combo, but both should be microsecond rather than millisecond operations.

answered Oct 08 '22 by DrV