Why does hstack() copy data but hsplit() create a view on it?

Tags:

numpy

In NumPy, why does hstack() copy the data from the arrays being stacked:

import numpy as np

A, B = np.array([1, 2]), np.array([3, 4])
C = np.hstack((A, B))
A[0] = 99

gives for C:

array([1, 2, 3, 4])

whereas hsplit() creates a view on the data:

a = np.array(((1, 2), (3, 4)))
b, c = np.hsplit(a, 2)
a[0, 0] = 99

gives for b:

array([[99],
       [ 3]])

What is the reasoning behind this behaviour, which I find inconsistent and hard to remember? I accept that it happens because it is coded that way, but I would like to understand why.


asked Mar 24 '14 17:03

2 Answers

The underlying ndarray data structure holds only a single pointer to the start of its data in memory, plus stride information describing how many bytes to step to move along each dimension. Two separately created arrays generally live at unrelated memory locations, so if you concatenate them there is no single pointer-plus-strides description that can reach both blocks; the result has to be a new block holding a copy. On the other hand, if you split an array into two arrays, each piece can simply store a pointer to its own first element (which is somewhere inside the original array) and reuse the original strides.
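As a rough illustration of the pointer-plus-strides idea, the sketch below inspects the data address and strides of an array and of the pieces returned by hsplit(). It assumes a C-contiguous int64 array and reads the address through `__array_interface__`; the variable names are only for illustration.

```python
import numpy as np

# Fix the dtype so the item size (8 bytes) is the same on every platform.
a = np.array(((1, 2), (3, 4)), dtype=np.int64)
b, c = np.hsplit(a, 2)

# Address of the first element of each array's data buffer.
addr_a = a.__array_interface__["data"][0]
addr_b = b.__array_interface__["data"][0]
addr_c = c.__array_interface__["data"][0]

# b starts where a starts; c starts one element further into the same buffer.
offset_b = addr_b - addr_a
offset_c = addr_c - addr_a
print("offsets in bytes:", offset_b, offset_c)

# The row stride of each piece matches the parent's row stride, so moving
# to the next row of b or c still skips a full row of a.
print("row strides:", a.strides[0], b.strides[0], c.strides[0])
```

Each piece is described by nothing more than a different starting address into the buffer of a, together with the same step sizes.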

The ndarray structure itself is implemented in C in the NumPy source, and there is a good discussion at:

http://scipy-lectures.github.io/advanced/advanced_numpy/index.html#life-of-ndarray


answered Sep 28 '22 19:09

JoshAdel

NumPy generally tries to create views whenever possible, since memory copies are expensive and can quickly eat up a lot of cycles.

hsplit splits the input array into multiple output arrays. The output arrays can each be views into a portion of the original parent array (since they are basically simple slices). Thus, for efficiency, NumPy creates views, instead of copies.
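A small sketch of the "simple slices" point: the pieces returned by hsplit() can be compared against hand-written column slices of the same array, and np.shares_memory() can confirm that they share the parent's buffer. The names b_slice and c_slice are introduced here only for the comparison.

```python
import numpy as np

a = np.array(((1, 2), (3, 4)))
b, c = np.hsplit(a, 2)

# The same two halves written as ordinary column slices of a.
b_slice, c_slice = a[:, :1], a[:, 1:]
same_values = np.array_equal(b, b_slice) and np.array_equal(c, c_slice)

# Both pieces share memory with the parent array, i.e. they are views.
shares = np.shares_memory(a, b) and np.shares_memory(a, c)

# Writing through the parent is therefore visible in the view.
a[0, 0] = 99
print(same_values, shares, b[0, 0])
```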

hstack combines two completely separate arrays into a single output array. The underlying array implementation cannot handle two separate data sources in a single array, so there is no way to share the data with the original. Thus, NumPy is forced to create a copy.
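The copy can be checked in the same way. This sketch reuses the arrays from the question and asks NumPy whether the stacked result shares memory with either input and whether it owns its own buffer.

```python
import numpy as np

A, B = np.array([1, 2]), np.array([3, 4])
C = np.hstack((A, B))

# The result shares no memory with either input ...
independent = not (np.shares_memory(C, A) or np.shares_memory(C, B))

# ... and it owns a freshly allocated buffer of its own.
owns_data = bool(C.flags.owndata)

# Modifying an input afterwards leaves the stacked copy untouched.
A[0] = 99
print(independent, owns_data, C.tolist())
```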

answered Sep 28 '22 18:09

nneonneo
