How do I stack column-wise <code>n</code> vectors of shape <code>(x,)</code> where x could be any number? For example, <pre class="prettyprint"><code>from numpy import * a = ones((3,)) b = ones((2,)) c = vstack((a,b)) # <-- gives an error c = vstack((a[:,newaxis],b[:,newaxis])) #<-- also gives an error </code></pre> <code>hstack</code> works fine but concatenates along the wrong dimension.

Short answer: you can't. NumPy does not support jagged arrays natively. Long answer: <pre class="prettyprint"><code>>>> a = ones((3,)) >>> b = ones((2,)) >>> c = array([a, b]) >>> c array([[ 1. 1. 1.], [ 1. 1.]], dtype=object) </code></pre> gives an array that may or may not behave as you expect. E.g. it doesn't support basic methods like <code>sum</code> or <code>reshape</code>, and you should treat this much as you'd treat the ordinary Python list <code>[a, b]</code> (iterate over it to perform operations instead of using vectorized idioms). Several possible workarounds exist; the easiest is to coerce <code>a</code> and <code>b</code> to a common length, perhaps using masked arrays or NaN to signal that some indices are invalid in some rows. E.g. here's <code>b</code> as a masked array: <pre class="prettyprint"><code>>>> ma.array(np.resize(b, a.shape[0]), mask=[False, False, True]) masked_array(data = [1.0 1.0 --], mask = [False False True], fill_value = 1e+20) </code></pre> This can be stacked with <code>a</code> as follows: <pre class="prettyprint"><code>>>> ma.vstack([a, ma.array(np.resize(b, a.shape[0]), mask=[False, False, True])]) masked_array(data = [[1.0 1.0 1.0] [1.0 1.0 --]], mask = [[False False False] [False False True]], fill_value = 1e+20) </code></pre> (For some purposes, <code>scipy.sparse</code> may also be interesting.)

How do I stack vectors of different lengths in NumPy?

Tags:

python

numpy

How do I stack column-wise n vectors of shape (x,) where x could be any number?

For example,

from numpy import * a = ones((3,)) b = ones((2,))  c = vstack((a,b)) # <-- gives an error c = vstack((a[:,newaxis],b[:,newaxis])) #<-- also gives an error

hstack works fine but concatenates along the wrong dimension.

913

asked Feb 16 '13 23:02

mac389

2 Answers

Short answer: you can't. NumPy does not support jagged arrays natively.

Long answer:

>>> a = ones((3,)) >>> b = ones((2,)) >>> c = array([a, b]) >>> c array([[ 1.  1.  1.], [ 1.  1.]], dtype=object)

gives an array that may or may not behave as you expect. E.g. it doesn't support basic methods like sum or reshape, and you should treat this much as you'd treat the ordinary Python list [a, b] (iterate over it to perform operations instead of using vectorized idioms).

Several possible workarounds exist; the easiest is to coerce a and b to a common length, perhaps using masked arrays or NaN to signal that some indices are invalid in some rows. E.g. here's b as a masked array:

>>> ma.array(np.resize(b, a.shape[0]), mask=[False, False, True]) masked_array(data = [1.0 1.0 --],              mask = [False False  True],        fill_value = 1e+20)

This can be stacked with a as follows:

>>> ma.vstack([a, ma.array(np.resize(b, a.shape[0]), mask=[False, False, True])]) masked_array(data =  [[1.0 1.0 1.0]  [1.0 1.0 --]],              mask =  [[False False False]  [False False  True]],        fill_value = 1e+20)

(For some purposes, scipy.sparse may also be interesting.)

answered Oct 05 '22 17:10

Fred Foo

In general, there is an ambiguity in putting together arrays of different length because alignment of data might matter. Pandas has different advanced solutions to deal with that, e.g. to merge series into dataFrames.

If you just want to populate columns starting from first element, what I usually do is build a matrix and populate columns. Of course you need to fill the empty spaces in the matrix with a null value (in this case np.nan)

a = ones((3,)) b = ones((2,)) arraylist=[a,b]  outarr=np.ones((np.max([len(ps) for ps in arraylist]),len(arraylist)))*np.nan #define empty array for i,c in enumerate(arraylist):  #populate columns     outarr[:len(c),i]=c  In [108]: outarr Out[108]:  array([[  1.,   1.],        [  1.,   1.],        [  1.,  nan]])

answered Oct 05 '22 19:10

Vincenzooo

Related questions
                            
                                Maximum size of pandas dataframe
                            
                                Test Flask render_template() context
                            
                                Filter rows of a numpy array?
                            
                                Why/When in Python does `x==y` call `y.__eq__(x)`?
                            
                                Python `for` syntax: block code vs single line generator expressions
                            
                                What's different between Python and Javascript regular expressions?
                            
                                "Flat is better than nested" - for data as well as code?
                            
                                How do I run long term (infinite) Python processes?
                            
                                Python: Getting the error message of an exception
                            
                                In Python, can I specify a function argument's default in terms of other arguments?
                            
                                make matplotlib plotting window pop up as the active one
                            
                                Remove an imported python module [duplicate]
                            
                                Python: Passing parameters by name along with kwargs
                            
                                Cython: cimport and import numpy as (both) np
                            
                                How can modify request.data in django REST framework
                            
                                Difference between io.open vs open in python
                            
                                Where do I find the python standard library code?
                            
                                Python: why pickle?
                            
                                PIP: Installing only the dependencies
                            
                                How do I use the unittest setUpClass method()?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With