From ND to 1D arrays

Tags:

python

numpy

Say I have an array a:

a = np.array([[1,2,3], [4,5,6]])

array([[1, 2, 3],
       [4, 5, 6]])

I would like to convert it to a 1D array (i.e. a column vector):

b = np.reshape(a, (1, np.prod(a.shape)))

but this returns

array([[1, 2, 3, 4, 5, 6]])

which is not the same as:

array([1, 2, 3, 4, 5, 6])

I can take the first element of this array to manually convert it to a 1D array:

b = np.reshape(a, (1, np.prod(a.shape)))[0]

but this requires me to know how many dimensions the original array has (and to chain additional [0] indexing when working with higher dimensions).

Is there a dimensions-independent way of getting a column/row vector from an arbitrary ndarray?

asked Dec 05 '12 18:12

Amelio Vazquez-Reina

4 Answers

Use np.ravel (for a 1D view), np.ndarray.flatten (for a 1D copy), or np.ndarray.flat (for a 1D iterator):

In [12]: a = np.array([[1,2,3], [4,5,6]])

In [13]: b = a.ravel()

In [14]: b
Out[14]: array([1, 2, 3, 4, 5, 6])

Note that ravel() returns a view of a when possible. So modifying b also modifies a. ravel() returns a view when the 1D elements are contiguous in memory, but would return a copy if, for example, a were made from slicing another array using a non-unit step size (e.g. a = x[::2]).
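To make the view-versus-copy distinction concrete, here is a small sketch using np.shares_memory. The array names x, s and r are just illustrative; the strided 2D slice stands in for the "non-unit step size" case described above.

```python
import numpy as np

# Contiguous case: ravel() can return a view.
a = np.array([[1, 2, 3], [4, 5, 6]])
b = a.ravel()
print(np.shares_memory(a, b))   # does b reuse a's buffer?
b[0] = 99                       # writing through the view...
print(a[0, 0])                  # ...is visible in a

# Non-contiguous case: every other column of x.
x = np.arange(12).reshape(3, 4)
s = x[:, ::2]                   # strided slice, not contiguous in memory
r = s.ravel()                   # ravel() has to copy here
print(np.shares_memory(s, r))
r[0] = -1                       # writing to the copy...
print(x[0, 0])                  # ...leaves x untouched
```

So if you need to know whether writes will propagate back, checking np.shares_memory is safer than guessing.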

If you want a copy rather than a view, use

In [15]: c = a.flatten()

If you just want an iterator, use np.ndarray.flat:

In [20]: d = a.flat

In [21]: d
Out[21]: <numpy.flatiter object at 0x8ec2068>

In [22]: list(d)
Out[22]: [1, 2, 3, 4, 5, 6]
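As a side note, the flatiter object is not only for looping: it also supports indexing and assignment by flat (row-major) position. A minimal sketch, reusing the same example array:

```python
import numpy as np

a = np.array([[1, 2, 3], [4, 5, 6]])

# Read a single element by its flat (row-major) position.
fifth = a.flat[4]
print(fifth)

# Assign through flat indices; this modifies a in place.
a.flat[[0, 5]] = 0
print(a)
```

This can be handy when you want 1D-style access without creating a new array object at all.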

answered Nov 15 '22 22:11

unutbu

In [14]: b = np.reshape(a, (np.prod(a.shape),))

In [15]: b
Out[15]: array([1, 2, 3, 4, 5, 6])

or, simply:

In [16]: a.flatten()
Out[16]: array([1, 2, 3, 4, 5, 6])
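Since the question asks for something dimension-independent, here is a short sketch on a 3D array. It uses -1 to let NumPy infer the length, so the number of dimensions never has to be known. The names a3, flat, row and col are made up for the example; the row and column variants are 2D arrays, which is a different thing from a true 1D array.

```python
import numpy as np

a3 = np.arange(24).reshape(2, 3, 4)   # any number of dimensions works

flat = a3.reshape(-1)      # true 1D array, shape (24,)
row = a3.reshape(1, -1)    # 2D "row vector", shape (1, 24)
col = a3.reshape(-1, 1)    # 2D "column vector", shape (24, 1)

print(flat.shape, row.shape, col.shape)
print(flat.shape[0] == a3.size)   # length equals the total element count
```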

answered Nov 16 '22 00:11

NPE

I wanted to see benchmark results for the functions mentioned in the answers, including unutbu's.

I also want to point out that the NumPy documentation suggests arr.reshape(-1) when a view is desired in as many cases as possible (even though ravel is a tad faster in the results below).

TL;DR: np.ravel is the most performant (by a very small amount).

Benchmark

Functions:

np.ravel: returns a view, if possible
np.reshape(-1): returns a view, if possible
ndarray.flatten: always returns a copy
ndarray.flat: returns a numpy.flatiter, an iterator over the array

numpy version: '1.18.0'

Execution times (total seconds for 1000 calls) on different `ndarray` sizes

+-------------+----------+-----------+-----------+-------------+
|  function   |   10x10  |  100x100  | 1000x1000 | 10000x10000 |
+-------------+----------+-----------+-----------+-------------+
| ravel       | 0.002073 |  0.002123 |  0.002153 |    0.002077 |
| reshape(-1) | 0.002612 |  0.002635 |  0.002674 |    0.002701 |
| flatten     | 0.000810 |  0.007467 |  0.587538 |  107.321913 |
| flat        | 0.000337 |  0.000255 |  0.000227 |    0.000216 |
+-------------+----------+-----------+-----------+-------------+

Conclusion

The execution times of ravel and reshape(-1) were consistent and independent of the ndarray size. ravel is a tad faster, but reshape offers more flexibility in the target shape (maybe that's why the NumPy docs suggest it, or there could be cases where reshape returns a view and ravel doesn't).
If you are dealing with large ndarrays, flatten can cause a performance issue because it always copies the data. I recommend avoiding it unless you actually need a copy.
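On the speculation that reshape may return a view where ravel doesn't: a strided 1D slice appears to be one such case. The sketch below checks it with np.shares_memory. This reflects my understanding of current NumPy behaviour (ravel returns a contiguous array, so it copies non-contiguous input), so it is worth re-checking on your own version.

```python
import numpy as np

x = np.arange(10)
s = x[::2]                 # 1D, but strided (not contiguous)

via_reshape = s.reshape(-1)
via_ravel = s.ravel()

# reshape(-1) can keep the stride and stay a view of s...
print(np.shares_memory(s, via_reshape))
# ...while ravel() returns a contiguous result, which requires a copy here.
print(np.shares_memory(s, via_ravel))
print(via_ravel.flags["C_CONTIGUOUS"])
```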

Used code

import timeit
setup = '''
import numpy as np
nd = np.random.randint(10, size=(10, 10))
'''

timeit.timeit('nd = np.reshape(nd, -1)', setup=setup, number=1000)
timeit.timeit('nd = np.ravel(nd)', setup=setup, number=1000)
timeit.timeit('nd = nd.flatten()', setup=setup, number=1000)
timeit.timeit('nd.flat', setup=setup, number=1000)
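One caveat about the code above: the timed statements rebind nd, so after the first iteration ravel and reshape operate on an array that is already 1D, and the nd.flat line only creates the iterator without walking it. Below is a variant that avoids both, as a rough sketch rather than a rigorous benchmark. The array size, the repeat count of 200, and the names candidates and results are arbitrary choices for illustration.

```python
import timeit

import numpy as np

nd = np.random.randint(10, size=(100, 100))

# Each candidate reads nd without rebinding it, so every call sees the 2D array.
candidates = {
    "ravel": lambda: nd.ravel(),
    "reshape(-1)": lambda: nd.reshape(-1),
    "flatten": lambda: nd.flatten(),
    "flat (iterated)": lambda: list(nd.flat),  # actually consume the iterator
}

# Total seconds for 200 calls of each candidate.
results = {name: timeit.timeit(fn, number=200) for name, fn in candidates.items()}

for name, seconds in results.items():
    print(f"{name:16s} {seconds:.6f} s for 200 calls")
```

Absolute numbers will depend on the machine and the NumPy version, so only the relative ordering is meaningful.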

answered Nov 15 '22 23:11

haku

For a list of arrays with different sizes, use the following:

import numpy as np

# list of sequences with different lengths
a = [[1],[2,3,4,5],[6,7,8]]

# concatenate them into a single 1D array
b = np.hstack(a)

print(b)

Output:

[1 2 3 4 5 6 7 8]
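If the pieces also differ in number of dimensions (say a 2D block mixed with 1D arrays), they may not be directly stackable. One possible workaround, sketched here with a made-up list called pieces, is to ravel each piece first and then concatenate:

```python
import numpy as np

# Hypothetical inputs: a 2D block, a single-element array, and a 1D range.
pieces = [np.array([[1, 2], [3, 4]]), np.array([5]), np.arange(6, 9)]

# Flatten each piece to 1D, then join them end to end.
flat = np.concatenate([np.ravel(p) for p in pieces])

print(flat)
```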

answered Nov 16 '22 00:11

bikram
