Seeing this answer I am wondering if the creation of a flattened view of X are essentially the same, as long as I know that the number of axes in X is 3: <pre class="prettyprint"><code>A = X.ravel() s0, s1, s2 = X.shape B = X.reshape(s0*s1*s2) C = X.reshape(-1) # thanks to @hpaulj below </code></pre> I'm not asking if A and B and C are the same. I'm wondering if the particular use of <code>ravel</code> and <code>reshape</code> in this situation are essentially the same, or if there are significant differences, advantages, or disadvantages to one or the other, provided that you know the number of axes of X ahead of time. The second method takes a few microseconds, but that does not seem to be size dependent.

Look at their <code>__array_interface__</code> and do some timings. The only difference that I can see is that <code>ravel</code> is faster. <code>.flatten()</code> has a more significant difference - it returns a copy. <pre class="prettyprint"><code>A.reshape(-1) </code></pre> is a simpler way to use reshape. You could study the respective docs, and see if there is something else. I haven't explored what happens when you specify <code>order</code>. I would use <code>ravel</code> if I just want it to be 1d. I use <code>.reshape</code> most often to change a 1d (e.g. <code>arange()</code>) to nd. e.g. <pre class="prettyprint"><code>np.arange(10).reshape(2,5).ravel() </code></pre> Or choose the one that makes your code most readable. <hr> <code>reshape</code> and <code>ravel</code> are defined in <code>numpy</code> C code: In https://github.com/numpy/numpy/blob/0703f55f4db7a87c5a9e02d5165309994b9b13fd/numpy/core/src/multiarray/shape.c <code>PyArray_Ravel(PyArrayObject *arr, NPY_ORDER order)</code> requires nearly 100 lines of C code. And it punts to <code>PyArray_Flatten</code> if the order changes. In the same file, <code>reshape</code> punts to <code>newshape</code>. That in turn returns a <code>view</code> is the shape doesn't actually change, tries <code>_attempt_nocopy_reshape</code>, and as last resort returns a <code>PyArray_NewCopy</code>. Both make use of <code>PyArray_Newshape</code> and <code>PyArray_NewFromDescr</code> - depending on how shapes and order mix and match. So identifying where reshape (to 1d) and ravel are different would require careful study. <hr> Another way to do this ravel is to make a new array, with a new shape, but the same data buffer: <pre class="prettyprint"><code>np.ndarray((24,),buffer=A.data) </code></pre> It times the same as <code>reshape</code>. Its <code>__array_interface__</code> is the same. I don't recommend using this method, but it may clarify what is going on with these reshape/ravel functions. They all make a new array, with new shape, but with share data (if possible). Timing differences are the result of different sequences of function calls - in Python and C - not in different handling of the data.

Differences between X.ravel() and X.reshape(s0s1s2) when number of axes known

Tags:

numpy

Seeing this answer I am wondering if the creation of a flattened view of X are essentially the same, as long as I know that the number of axes in X is 3:

A = X.ravel()

s0, s1, s2 = X.shape
B = X.reshape(s0*s1*s2)

C = X.reshape(-1)  # thanks to @hpaulj below

I'm not asking if A and B and C are the same.

I'm wondering if the particular use of ravel and reshape in this situation are essentially the same, or if there are significant differences, advantages, or disadvantages to one or the other, provided that you know the number of axes of X ahead of time.

The second method takes a few microseconds, but that does not seem to be size dependent.

328

asked Oct 14 '15 05:10

uhoh

1 Answers

Look at their __array_interface__ and do some timings. The only difference that I can see is that ravel is faster.

.flatten() has a more significant difference - it returns a copy.

A.reshape(-1)

is a simpler way to use reshape.

You could study the respective docs, and see if there is something else. I haven't explored what happens when you specify order.

I would use ravel if I just want it to be 1d. I use .reshape most often to change a 1d (e.g. arange()) to nd.

e.g.

np.arange(10).reshape(2,5).ravel()

Or choose the one that makes your code most readable.

reshape and ravel are defined in numpy C code:

In https://github.com/numpy/numpy/blob/0703f55f4db7a87c5a9e02d5165309994b9b13fd/numpy/core/src/multiarray/shape.c

PyArray_Ravel(PyArrayObject *arr, NPY_ORDER order) requires nearly 100 lines of C code. And it punts to PyArray_Flatten if the order changes.

In the same file, reshape punts to newshape. That in turn returns a view is the shape doesn't actually change, tries _attempt_nocopy_reshape, and as last resort returns a PyArray_NewCopy.

Both make use of PyArray_Newshape and PyArray_NewFromDescr - depending on how shapes and order mix and match.

So identifying where reshape (to 1d) and ravel are different would require careful study.

Another way to do this ravel is to make a new array, with a new shape, but the same data buffer:

np.ndarray((24,),buffer=A.data)

It times the same as reshape. Its __array_interface__ is the same. I don't recommend using this method, but it may clarify what is going on with these reshape/ravel functions. They all make a new array, with new shape, but with share data (if possible). Timing differences are the result of different sequences of function calls - in Python and C - not in different handling of the data.

answered Oct 19 '22 17:10

hpaulj

Related questions
                            
                                Python function equivalent to R's `pretty()`?
                            
                                Ignoring NaN in a dataframe
                            
                                loading an image from cifar-10 dataset
                            
                                Making numpy arrays JSON serializable
                            
                                Using numpy.vstack in numba
                            
                                Numpy get index of row with second-largest value
                            
                                Pandas Dataframe - Droping Certain Hours of the Day from 20 Years of Historical Data
                            
                                Pairwise Distances Between Two "islands"/"connected components" in Numpy Array
                            
                                What's wrong with my PCA?
                            
                                Matlab / Octave bwdist() in Python or C
                            
                                Iteration through all 1 dimensional subarrays of a multi-dimensional array
                            
                                Finding unique points in numpy array
                            
                                Alternative to Scipy mode function in Numpy?
                            
                                How do I split an ndarray based on array of indexes?
                            
                                Numpy cross-product on rectangular grid
                            
                                Plot Mandelbrot with matplotlib / pyplot / numpy / python
                            
                                Why won't Perceptron Learning Algorithm converge?
                            
                                2D Numpy array to HTML table?
                            
                                Find Arc/Circle equation given three points in space (3D)
                            
                                How to declare an ndarray in cython with a general floating point type

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Differences between X.ravel() and X.reshape(s0s1s2) when number of axes known

Tags:

numpy

uhoh

People also ask

1 Answers

hpaulj

Recent Activity

Donate For Us

Differences between X.ravel() and X.reshape(s0*s1*s2) when number of axes known

Tags:

numpy

uhoh

People also ask

1 Answers

hpaulj

Related questions

Recent Activity

Donate For Us

Differences between X.ravel() and X.reshape(s0s1s2) when number of axes known