understanding numpy's dstack function

Tags:

I have some trouble understanding what numpy's dstack function is actually doing. The documentation is rather sparse and just says:

Stack arrays in sequence depth wise (along third axis).

Takes a sequence of arrays and stack them along the third axis to make a single array. Rebuilds arrays divided by dsplit. This is a simple way to stack 2D arrays (images) into a single 3D array for processing.

So either I am really stupid and the meaning of this is obvious or I seem to have some misconception about the terms 'stacking', 'in sequence', 'depth wise' or 'along an axis'. However, I was of the impression that I understood these terms in the context of vstack and hstack just fine.

Let's take this example:

In [193]: a Out[193]:  array([[0, 3],        [1, 4],        [2, 5]]) In [194]: b Out[194]:  array([[ 6,  9],        [ 7, 10],        [ 8, 11]]) In [195]: dstack([a,b]) Out[195]:  array([[[ 0,  6],         [ 3,  9]],         [[ 1,  7],         [ 4, 10]],         [[ 2,  8],         [ 5, 11]]])

First of all, a and b don't have a third axis so how would I stack them along 'the third axis' to begin with? Second of all, assuming a and b are representations of 2D-images, why do I end up with three 2D arrays in the result as opposed to two 2D-arrays 'in sequence'?

237

asked Aug 04 '14 10:08

timgeb

2 Answers

It's easier to understand what np.vstack, np.hstack and np.dstack* do by looking at the .shape attribute of the output array.

Using your two example arrays:

print(a.shape, b.shape) # (3, 2) (3, 2)

np.vstack concatenates along the first dimension...
```
print(np.vstack((a, b)).shape) # (6, 2) 
```
np.hstack concatenates along the second dimension...
```
print(np.hstack((a, b)).shape) # (3, 4) 
```
and np.dstack concatenates along the third dimension.
```
print(np.dstack((a, b)).shape) # (3, 2, 2) 
```

Since a and b are both two dimensional, np.dstack expands them by inserting a third dimension of size 1. This is equivalent to indexing them in the third dimension with np.newaxis (or alternatively, None) like this:

print(a[:, :, np.newaxis].shape) # (3, 2, 1)

If c = np.dstack((a, b)), then c[:, :, 0] == a and c[:, :, 1] == b.

You could do the same operation more explicitly using np.concatenate like this:

print(np.concatenate((a[..., None], b[..., None]), axis=2).shape) # (3, 2, 2)

* Importing the entire contents of a module into your global namespace using import * is considered bad practice for several reasons. The idiomatic way is to import numpy as np.

answered Sep 21 '22 06:09

ali_m

Let x == dstack([a, b]). Then x[:, :, 0] is identical to a, and x[:, :, 1] is identical to b. In general, when dstacking 2D arrays, dstack produces an output such that output[:, :, n] is identical to the nth input array.

If we stack 3D arrays rather than 2D:

x = numpy.zeros([2, 2, 3]) y = numpy.ones([2, 2, 4]) z = numpy.dstack([x, y])

then z[:, :, :3] would be identical to x, and z[:, :, 3:7] would be identical to y.

As you can see, we have to take slices along the third axis to recover the inputs to dstack. That's why dstack behaves the way it does.

answered Sep 24 '22 06:09

user2357112 supports Monica

Related questions
                            
                                Fitting a Weibull distribution using Scipy
                            
                                Return a requests.Response object from Flask
                            
                                Get mapping of categorical variables in pandas
                            
                                How to use youtube-dl script to download starting from some index in a playlist?
                            
                                Can't find module cPickle using Python 3.5 and Anaconda
                            
                                coverage.py: exclude files
                            
                                What is PyMySQL and how does it differ from MySQLdb? Can it affect Django deployment?
                            
                                Can pandas handle variable-length whitespace as column delimiters [duplicate]
                            
                                How to downgrade tensorflow, multiple versions possible?
                            
                                Clipping input data to the valid range for imshow with RGB data ([0..1] for floats or [0..255] for integers)
                            
                                Why can't Python increment variable in closure?
                            
                                SyntaxError invalid token
                            
                                After install ROS Kinetic, cannot import OpenCV
                            
                                How to use an Exception's attributes in Python?
                            
                                Faster way of polygon intersection with shapely
                            
                                How can I replace the first occurrence of a character in every word?
                            
                                How to sort multidimensional array by column?
                            
                                Python 3 In Memory Zipfile Error. string argument expected, got 'bytes'
                            
                                How to clear the Entry widget after a button is pressed in Tkinter?
                            
                                How to create Password Field in Model Django

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

understanding numpy's dstack function

Tags:

python

multidimensional-array

concatenation

numpy

timgeb

People also ask

2 Answers

ali_m

user2357112 supports Monica

Recent Activity

Donate For Us