What is a dimensional range of [-1,0] in Pytorch?

So I'm struggling to understand some terminology about collections in PyTorch. I keep running into the same kinds of errors about the range of my tensors being incorrect, and when I try to Google for a solution, the explanations often confuse me further.

Here is an example:

m = torch.nn.LogSoftmax(dim=1)
input = torch.tensor([0.3300, 0.3937, -0.3113, -0.2880])
output = m(input)

I don't see anything wrong with the above code, and I've defined my LogSoftmax to accept a one-dimensional input. In my experience with other programming languages, the collection [0.3300, 0.3937, -0.3113, -0.2880] is a single dimension.

The above triggers the following error for m(input):

IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)

What does that mean?

I passed in a one dimensional tensor, but then it tells me that it was expecting a range of [-1, 0], but got 1.

  • A range of what?
  • Why is the error comparing a dimension of 1 to [-1, 0]?
  • What do the two numbers [-1, 0] mean?

I searched for an explanation for this error, and I find things like this link which make no sense to me as a programmer:

https://github.com/pytorch/pytorch/issues/5554#issuecomment-370456868

So I was able to fix the above code by adding another dimension to my tensor data.

m = torch.nn.LogSoftmax(dim=1)
input = torch.tensor([[-0.3300, 0.3937, -0.3113, -0.2880]])
output = m(input)

So that works, but I don't understand how [-1,0] explains a nested collection.

Further experiments showed that the following also works:

m = torch.nn.LogSoftmax(dim=1)
input = torch.tensor([[0.0, 0.1], [1.0, 0.1], [2.0, 0.1]])
output = m(input)

So dim=1 means a collection of collections, but I don't understand how that relates to [-1, 0].
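As a sanity check (my own addition, not part of the original question), `.dim()` and `.shape` report how many axes a tensor has, which is what the "nesting depth" corresponds to:

```python
import torch

# The flat tensor from the first snippet has one axis; the nested one has two.
flat = torch.tensor([0.3300, 0.3937, -0.3113, -0.2880])
nested = torch.tensor([[0.0, 0.1], [1.0, 0.1], [2.0, 0.1]])

print(flat.dim(), flat.shape)      # 1 torch.Size([4])
print(nested.dim(), nested.shape)  # 2 torch.Size([3, 2])
```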

When I try using LogSoftmax(dim=2):

m = torch.nn.LogSoftmax(dim=2)
input = torch.tensor([[0.0, 0.1], [1.0, 0.1], [2.0, 0.1]])
output = m(input)

The above gives me the following error:

IndexError: Dimension out of range (expected to be in range of [-2, 1], but got 2)

Again I'm confused: why does dim=2 get compared against [-2, 1], and where did the 1 come from?

I can fix the error above by nesting collections another level, but at this point I don't understand what values LogSoftmax is expecting.

m = torch.nn.LogSoftmax(dim=2)
input = torch.tensor([[[0.0, 0.1]], [[1.0, 0.1]], [[2.0, 0.1]]])
output = m(input)

I am super confused by this notation: what do [-1, 0] and [-2, 1] mean?

If the first value is the nesting depth, then why is it negative, and what could the second number mean?

There is no error code associated with this error, so it's been difficult to find documentation on the subject. It appears to be an extremely common error that people get confused by, yet I can find nothing in the PyTorch documentation that talks specifically about it.

asked Jan 12 '20 by Reactgular


1 Answer

When specifying a tensor dimension as an argument to a function (e.g. m = torch.nn.LogSoftmax(dim=1)), you can use positive dimension indexing, starting with 0 for the first dimension, 1 for the second, etc.
Alternatively, you can use negative dimension indexing, counting from the last dimension back to the first: -1 indicates the last dimension, -2 the second to last, etc.

Example:
If you have a 4D tensor of dimensions b-by-c-by-h-by-w, then:

  • The "batch" dimension (the first) can be accessed as either dim=0 or dim=-4.
  • The "channel" dimension (the second) can be accessed as either dim=1 or dim=-3.
  • The "height"/"vertical" dimension (the third) can be accessed as either dim=2 or dim=-2.
  • The "width"/"horizontal" dimension (the fourth) can be accessed as either dim=3 or dim=-1.

Therefore, if you have a 4D tensor, the dim argument can take values in the range [-4, 3].
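A quick check of this equivalence (my own sketch, using a random 4D tensor, not code from the question): the positive and negative forms of the same dim produce identical results.

```python
import torch

# 4D tensor of shape (b, c, h, w): b=2, c=3, h=4, w=5.
x = torch.randn(2, 3, 4, 5)

# The "channel" axis can be addressed as dim=1 or dim=-3 (since -3 + 4 = 1).
pos = torch.nn.LogSoftmax(dim=1)(x)
neg = torch.nn.LogSoftmax(dim=-3)(x)

print(torch.equal(pos, neg))  # True: both address the same axis
```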

In your case you have a 1D tensor, and therefore the dim argument can be either 0 or -1 (which in this degenerate case refer to the same dimension). That is exactly what the error message [-1, 0] reports: the valid range of dim values for your tensor.
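To make the 1D case concrete (a sketch using the tensor from the question): dim=0 and dim=-1 both work and agree, while dim=1 reproduces the error you saw.

```python
import torch

t = torch.tensor([0.3300, 0.3937, -0.3113, -0.2880])

# Only dim=0 and dim=-1 are valid for a 1D tensor, and they are the same axis.
a = torch.nn.LogSoftmax(dim=0)(t)
b = torch.nn.LogSoftmax(dim=-1)(t)
print(torch.equal(a, b))  # True

# dim=1 is outside the valid range [-1, 0], hence the IndexError.
try:
    torch.nn.LogSoftmax(dim=1)(t)
except IndexError as e:
    print(e)  # Dimension out of range (expected to be in range of [-1, 0], but got 1)
```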

answered Oct 12 '22 by Shai