 

What does -1 mean in pytorch view?

As the question says, what does -1 do in pytorch view?

>>> a = torch.arange(1, 17)
>>> a
tensor([  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.,
         11.,  12.,  13.,  14.,  15.,  16.])

>>> a.view(1,-1)
tensor([[  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.,
          11.,  12.,  13.,  14.,  15.,  16.]])

>>> a.view(-1,1)
tensor([[  1.],
        [  2.],
        [  3.],
        [  4.],
        [  5.],
        [  6.],
        [  7.],
        [  8.],
        [  9.],
        [ 10.],
        [ 11.],
        [ 12.],
        [ 13.],
        [ 14.],
        [ 15.],
        [ 16.]])

Does -1 generate an additional dimension? Does it behave the same as -1 in numpy's reshape?

asked Jun 11 '18 by aerin

People also ask

What does view(-1) do in PyTorch?

It'll modify the tensor metadata and will not create a copy of it.
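For example, a quick way to check this yourself is a minimal sketch using data_ptr(), which returns the address of a tensor's underlying storage:

import torch

a = torch.arange(16)
b = a.view(4, 4)

# Same storage, different shape metadata: no data was copied.
print(b.data_ptr() == a.data_ptr())  # True
print(a.shape, b.shape)              # torch.Size([16]) torch.Size([4, 4])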

What does * do in PyTorch?

What is *? For .view(), PyTorch expects the new shape to be provided as individual int arguments (represented in the docs as *shape). The asterisk (*) can be used in Python to unpack a list into its individual elements, thus passing view the input arguments in the form it expects.
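A minimal sketch of that unpacking in action:

import torch

x = torch.arange(16)
shape = [2, 8]

# * unpacks the list, so this call is equivalent to x.view(2, 8):
# each dimension is passed as a separate int argument.
print(x.view(*shape).size())  # torch.Size([2, 8])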

What does PyTorch view do?

PyTorch allows a tensor to be a View of an existing tensor. View tensor shares the same underlying data with its base tensor. Supporting View avoids explicit data copy, thus allows us to do fast and memory efficient reshaping, slicing and element-wise operations.
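Since the view shares its base tensor's storage, writes through one are visible through the other. A minimal sketch:

import torch

base = torch.zeros(2, 3)
v = base.view(6)

# Writing through the view also changes the base tensor,
# because both share the same underlying storage.
v[0] = 42.
print(base[0, 0])  # tensor(42.)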


2 Answers

Yes, it does behave like -1 in numpy.reshape(), i.e. the actual value for this dimension will be inferred so that the number of elements in the view matches the original number of elements.

For instance:

import torch

x = torch.arange(6)

print(x.view(3, -1))      # inferred size will be 2 as 6 / 3 = 2
# tensor([[ 0.,  1.],
#         [ 2.,  3.],
#         [ 4.,  5.]])

print(x.view(-1, 6))      # inferred size will be 1 as 6 / 6 = 1
# tensor([[ 0.,  1.,  2.,  3.,  4.,  5.]])

print(x.view(1, -1, 2))   # inferred size will be 3 as 6 / (1 * 2) = 3
# tensor([[[ 0.,  1.],
#          [ 2.,  3.],
#          [ 4.,  5.]]])

# print(x.view(-1, 5))    # throws an error as there's no int N so that 5 * N = 6
# RuntimeError: invalid argument 2: size '[-1 x 5]' is invalid for input with 6 elements

# print(x.view(-1, -1, 3))  # throws an error as only one dimension can be inferred
# RuntimeError: invalid argument 1: only one dimension can be inferred
answered Sep 28 '22 by benjaminplanche


I love the answer that Benjamin gives (https://stackoverflow.com/a/50793899/1601580):

Yes, it does behave like -1 in numpy.reshape(), i.e. the actual value for this dimension will be inferred so that the number of elements in the view matches the original number of elements.

But I think an edge case that might not be intuitive (at least it wasn't for me) is calling it with a single -1, i.e. tensor.view(-1). It works exactly the same way as always, except that since you are giving view a single number, it assumes you want a single dimension. If you had tensor.view(-1, Dnew), it would produce a tensor of two dimensions/indices but would make sure the first dimension is of the correct size according to the original dimensions of the tensor. Say you had (D1, D2) and Dnew = D1*D2; then the new first dimension would be 1.

For real examples with code you can run:

import torch

x = torch.randn(1, 5)
x = x.view(-1)
print(x.size())

x = torch.randn(2, 4)
x = x.view(-1, 8)
print(x.size())

x = torch.randn(2, 4)
x = x.view(-1)
print(x.size())

x = torch.randn(2, 4, 3)
x = x.view(-1)
print(x.size())

output:

torch.Size([5])
torch.Size([1, 8])
torch.Size([8])
torch.Size([24])

History/Context

I feel a good example (a common case early on in PyTorch, before the Flatten layer was officially added) is this common code:

import torch.nn as nn

class Flatten(nn.Module):
    def forward(self, input):
        # input.size(0) usually denotes the batch size so we want to keep that
        return input.view(input.size(0), -1)

for use with nn.Sequential. In this light, x.view(-1) is a weird flatten layer that is missing the batch dimension (i.e. it does not keep a leading dimension of size 1). Adding or removing that dimension of 1 is usually important for the code to actually run.
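For instance, here is a minimal sketch of how that Flatten module was typically dropped into nn.Sequential; the layer sizes are hypothetical, chosen just to show Flatten bridging conv and linear layers:

import torch
import torch.nn as nn

class Flatten(nn.Module):
    def forward(self, input):
        # keep the batch dimension, flatten the rest
        return input.view(input.size(0), -1)

model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3),  # (N, 1, 28, 28) -> (N, 8, 26, 26)
    nn.ReLU(),
    Flatten(),                       # (N, 8, 26, 26) -> (N, 8*26*26)
    nn.Linear(8 * 26 * 26, 10),
)

x = torch.randn(4, 1, 28, 28)
print(model(x).size())  # torch.Size([4, 10])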


Example 2

If you are wondering what x.view(-1) does, it flattens the vector. Why? Because it has to construct a new view with only one dimension and infer its size, so it flattens the whole tensor. In addition, this operation seems to avoid the very nasty bugs that .resize() brings, since the order of the elements appears to be respected. FYI, PyTorch now has an op for flattening: https://pytorch.org/docs/stable/generated/torch.flatten.html

#%%
"""
Summary: view(-1, ...) keeps the remaining dimensions as given and infers the
-1 location such that it respects the original view of the tensor. If it's
only .view(-1) then it ends up with a single dimension, given all the previous
ones, so it flattens the tensor.

ref: my answer https://stackoverflow.com/a/66500823/1601580
"""
import torch

x = torch.arange(6)
print(x)

x = x.reshape(3, 2)
print(x)

print(x.view(-1))

output:

tensor([0, 1, 2, 3, 4, 5])
tensor([[0, 1],
        [2, 3],
        [4, 5]])
tensor([0, 1, 2, 3, 4, 5])

See, the original flat tensor is returned!
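As a follow-up, a quick sketch checking that torch.flatten matches x.view(-1), and that flatten(start_dim=1) reproduces the batch-preserving Flatten module above:

import torch

x = torch.arange(24).reshape(2, 3, 4)

# torch.flatten with default args flattens everything, like x.view(-1)
print(torch.equal(torch.flatten(x), x.view(-1)))  # True

# start_dim=1 keeps the first (batch) dimension, like the Flatten module
print(torch.flatten(x, start_dim=1).size())       # torch.Size([2, 12])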

answered Sep 28 '22 by Charlie Parker