
Difference between torch.flatten() and nn.Flatten()

What are the differences between torch.flatten() and torch.nn.Flatten()?

asked Feb 01 '21 by jules


People also ask

What is Torch nn flatten?

torch.flatten flattens all dimensions by default, while torch.nn.Flatten flattens all dimensions starting from the second dimension (index 1) by default. You can see this behaviour in the default values of the start_dim and end_dim arguments.

What is Torch nn module?

torch.nn.Module is the base class used to develop all neural network models. torch.nn.Sequential() is a sequential container used to combine different layers into a feed-forward network.

What does flattening a tensor do?

A flatten operation reshapes a tensor into a single dimension whose length equals the number of elements the tensor contains, i.e. a 1D array of elements. Flattening a tensor removes all of its dimensions except one.

What is the difference between reshape and view PyTorch?

torch.Tensor.view returns a new tensor that shares the same underlying data as the original, so changes to the original tensor are visible through the view (and vice versa); it requires a memory layout compatible with the requested shape. torch.reshape returns a view when the layout allows it and silently makes a copy otherwise, so changes to the original may or may not be reflected in the result.
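
A quick illustration of that distinction (a minimal sketch; reshape only copies when the memory layout rules out a view):

import torch

x = torch.arange(6)

v = x.view(2, 3)   # a view: shares storage with x
v[0, 0] = 100
print(x[0])        # tensor(100) -> the change is visible through x

# reshape returns a view when the layout allows it, a copy otherwise
y = torch.arange(6).view(2, 3).t()   # transposed, hence non-contiguous
r = y.reshape(6)                     # layout forces a copy here
r[0] = -1
print(y.flatten()[0])                # tensor(0), unchanged: r is a copy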


2 Answers

Flattening is available in three forms in PyTorch:

  • As a tensor method (OOP style), torch.Tensor.flatten, applied directly on a tensor: x.flatten().

  • As a function (functional form), torch.flatten, applied as torch.flatten(x).

  • As a module (an nn.Module layer), nn.Flatten(), generally used in a model definition.

All three are identical and share the same implementation; the only difference is that nn.Flatten has start_dim set to 1 by default, to avoid flattening the first axis (usually the batch axis), while the other two flatten from axis=0 to axis=-1, i.e. the entire tensor, if no arguments are given.
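
For instance, on a batch-shaped tensor the defaults play out like this (a quick sketch; the shape is made up for illustration):

import torch
import torch.nn as nn

x = torch.randn(32, 3, 28, 28)   # e.g. a batch of 32 RGB 28x28 images

print(x.flatten().shape)         # torch.Size([75264]) - method form, flattens everything
print(torch.flatten(x).shape)    # torch.Size([75264]) - functional form, same defaults
print(nn.Flatten()(x).shape)     # torch.Size([32, 2352]) - module form keeps the batch axis

# passing start_dim=1 explicitly makes the functional form match nn.Flatten
print(torch.flatten(x, start_dim=1).shape)   # torch.Size([32, 2352])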

answered Sep 30 '22 by Ivan


You can think of the job of torch.flatten() as simply performing a flattening operation on the tensor, with no strings attached. You give it a tensor, it flattens it and returns the result. That's all there is to it.

By contrast, nn.Flatten() is more sophisticated (i.e., it's a neural net layer). Being object oriented, it inherits from nn.Module, although internally it uses the plain tensor flatten() op in its forward() method to do the flattening. You can think of it as syntactic sugar over torch.flatten().
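
Conceptually, a minimal re-implementation might look like the following (a sketch, not PyTorch's actual source; MyFlatten is a hypothetical name):

import torch
import torch.nn as nn

class MyFlatten(nn.Module):
    # hypothetical sketch of what nn.Flatten does internally
    def __init__(self, start_dim=1, end_dim=-1):
        super().__init__()
        self.start_dim = start_dim
        self.end_dim = end_dim

    def forward(self, input):
        # delegate to the plain tensor op, as described above
        return input.flatten(self.start_dim, self.end_dim)

x = torch.randn(4, 2, 3)
assert MyFlatten()(x).shape == nn.Flatten()(x).shape   # both (4, 6)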


Important difference: a notable distinction is that torch.flatten(), with its default arguments, always returns a 1D tensor, provided the input is at least 1D, whereas nn.Flatten(), with its defaults, always returns a 2D tensor, provided the input is at least 2D (given a 1D tensor as input, it throws an IndexError).
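
A short demonstration of this behaviour with a 1D input (a minimal sketch):

import torch
import torch.nn as nn

t = torch.arange(5)              # 1D input

print(torch.flatten(t).shape)    # torch.Size([5]) -> still 1D, no complaints

try:
    nn.Flatten()(t)
except IndexError as e:
    print(e)                     # start_dim=1 is out of range for a 1D tensor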


Comparisons:

  • torch.flatten() is an API whereas nn.Flatten() is a neural net layer.

  • torch.flatten() is a Python function whereas nn.Flatten() is a Python class.

  • Because of the above point, nn.Flatten() comes with a lot of methods and attributes inherited from nn.Module.

  • torch.flatten() can be used in the wild (e.g., for simple tensor ops) whereas nn.Flatten() is expected to be used in an nn.Sequential() block as one of the layers (see the sketch after this list).

  • torch.flatten() is tracked by autograd only when its input tensor has the requires_grad flag set to True; nn.Flatten() relies on the same op internally, but as an nn.Module it slots naturally into graph-aware model definitions.

  • torch.flatten() operates on plain tensors only, whereas nn.Flatten() is typically placed between neural net layers (e.g., after conv layers and before a linear layer) inside a model.

  • Both torch.flatten() and nn.Flatten() return a view of the input tensor when possible (a copy is made if the memory layout does not permit a view). When a view is returned, any modification to the result also affects the input tensor. (See the code demo below.)
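
For instance, a typical placement of nn.Flatten() between the conv and linear parts of a small net might look like this (a sketch; the layer sizes are illustrative only):

import torch
import torch.nn as nn

# hypothetical toy ConvNet; sizes chosen only for illustration
model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),   # (N, 1, 28, 28) -> (N, 8, 28, 28)
    nn.ReLU(),
    nn.MaxPool2d(2),                              # -> (N, 8, 14, 14)
    nn.Flatten(),                                 # -> (N, 8 * 14 * 14) = (N, 1568)
    nn.Linear(8 * 14 * 14, 10),                   # classifier head
)

x = torch.randn(32, 1, 28, 28)
print(model(x).shape)   # torch.Size([32, 10])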


Code demo:

# input tensors to work with
In [109]: t1 = torch.arange(12).reshape(3, -1)
In [110]: t2 = torch.arange(12, 24).reshape(3, -1)
In [111]: t3 = torch.arange(12, 36).reshape(3, 2, -1)   # 3D tensor

Flattening with torch.flatten():

In [113]: t1flat = torch.flatten(t1)

In [114]: t1flat
Out[114]: tensor([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11])

# modification to the flattened tensor    
In [115]: t1flat[-1] = -1

# input tensor is also modified; thus flattening is a view.
In [116]: t1
Out[116]: 
tensor([[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, -1]])

Flattening with nn.Flatten():

In [123]: nnfl = nn.Flatten()
In [124]: t3flat = nnfl(t3)

# note that the result is 2D, as opposed to 1D with torch.flatten
In [125]: t3flat
Out[125]: 
tensor([[12, 13, 14, 15, 16, 17, 18, 19],
        [20, 21, 22, 23, 24, 25, 26, 27],
        [28, 29, 30, 31, 32, 33, 34, 35]])

# modification to the result
In [126]: t3flat[-1, -1] = -1

# input tensor also modified. Thus, flattened result is a view.
In [127]: t3
Out[127]: 
tensor([[[12, 13, 14, 15],
         [16, 17, 18, 19]],

        [[20, 21, 22, 23],
         [24, 25, 26, 27]],

        [[28, 29, 30, 31],
         [32, 33, 34, -1]]])

Tidbit: torch.flatten() is the precursor to nn.Flatten() and its sibling nn.Unflatten(), having existed from the very beginning. A legitimate use-case for nn.Flatten() emerged later, since flattening is a common requirement in almost all ConvNets (just before the softmax or elsewhere), so it was added in PR #22245.

There have also been recent proposals to use nn.Flatten() in ResNets for model surgery.

answered Sep 30 '22 by kmario23