In PyTorch's "MaxPool2d", is padding added depending on "ceil_mode"?

Tags:

padding

neural-network

deep-learning

pytorch

max-pooling

In MaxPool2d, padding defaults to 0 and ceil_mode defaults to False. With an input of size 7x7 and kernel=2, stride=2, the output shape is 3x3, but with ceil_mode=True it becomes 4x4. That makes sense because (if the output-size formula is correct) the unrounded output size for a 7x7 input would be 3.5x3.5, which rounds to either 3x3 or 4x4 depending on ceil_mode.
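For reference, this is the output-size formula as given in the PyTorch `MaxPool2d` documentation, assumed here to be the one the question refers to, together with the arithmetic for the 7x7 case (`padding=0`, `dilation=1`):

```latex
H_{out} = \left\lfloor \frac{H_{in} + 2 \cdot \text{padding} - \text{dilation} \cdot (\text{kernel\_size} - 1) - 1}{\text{stride}} + 1 \right\rfloor

% Worked example: H_in = 7, kernel_size = 2, stride = 2, padding = 0, dilation = 1
\frac{7 + 0 - 1 \cdot (2 - 1) - 1}{2} + 1 = \frac{5}{2} + 1 = 3.5
\quad\Rightarrow\quad \lfloor 3.5 \rfloor = 3, \qquad \lceil 3.5 \rceil = 4
```

With `ceil_mode=True` the floor is replaced by a ceiling, which gives the 4x4 output.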

Now, my question is: if ceil_mode=True, does it change the default padding?

If it does, how is the padding added, i.e. is it added on the left or the right first, and at the top or the bottom first?

asked Jan 25 '20 05:01

paul-shuvo

1 Answer

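ceil_mode=True effectively changes the padding: the padding argument itself stays at 0, but the last pooling window in each dimension is allowed to run past the edge of the input.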

From the referenced post: "In the case of ceil mode, additional columns and rows are added at the right as well as at the down. (Not top and not left). It does not need to be one extra column. It depends on the stride value as well. I just wrote small code snippet where you can check how the populated values are pooled in either modes."
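The dependence on stride can be made concrete with a small sketch. It assumes the output-size rule from the PyTorch pooling documentation, including the note that windows starting beyond the input (or left padding) are dropped. The helper names `pool_output_size` and `overhang` are hypothetical and not part of PyTorch.

```python
def pool_output_size(n, kernel, stride, padding=0, dilation=1, ceil_mode=False):
    """Output length along one dimension, following the documented formula."""
    span = n + 2 * padding - dilation * (kernel - 1) - 1
    # Integer floor or ceiling of span / stride
    steps = -(-span // stride) if ceil_mode else span // stride
    out = steps + 1
    # In ceil mode, the last window must still start inside the input
    # or the left padding; otherwise it is dropped.
    if ceil_mode and (out - 1) * stride >= n + padding:
        out -= 1
    return out


def overhang(n, kernel, stride):
    """Cells by which the last ceil-mode window extends past the input (padding=0)."""
    out = pool_output_size(n, kernel, stride, ceil_mode=True)
    last_end = (out - 1) * stride + kernel  # exclusive end index of the last window
    return max(0, last_end - n)


print(pool_output_size(7, 2, 2))                  # 3
print(pool_output_size(7, 2, 2, ceil_mode=True))  # 4
print(overhang(7, 2, 2))                          # 1
print(overhang(7, 3, 3))                          # 2
```

For a 7-wide input, kernel 2 with stride 2 reaches one column past the edge, while kernel 3 with stride 3 reaches two, so the number of extra columns is not always one.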

Before I found the post referenced above, I experimented with your problem in the same way. It also seems that the extra rows and columns are not filled with zeros during the pooling operation: every value in my example below is negative, so a zero would have been the maximum of any window that touched the padding, but this is not the case.

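    import torch
    import torch.nn as nn

    # 2 channels of 7x7, all values negative (integers in [-10, -6])
    test_tensor = torch.FloatTensor(2,7,7).random_(-10,-5)
    print(test_tensor)
    # ceil_mode=True: the last window may overhang the right/bottom edge -> 4x4
    max_pool = nn.MaxPool2d(kernel_size=2, stride=2, ceil_mode=True)
    print(max_pool(test_tensor))
    # ceil_mode=False: the incomplete last window is dropped -> 3x3
    max_pool = nn.MaxPool2d(kernel_size=2, stride=2, ceil_mode=False)
    print(max_pool(test_tensor))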

Random sample tensor:

    tensor([[[ -6.,  -9.,  -7., -10.,  -6.,  -8.,  -6.],
             [-10., -10., -10.,  -6., -10.,  -9.,  -6.],
             [-10.,  -7.,  -7.,  -8., -10., -10.,  -9.],
             [ -8., -10., -10.,  -9.,  -9., -10.,  -9.],
             [ -8.,  -6.,  -8.,  -6.,  -7.,  -7.,  -9.],
             [-10.,  -8.,  -7., -10.,  -9.,  -6.,  -8.],
             [-10.,  -6.,  -9., -10.,  -9.,  -9., -10.]],

            [[-10.,  -8.,  -6., -10.,  -9.,  -6.,  -7.],
             [ -7.,  -7., -10., -10.,  -6.,  -9.,  -7.],
             [ -6., -10.,  -7.,  -8.,  -8., -10.,  -9.],
             [ -8.,  -8.,  -6.,  -7.,  -6.,  -8.,  -6.],
             [ -9.,  -8.,  -7., -10.,  -8.,  -8.,  -7.],
             [-10., -10.,  -6.,  -9.,  -8.,  -8.,  -8.],
             [-10.,  -6.,  -9.,  -9.,  -7.,  -9., -10.]]])

ceil_mode=True:

    tensor([[[ -6.,  -6.,  -6.,  -6.],
             [ -7.,  -7.,  -9.,  -9.],
             [ -6.,  -6.,  -6.,  -8.],
             [ -6.,  -9.,  -9., -10.]],

            [[ -7.,  -6.,  -6.,  -7.],
             [ -6.,  -6.,  -6.,  -6.],
             [ -8.,  -6.,  -8.,  -7.],
             [ -6.,  -9.,  -7., -10.]]])

ceil_mode=False:

    tensor([[[-6., -6., -6.],
             [-7., -7., -9.],
             [-6., -6., -6.]],

            [[-7., -6., -6.],
             [-6., -6., -6.],
             [-8., -6., -8.]]])
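The outputs above are consistent with the overhanging windows simply being clipped to the input, so that out-of-bounds cells never take part in the maximum. The following is a minimal pure-Python sketch of that behaviour, not PyTorch's actual implementation. The function name `max_pool2d_ref` is hypothetical, and it assumes `padding=0` and `dilation=1`. It is checked against the first channel of the sample tensor.

```python
def max_pool2d_ref(img, kernel=2, stride=2, ceil_mode=False):
    """Reference max pooling on a list of lists (padding=0, dilation=1 assumed)."""
    rows, cols = len(img), len(img[0])

    def out_size(n):
        span = n - kernel
        steps = -(-span // stride) if ceil_mode else span // stride
        out = steps + 1
        # Drop a last window that would start outside the input
        if ceil_mode and (out - 1) * stride >= n:
            out -= 1
        return out

    result = []
    for i in range(out_size(rows)):
        r0 = i * stride
        out_row = []
        for j in range(out_size(cols)):
            c0 = j * stride
            # Clip the window to the input: out-of-bounds cells are skipped
            window = [img[r][c]
                      for r in range(r0, min(r0 + kernel, rows))
                      for c in range(c0, min(c0 + kernel, cols))]
            out_row.append(max(window))
        result.append(out_row)
    return result


# First channel of the sample tensor above
channel0 = [
    [-6.0, -9.0, -7.0, -10.0, -6.0, -8.0, -6.0],
    [-10.0, -10.0, -10.0, -6.0, -10.0, -9.0, -6.0],
    [-10.0, -7.0, -7.0, -8.0, -10.0, -10.0, -9.0],
    [-8.0, -10.0, -10.0, -9.0, -9.0, -10.0, -9.0],
    [-8.0, -6.0, -8.0, -6.0, -7.0, -7.0, -9.0],
    [-10.0, -8.0, -7.0, -10.0, -9.0, -6.0, -8.0],
    [-10.0, -6.0, -9.0, -10.0, -9.0, -9.0, -10.0],
]

pooled_ceil = max_pool2d_ref(channel0, ceil_mode=True)
pooled_floor = max_pool2d_ref(channel0, ceil_mode=False)
for row in pooled_ceil:
    print(row)
```

The bottom-right entry comes from a window that covers only the single cell -10; if the extra row and column had been filled with zeros, that entry would be 0 instead.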

answered Oct 04 '22 16:10

a-doering
