How does tf.keras.layers.Conv2D with padding='same' and strides > 1 behave?

Tags: python, tensorflow, conv-neural-network

I read "What is the difference between 'SAME' and 'VALID' padding in tf.nn.max_pool of tensorflow?", but its explanation does not match my experiment.

import tensorflow as tf

inputs = tf.random.normal([1, 64, 64, 3])  # spelled tf.random_normal in older TensorFlow 1.x releases
print(inputs.shape)
conv = tf.keras.layers.Conv2D(6, 4, strides=2, padding='same')
outputs = conv(inputs)
print(outputs.shape)

This produces:

(1, 64, 64, 3)
(1, 32, 32, 6)

However, following the explanation in the linked question gives (1, 31, 31, 6), because without any padding there are no extra values outside the range covered by the filter.

How does tf.keras.layers.Conv2D with padding='same' and strides > 1 behave?
I want to know the exact answer and its evidence.

956

asked Dec 17 '18 16:12

T. Ogawa

2 Answers

Keras uses the TensorFlow implementation of padding. All the details are available in the TensorFlow documentation, which says:

First, consider the 'SAME' padding scheme. A detailed explanation of the reasoning behind it is given in these notes. Here, we summarize the mechanics of this padding scheme. When using 'SAME', the output height and width are computed as:
out_height = ceil(float(in_height) / float(strides[1]))
out_width  = ceil(float(in_width) / float(strides[2]))
The total padding applied along the height and width is computed as:
if (in_height % strides[1] == 0):
  pad_along_height = max(filter_height - strides[1], 0)
else:
  pad_along_height = max(filter_height - (in_height % strides[1]), 0)
if (in_width % strides[2] == 0):
  pad_along_width = max(filter_width - strides[2], 0)
else:
  pad_along_width = max(filter_width - (in_width % strides[2]), 0)
Finally, the padding on the top, bottom, left and right are:
pad_top = pad_along_height // 2
pad_bottom = pad_along_height - pad_top
pad_left = pad_along_width // 2
pad_right = pad_along_width - pad_left
Note that the division by 2 means that there might be cases when the padding on both sides (top vs bottom, right vs left) are off by one. In this case, the bottom and right sides always get the one additional padded pixel. For example, when pad_along_height is 5, we pad 2 pixels at the top and 3 pixels at the bottom. Note that this is different from existing libraries such as cuDNN and Caffe, which explicitly specify the number of padded pixels and always pad the same number of pixels on both sides.

For the 'VALID' scheme, the output height and width are computed as:
out_height = ceil(float(in_height - filter_height + 1) / float(strides[1]))
out_width  = ceil(float(in_width - filter_width + 1) / float(strides[2]))
and no padding is used.
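
As a rough check, the quoted formulas can be applied to the example from the question (64x64 input, kernel size 4, stride 2). The helper names `same_padding` and `valid_output` below are made up for this sketch and are not TensorFlow functions; the code only transcribes the documented arithmetic for one spatial dimension.

```python
import math


def same_padding(in_size, filter_size, stride):
    """Transcribe the quoted 'SAME' formulas for one spatial dimension.

    Returns (out_size, pad_before, pad_after).
    Hypothetical helper, not a TensorFlow API.
    """
    out_size = math.ceil(in_size / stride)
    if in_size % stride == 0:
        pad_total = max(filter_size - stride, 0)
    else:
        pad_total = max(filter_size - (in_size % stride), 0)
    pad_before = pad_total // 2          # top / left
    pad_after = pad_total - pad_before   # bottom / right gets the extra pixel
    return out_size, pad_before, pad_after


def valid_output(in_size, filter_size, stride):
    """Transcribe the quoted 'VALID' output-size formula (no padding)."""
    return math.ceil((in_size - filter_size + 1) / stride)


# Example from the question: 64x64 input, 4x4 kernel, stride 2.
print(same_padding(64, 4, 2))   # (32, 1, 1)
print(valid_output(64, 4, 2))   # 31

# An odd total padding is split unevenly: 64 input, 5x5 kernel, stride 2.
print(same_padding(64, 5, 2))   # (32, 1, 2)
```

Under these formulas, 'same' pads one pixel on each side and yields 32, while the 31 expected in the question is what the 'VALID' scheme would give.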

160

answered Oct 06 '22 11:10

BiBi

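In TensorFlow, for stride s and input size n, padding='same' gives an output size of

⌈n/s⌉

that is, the ceiling of the input size divided by the stride, independent of the kernel size. For the example in the question, ⌈64/2⌉ = 32.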
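
One way to convince yourself that this agrees with the documented padding rule is to pad first and then count how many kernel positions fit. The sketch below does that in plain Python for one dimension; `count_windows_same` is a hypothetical helper written for this illustration, and it assumes the total-padding formula from the TensorFlow documentation.

```python
import math


def count_windows_same(n, k, s):
    """Count kernel positions along one dimension after 'same' padding.

    n: input size, k: kernel size, s: stride. Hypothetical helper that
    assumes the total-padding rule from the TensorFlow documentation.
    """
    remainder = n % s
    pad_total = max(k - (s if remainder == 0 else remainder), 0)
    padded = n + pad_total
    # Slide a window of size k with step s over `padded` elements.
    return (padded - k) // s + 1


# The window count matches ceil(n / s) regardless of the kernel size.
for n in range(1, 70):
    for k in range(1, 8):
        for s in range(1, 6):
            assert count_windows_same(n, k, s) == math.ceil(n / s)

print(count_windows_same(64, 4, 2))  # 32
```

This is only a consistency check on the arithmetic, not a test of TensorFlow itself.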

answered Oct 06 '22 11:10

Gerges
