The first layer of my neural network looks like this:

```python
model.add(Conv1D(filters=40,
                 kernel_size=25,
                 input_shape=x_train.shape[1:],
                 activation='relu',
                 kernel_regularizer=regularizers.l2(5e-6),
                 strides=1))
```
If my input shape is `(600, 10)`, I get `(None, 576, 40)` as the output shape. If my input shape is `(6000, 1)`, I get `(None, 5976, 40)` as the output shape.

So my question is: what exactly is happening here? Is the first example simply ignoring 90% of the input?
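For reference, here is a minimal script that reproduces both shapes (assuming the standard tf.keras imports; I've replaced `x_train.shape[1:]` with the literal shapes):

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv1D
from tensorflow.keras import regularizers

for shape in [(600, 10), (6000, 1)]:
    model = Sequential()
    model.add(Conv1D(filters=40,
                     kernel_size=25,
                     input_shape=shape,
                     activation='relu',
                     kernel_regularizer=regularizers.l2(5e-6),
                     strides=1))
    print(shape, '->', model.output_shape)
# (600, 10) -> (None, 576, 40)
# (6000, 1) -> (None, 5976, 40)
```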
It is not "ignoring" a 90% of the input, the problem is simply that if you perform a 1-dimensional convolution with a kernel of size K over an input of size X the result of the convolution will have size X - K + 1. If you want the output to have the same size as the input, then you need to extend or "pad" your data. There are several strategies for that, such as add zeros, replicate the value at the ends or wrap around. Keras' Convolution1D
has a padding
parameter that you can set to "valid"
(the default, no padding), "same"
(add zeros at both sides of the input to obtain the same output size as the input) and "causal"
(padding with zeros at one end only, idea taken from WaveNet).
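To make that concrete, here is a minimal sketch (assuming the tf.keras API) that runs the three padding modes over an input shaped like yours:

```python
import numpy as np
from tensorflow.keras.layers import Conv1D

x = np.zeros((1, 600, 10), dtype='float32')  # batch of 1, length 600, 10 channels

for pad in ['valid', 'same', 'causal']:
    y = Conv1D(filters=40, kernel_size=25, padding=pad)(x)
    print(pad, y.shape)
# valid  (1, 576, 40)  -> 600 - 25 + 1
# same   (1, 600, 40)  -> zeros added at both ends
# causal (1, 600, 40)  -> zeros added at one end only
```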
Update
About the questions in your comments: you say your input is `(600, 10)`. That, I assume, is the size of one example, and you have a batch of examples with size `(N, 600, 10)`. From the point of view of the convolution operation, this means you have `N` examples, each with a length of 600 (this "length" may be time or anything else; it is just the dimension across which the convolution works), and at each of these 600 points you have a vector of size 10. Each of these vectors is considered an atomic sample with 10 features (e.g. price, height, size, whatever), or, as they are sometimes called in the context of convolution, "channels" (from the RGB channels used in 2D image convolution).
The point is that the convolution has a kernel size and a number of output channels, which is the `filters` parameter in Keras. In your example, what the convolution does is take every possible slice of 25 contiguous 10-vectors and produce a single 40-vector for each of them (for every example in the batch, of course). So you pass from having 10 features or channels in your input to having 40 after the convolution. It is not using only one of the 10 elements in the last dimension; it is using all of them to produce the output.
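You can check this by inspecting the layer's weights: the kernel of a Keras `Conv1D` has shape `(kernel_size, input_channels, filters)`, so every filter mixes all input channels. A small sketch, assuming the tf.keras API:

```python
from tensorflow.keras.layers import Conv1D

layer = Conv1D(filters=40, kernel_size=25)
layer.build(input_shape=(None, 600, 10))  # your (600, 10) examples

kernel, bias = layer.get_weights()
print(kernel.shape, bias.shape)  # (25, 10, 40) (40,)
# 25 * 10 * 40 weights + 40 biases: all 10 input channels feed every filter.
```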
If the convolution is interpreting the dimensions of your input differently from what you intended, or if the operation it performs is not what you were expecting, you may need to either reshape your input or use a different kind of layer, as sketched below.
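For example, if your `(600, 10)` array actually holds one signal of 6000 consecutive scalar samples stored row by row (an assumption on my part; only you know your data layout), flattening it to `(6000, 1)` makes the convolution slide over all 6000 samples instead of over 600 steps of 10 parallel features:

```python
import numpy as np

x = np.arange(6000, dtype='float32').reshape(600, 10)  # stand-in for one example

# Valid only if consecutive samples really are laid out row by row:
x_seq = x.reshape(6000, 1)

# A batch for Keras would then have shape (N, 6000, 1) instead of (N, 600, 10).
```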