From this example: https://github.com/fchollet/keras/blob/master/examples/imdb_cnn.py
comes the snippet below. The embedding layer outputs a 400 x 50 matrix for each example in a batch. My question is: how does the 1D convolution work across that 400 x 50 matrix?
# we start off with an efficient embedding layer which maps
# our vocab indices into embedding_dims dimensions
model.add(Embedding(max_features,
                    embedding_dims,
                    input_length=maxlen,
                    dropout=0.2))

# we add a Convolution1D, which will learn nb_filter
# word group filters of size filter_length:
model.add(Convolution1D(nb_filter=nb_filter,
                        filter_length=filter_length,
                        border_mode='valid',
                        activation='relu',
                        subsample_length=1))
The 1D block consists of a configurable number of filters, each with a fixed size; a convolution operation is performed between the input and each filter, producing as output a new vector with as many channels as there are filters.
A 1 x 1 convolution is a convolution with some special properties: it can be used for dimensionality reduction, for efficient low-dimensional embeddings, and for applying a non-linearity after a convolution. It maps an input pixel with all its channels to an output pixel, which can be squeezed down to a desired output depth.
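A minimal NumPy sketch of the idea, with hypothetical sizes (an 8 x 8 feature map with 64 channels reduced to 16): a 1 x 1 convolution is just the same linear map from input channels to output channels applied independently at every pixel.

```python
import numpy as np

h, w, c_in, c_out = 8, 8, 64, 16
feature_map = np.random.randn(h, w, c_in)

# The entire bank of 1x1 kernels is one (c_in, c_out) weight matrix.
kernel = np.random.randn(c_in, c_out)

# Applied per pixel: each (64,) channel vector is mapped to a (16,) vector.
reduced = feature_map @ kernel
reduced = np.maximum(reduced, 0)   # non-linearity after the convolution

print(reduced.shape)  # (8, 8, 16)
```

Nothing spatial happens here; only the channel depth changes, which is why 1 x 1 convolutions are a cheap way to shrink (or grow) the number of channels.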
A 1-D convolutional layer applies sliding convolutional filters to 1-D input. The layer convolves the input by moving the filters along it, computing the dot product of the weights and the input at each position, and then adding a bias term.
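That sliding dot product can be sketched in plain NumPy. The sizes below are hypothetical but mirror the question (a 400-token sequence with 50-dimensional embeddings); with 'valid' padding and stride 1, a filter of length 5 fits at 400 - 5 + 1 = 396 positions.

```python
import numpy as np

seq_len, emb_dim = 400, 50      # embedding output for one example
n_filters, filter_len = 3, 5    # hypothetical small filter bank

x = np.random.randn(seq_len, emb_dim)
w = np.random.randn(n_filters, filter_len, emb_dim)  # each filter spans all 50 dims
b = np.zeros(n_filters)

out_len = seq_len - filter_len + 1   # 396 positions ('valid', stride 1)
out = np.empty((out_len, n_filters))
for t in range(out_len):
    window = x[t:t + filter_len]     # (filter_len, emb_dim) slice of the input
    # dot product of each filter with the window, plus bias
    out[t] = np.tensordot(w, window, axes=([1, 2], [0, 1])) + b
out = np.maximum(out, 0)             # ReLU, as in the Keras snippet

print(out.shape)  # (396, 3)
```

Each filter slides along the time axis only, but at every step it covers the full embedding depth, which is the key point for the question above.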
A convolutional neural network (CNN) takes an image, assigns learnable weights to the different objects in the image, and uses them to distinguish those objects from one another. A CNN requires very little pre-processing compared with other deep learning algorithms.
Coming from a signal-processing background, it also took me a while to understand the concept, and that seems to be the case for many people in the community.
Pyan gave a very good explanation. Since it is often explained only in words on many forums, I made a little animation that I hope will help.
Below you can see the input tensor, the filter (or weights), and the output tensor. You can also see how the size of the output tensor depends on the number of filters used (represented with different colours).
Visual representation of the 1D convolution (simplified)
Note that to perform the scalar multiplication between the input and the filter, the filter should be transposed. There are also different implementations (Keras, TensorFlow, PyTorch, ...), but I think this animation gives a good representation of what is happening.
Hope it can help someone.
In convolutional neural networks (CNNs), 1D and 2D filters are not really one- and two-dimensional; that is just a naming convention.
In your example, each 1D filter is actually an L x 50 filter, where L is the filter length. The convolution is performed along one dimension only, which is why it is called 1D. So, with proper padding, each 1D filter convolution gives a 400 x 1 vector, and the Convolution1D layer as a whole outputs a 400 x nb_filter matrix.
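A quick NumPy check of those shapes, with hypothetical values (L = 3, nb_filter = 250): using 'same' padding along the time axis only, each L x 50 filter produces a 400 x 1 vector, and stacking all filters gives 400 x nb_filter.

```python
import numpy as np

maxlen, emb_dim = 400, 50
nb_filter, L = 250, 3            # hypothetical filter count and length

x = np.random.randn(maxlen, emb_dim)
pad = L // 2
x_padded = np.pad(x, ((pad, pad), (0, 0)))   # 'same' padding, time axis only
filters = np.random.randn(nb_filter, L, emb_dim)

# Each filter slides over the 400 positions; each step sums an L x 50 product.
out = np.stack([
    np.array([np.sum(f * x_padded[t:t + L]) for t in range(maxlen)])
    for f in filters
], axis=1)

print(out.shape)  # (400, 250)
```

Each column of the result is one filter's 400 x 1 response, matching the shapes described above.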