What is the difference between conv1d with kernel_size=1 and dense layer?

Tags:

I am building a CNN with Conv1D layers, and it trains pretty well. I'm now looking into how to reduce the number of features before feeding it into a Dense layer at the end of the model, so I've been reducing the size of the Dense layer, but then I came across this article. The article talks about the effect of using a Conv2D filters with a kernel_size=(1,1) to reduce the number of features.

I was wondering what the difference is between using a Conv2D layer with kernel_size=(1,1) tf.keras.layers.Conv2D(filters=n,kernel_size=(1,1)) and using a Dense layer of the same size tf.keras.layers.Dense(units=n)? From my perspective (I'm relatively new to neural nets), a filter with kernel_size=(1,1) is a single number, which is essentially equivalent to weight in a Dense layer, and both layers have biases, so are they equivalent, or am I misunderstanding something? And if my understanding is correct, in my case where I am using Conv1D layers, not Conv2D layers, does that change anything? As in is tf.keras.layers.Conv1D(filters=n, kernel_size=1) equivalent to tf.keras.layers.Dense(units=n)?

Please let me know if you need anything from me to clarify the question. I'm mostly curious about if Conv1D layers with kernel_size=1 and Conv2D layers with kernel_size=(1,1) behave differently than Dense layers.

715

asked Aug 16 '19 15:08

Michaela

2 Answers

Yes, since Dense layer is applied on the last dimension of its input (see this answer), Dense(units=N) and Conv1D(filters=N, kernel_size=1) (or Dense(units=N) and Conv2D(filters=N, kernel_size=1)) are basically equivalent to each other both in terms of connections and number of trainable parameters.

answered Sep 27 '22 21:09

today

In 1D CNN, the kernel moves in 1 direction. The input and output data of 1D CNN is 2 dimensional. Mostly used on Time-Series Data, Natural Language Processing tasks etc. Definitely gonna see people using it in Kaggle NLP competitions and notebooks.

In 2D CNN, the kernel moves in 2 directions. The input and output data of 2D CNN is 3 dimensional. Mostly used on Image data. Definitely gonna see people using it in Kaggle CNN Image Processing competitions and notebooks

In 3D CNN, the kernel moves in 3 directions. The input and output data of 3D CNN is 4 dimensional. Mostly used on 3D Image data (MRI, CT Scans). Haven't personally seen applied version in competitions

answered Sep 27 '22 22:09

Elvin Aghammadzada

Related questions
                            
                                Calculate face_descriptor faster
                            
                                Error importing tensorflow in anaconda on Mac OSX
                            
                                How to get all layers' activations for a specific input for Tensorflow Hub modules?
                            
                                google colab setting a '^C' in the proccess
                            
                                deploying the Tensorflow model in Python
                            
                                ML Engine Runtime version and Python version not supported
                            
                                Custom Hebbian Layer Implementation in Keras - input/output dims and lateral node connections
                            
                                Why does Tensorflow warn about AVX2 while I am using MKL?
                            
                                What does axis=[1,2,3] mean in K.sum in keras backend?
                            
                                How to convert tensorflow.js model and weights to standard tensorflow?
                            
                                Use TensorFlow loss Global Objectives (recall_at_precision_loss) with Keras (not metrics)
                            
                                Inplementation of LSTM in Keras
                            
                                ImportError: cannot import name 'keras'
                            
                                How to run TF object detection API model_main.py in evaluation mode only
                            
                                Tensorflow2.0 training: model.compile vs GradientTape
                            
                                Suppress OpenMP debug messages when running Tensorflow on CPU
                            
                                No module named 'tensorflow.python.platform'
                            
                                Implement Causal CNN in Keras for multivariate time-series prediction
                            
                                'no SavedModel bundles found!' on tensorflow_hub model deployment to AWS SageMaker
                            
                                Keras: display model shape in Jupyter Notebook

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the difference between conv1d with kernel_size=1 and dense layer?

Tags:

neural-network

tensorflow

keras

conv-neural-network

tf.keras

Michaela

People also ask

2 Answers

today

Elvin Aghammadzada

Recent Activity

Donate For Us