Convolving Across Channels in Keras CNN: Conv1D, Depthwise Separable Conv, CCCP?

Tags:

I am developing a CNN in keras to classify satellite imagery that has 10 spectral bands. I'm getting decent accuracy with the network below (~60% val accuracy across 15 classes) but I want to better incorporate the relationships between spectral bands at a single pixel which can yield a lot of information on the pixel's class. I see a lot of papers doing this but it is often called different things. For example:

Cascaded cross-channel parametric pooling
Conv1D
Depthwise Separable Convolution
Conv2D(num_filters, (1, 1))

And I'm not certain about the differences between these approaches (if there are any) and how I should implement this in my simple CNN below. I'm also not clear if I should do this at the very beginning or towards the end. I'm inclined to do it right at the start when the channels are still the raw spectral data rather than the feature maps.

input_shape = (32,32,10)
num_classes = 15

model = Sequential()
model.add(Conv2D(32, (3, 3), padding='same', input_shape=input_shape))
model.add(Activation('relu'))

model.add(Conv2D(32, (3, 3)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))

model.add(Conv2D(64, (3, 3), padding='same'))
model.add(Activation('relu'))
model.add(Conv2D(64, (3, 3)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Dropout(0.25))

model.add(Flatten())
model.add(Dense(256))
model.add(Activation('relu'))
model.add(Dropout(0.5))
model.add(Dense(num_classes))
model.add(Activation('softmax'))

771

asked Apr 30 '19 18:04

clifgray

1 Answers

Let me explain the operations you mentioned in a bit of detail so you understand the differences between their intuition and usage:

Cascaded cross-channel parametric pooling:

This is introduced in the Network-in-Network paper and is implemented in Keras as GlobalAveragePooling2D(). This operation averages over the output of each feature map in the previous layers.

It is a structural regularizer that enforces correspondence between feature maps and categories, so feature maps can be interpreted as category confidence. It reduces parameter count and sums up spatial information and hence, it is more robust to spatial translations of the input.

GlobalAveragePooling2D() is generally used without Dense() layers in the model before it.

Conv1D:

Conv1D() is a convolution operation exactly similar to Conv2D() but it applies only to one dimension. Conv1D() is generally used on sequences or other 1D data, not as much on images.

Depthwise Separable Convolution:

Quoting from the Keras documentation

Separable convolutions consist in first performing a depthwise spatial convolution (which acts on each input channel separately) followed by a pointwise convolution which mixes together the resulting output channels. The depth_multiplier argument controls how many output channels are generated per input channel in the depthwise step.

This blog explains the depthwise separable convolution pretty well.

Conv2D(num_filters, (1, 1)):

This is generally known as 1x1 convolution, introduced in the Network-in-Network paper.

The 1x1 convolutional filters are used to reduce/increase dimensionality in the filter dimension, without affecting the spatial dimensions. This is also used in the Google Inception architecture for dimensionality reduction in filter space.

In your particular case, I am not exactly sure which of this techniques you can use. I do not think Conv1D would be of much use. You can definitely use GlobalMaxPooling or GlobalAveragePooling as long as you do not use Dense before them. This is helpful to get spatial information. Depthwise Separable Convolution can be used as well in place of your Conv2D layers. Conv2D(num_filters, (1, 1)) is very helpful for dimensionality reduction in filter space, mostly towards the end of your model architecture.

Maybe, if you follow the resources you get a better understanding of the operations and see how they apply to your problem.

199

answered Sep 30 '22 09:09

Anakin

Related questions
                            
                                Can I use a machine learning model as the objective function in an optimization problem?
                            
                                Equivalent Python code for mutate_if from tidyverse
                            
                                py2neo - The client is unauthorized due to authentication failure
                            
                                Why do two sub-processes stop each other from working?
                            
                                Merging duplicate columns while reading CSV file
                            
                                Setting up Python Conda Environment in Heroku
                            
                                How to display multiple annotations in Seaborn Heatmap cells?
                            
                                Gcloud update broke my app -- GCP Python 2.7
                            
                                Django pass Haystack highlighter result to a view
                            
                                Difference between 3D-tensor and 4D-tensor for images input of DL Keras framework
                            
                                Is it possible to generate gremlin queries from bytecode in python
                            
                                Is there anyway I can set the working directory in airflow where my codes will run?
                            
                                Docker container fails to run, Error : python3: can't open file 'flask run --host=0.0.0.0': [Errno 2] No such file or directory
                            
                                Why does Pillow convert return colours outside the specified palette?
                            
                                How to smooth and plot x vs weighted average of y, weighted by x?
                            
                                How to create nested namespace packages for setuptools distribution
                            
                                AttributeError: type object 'spacy.syntax.nn_parser.array' has no attribute '__reduce_cython__' , (adding Paths to virtual environments)
                            
                                Understanding slots and getting its values in Alexa Skills Kit
                            
                                Google CoLab - How to run a jupyter notebook file that is in the 'Files' tab (i.e. /content/) of my CoLab environment
                            
                                How to downgrade the boto3 version in an AWS Lambda Function

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Convolving Across Channels in Keras CNN: Conv1D, Depthwise Separable Conv, CCCP?

Tags:

python

tensorflow

keras

conv-neural-network