Why do we normally have more than one fully connected layer in the late stages of CNNs?

Question

I have noticed that many popular convolutional neural network architectures (e.g. AlexNet) use more than one fully connected layer, with almost the same dimensions, to gather the responses to the features detected in the earlier layers.

Why don't we use just one FC layer for that? Why might this hierarchical arrangement of fully connected layers be more useful?


SpinyNormam · Accepted Answer

Because some functions, such as XOR, can't be modeled by a single layer. In this type of architecture, the convolutional layers compute local features, and the fully connected output layer(s) then combine these local features to derive the final outputs. So you can think of the fully connected layers as a semi-independent mapping from features to outputs, and if this mapping is complex, you may need the expressive power of multiple layers.
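To make the XOR point concrete, here is a minimal sketch, assuming NumPy is available. The hand-picked weights and the names `single_layer_pred` and `two_layer` are illustrative assumptions, not part of any particular architecture. It fits the best single affine layer to XOR by least squares, which can do no better than predicting 0.5 for every input, and then shows that two layers with a ReLU in between reproduce XOR exactly.

```python
import numpy as np

# The four XOR inputs and their targets.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 1, 1, 0], dtype=float)

# Single layer: the best affine map w1*x1 + w2*x2 + b in the least-squares sense.
A = np.hstack([X, np.ones((4, 1))])          # append a bias column
w, *_ = np.linalg.lstsq(A, y, rcond=None)
single_layer_pred = A @ w                    # about 0.5 for every input


def two_layer(x):
    """Two fully connected layers with hand-set (illustrative) weights."""
    W1 = np.array([[1.0, 1.0],
                   [1.0, 1.0]])
    b1 = np.array([0.0, -1.0])
    h = np.maximum(0.0, x @ W1 + b1)         # hidden layer with ReLU
    w2 = np.array([1.0, -2.0])
    return h @ w2                            # linear output layer


print("single layer:", single_layer_pred)
print("two layers:  ", two_layer(X))
```

Adding a monotonic nonlinearity such as a sigmoid after the single layer would not help, since the decision boundary would still be a straight line and XOR is not linearly separable. The hidden layer is what lets the second layer combine intermediate features into the final output.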

Tags:

image-processing

machine-learning

deep-learning

computer-vision

conv-neural-network

Ali Sharifi B.
