Intuition behind U-net vs FCN for semantic segmentation

Question

I don't quite understand the following:

In the proposed FCN for Semantic Segmentation by Shelhamer et al, they propose a pixel-to-pixel prediction to construct masks/exact locations of objects in an image.

In the slightly modified version of the FCN for biomedical image segmentation, the U-net, the main difference seems to be "a concatenation with the correspondingly cropped feature map from the contracting path."

Now, why does this feature make a difference particularly for biomedical segmentation? The main differences I can point out for biomedical images vs other data sets is that in biomedical images there are not as rich set of features defining an object as for common every day objects. Also the size of the data set is limited. But is this extra feature inspired by these two facts or some other reason?

shasvat desai · Accepted Answer

FCN vs U-Net:

FCN

It upsamples only once. i.e. it has only one layer in the decoder
The original implementation github repo uses bilinear interpolation for upsampling the convoloved image. That is there is no learnable filter here
variants of FCN-[FCN 16s and FCN 8s] add the skip connections from lower layers to make the output robust to scale changes

U-Net

multiple upsampling layers
uses skip connections and concatenates instead of adding up
uses learnable weight filters instead of fixed interpolation technique

aivision2020 · Answer

U-Net is built upon J. Long's FCN paper. A couple of differences is that the original FCN paper used the decoder half to upsample the classification (i.e the entire second half of the net is of depth C - number of classes)

U-Net's think of the second half as being in feature space and do the final classification at the end.

Nothing about it is special to bio-medical IMO

Intuition behind U-net vs FCN for semantic segmentation

Tags:

artificial-intelligence

neural-network

image-segmentation

semantic-segmentation

convolutional-neural-network

Jonathan

2 Answers

shasvat desai

aivision2020

Recent Activity

Donate For Us

Intuition behind U-net vs FCN for semantic segmentation

Tags:

artificial-intelligence

neural-network

image-segmentation

semantic-segmentation

convolutional-neural-network

Jonathan

2 Answers

shasvat desai

aivision2020

Related questions

Recent Activity

Donate For Us