<p>Is "semantic segmentation" just a pleonasm, or is there a difference between "semantic segmentation" and "segmentation"? How do these differ from "scene labeling" or "scene parsing"?</p> <p>What is the difference between pixel-level and pixelwise segmentation?</p> <p>(Side question: when you have this kind of pixel-wise annotation, do you get object detection for free, or is there still work left to do?)</p> <p>Please give a source for your definitions.</p> <h3>Sources that use "semantic segmentation"</h3> <ul> <li>Jonathan Long, Evan Shelhamer, Trevor Darrell: Fully Convolutional Networks for Semantic Segmentation. CVPR, 2015 and PAMI, 2016 </li> <li>Hong, Seunghoon, Hyeonwoo Noh, and Bohyung Han: "Decoupled Deep Neural Network for Semi-supervised Semantic Segmentation." arXiv preprint arXiv:1506.04924, 2015.</li> <li>V. Lempitsky, A. Vedaldi, and A. Zisserman: A pylon model for semantic segmentation. In Advances in Neural Information Processing Systems, 2011.</li> </ul> <h3>Sources that use "scene labeling"</h3> <ul> <li>Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun: <a href="http://yann.lecun.com/exdb/publis/pdf/farabet-pami-13.pdf" rel="noreferrer">Learning Hierarchical Features for Scene Labeling</a>. In Pattern Analysis and Machine Intelligence, 2013.</li> </ul> <h3>Source that uses "pixel-level"</h3> <ul> <li>Pinheiro, Pedro O., and Ronan Collobert: "From Image-level to Pixel-level Labeling with Convolutional Networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015. (see http://arxiv.org/abs/1411.6228)</li> </ul> <h3>Source that uses "pixelwise"</h3> <ul> <li>Li, Hongsheng, Rui Zhao, and Xiaogang Wang: "Highly efficient forward and backward propagation of convolutional neural networks for pixelwise classification." 
arXiv preprint arXiv:1412.4526, 2014.</li> </ul> <h3>Google Ngrams</h3> <p>"Semantic segmentation" seems to have been used more often than "scene labeling" in recent years.</p> <p><img src="https://i.stack.imgur.com/OI5p1.png" alt="enter image description here"></p>

<p><strong>"Segmentation"</strong> is a partition of an image into several "coherent" parts, but <em>without</em> any attempt at understanding what these parts represent. One of the most famous works (but definitely not the first) is Shi and Malik, "Normalized Cuts and Image Segmentation", PAMI 2000. These works attempt to define "coherence" in terms of low-level cues such as color, texture and smoothness of boundary. You can trace these works back to Gestalt theory.</p> <p>On the other hand, <strong>"semantic segmentation"</strong> attempts to partition the image into semantically meaningful parts <em>and</em> to classify each part into one of a set of predetermined classes. You can also achieve the same goal by classifying each pixel (rather than the entire image or segment). In that case you are doing pixel-wise classification, which leads to the same end result by a slightly different path.</p> <p>So, I suppose you can say that "semantic segmentation", "scene labeling" and "pixelwise classification" are basically trying to achieve the same goal: semantically understanding the role of each pixel in the image. You can take many paths to reach that goal, and these paths lead to slight nuances in the terminology.</p>
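<p>To make the distinction concrete, here is a minimal sketch in Python with NumPy. It is not taken from any of the cited papers. The function names <code>segment_by_intensity</code> and <code>classify_pixels</code>, the class list, the tolerance and the toy "scores" are all assumptions made up for illustration: the first function groups pixels using only a low-level cue (intensity) and returns region ids with no meaning, while the second assigns every pixel one of a fixed set of classes by taking the arg-max over per-pixel class scores, such as a classifier might produce.</p>

```python
import numpy as np


def segment_by_intensity(image, tol=0.1):
    """Class-agnostic segmentation (toy flood fill, an illustrative assumption).

    Groups 4-connected pixels whose intensity is within `tol` of the
    region's seed pixel. The returned ids are arbitrary region numbers;
    they say nothing about what a region depicts.
    """
    h, w = image.shape
    regions = -np.ones((h, w), dtype=int)  # -1 marks "not yet assigned"
    next_id = 0
    for sy in range(h):
        for sx in range(w):
            if regions[sy, sx] != -1:
                continue
            seed = image[sy, sx]
            regions[sy, sx] = next_id
            stack = [(sy, sx)]
            while stack:
                y, x = stack.pop()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < h and 0 <= nx < w
                    if (inside and regions[ny, nx] == -1
                            and abs(image[ny, nx] - seed) <= tol):
                        regions[ny, nx] = next_id
                        stack.append((ny, nx))
            next_id += 1
    return regions


def classify_pixels(scores):
    """Pixel-wise classification.

    `scores` has shape (num_classes, H, W); each pixel gets the index of
    its highest-scoring class, which yields a semantic label map.
    """
    return np.argmax(scores, axis=0)


# Hypothetical class names, chosen only for this example.
CLASSES = ["sky", "road"]

# Toy grayscale image: a bright upper half and a dark lower half.
image = np.array([
    [0.9, 0.9, 0.9, 0.9],
    [0.9, 0.9, 0.9, 0.9],
    [0.2, 0.2, 0.2, 0.2],
    [0.2, 0.2, 0.2, 0.2],
])

# Plain segmentation: two regions, identified only by arbitrary ids.
regions = segment_by_intensity(image)

# Made-up per-pixel class scores (bright pixels score high for "sky").
scores = np.stack([image, 1.0 - image])

# Semantic segmentation via pixel-wise classification: each pixel now
# carries a class index that can be looked up in CLASSES.
labels = classify_pixels(scores)
```

<p>In this toy case both outputs split the image into the same two parts, but only <code>labels</code> tells you which part is which class; <code>regions</code> is just a partition.</p>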

What is "semantic segmentation" compared to "segmentation" and "scene labeling"?



asked Nov 26 '15 22:11

Martin Thoma

1 Answer


answered Oct 01 '22 05:10

Shai


Tags:

image-processing

computer-vision

image-segmentation

object-detection

semantic-segmentation