When people tackle semantic segmentation with CNNs, they usually train with a softmax cross-entropy loss (see Fully Convolutional Networks, Long et al.). But when it comes to comparing the performance of different approaches, measures like intersection-over-union (IoU) are reported.
My question is: why don't people train directly on the measure they want to optimize? It seems odd to optimize one measure during training but evaluate a different one for benchmarks.
I can see that IoU is problematic for training samples where a class is not present (union = 0 and intersection = 0, giving a 0/0 division). But if I can ensure that every ground-truth sample contains all classes, is there another reason not to use this measure?
By replacing the hard set operations (intersection and union of binary masks) with a soft approximation over predicted probabilities, IoU becomes differentiable and can be used as a loss function. The comparison between the IoU loss and the binary cross-entropy loss is made by testing two deep neural network models on multiple datasets and data splits.
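As a sketch of the idea (not the exact formulation from any particular paper): if the network outputs per-pixel probabilities, the intersection can be approximated by the elementwise product with the ground truth and the union by the inclusion-exclusion sum, both of which are differentiable. The small `eps` constant is an assumption added here to handle the 0/0 case mentioned in the question.

```python
import numpy as np

def soft_iou_loss(pred, target, eps=1e-6):
    """Differentiable (soft) IoU loss for a binary mask.

    pred   -- predicted probabilities in [0, 1] (e.g. sigmoid outputs)
    target -- ground-truth mask of 0s and 1s, same shape as pred
    eps    -- small smoothing constant; avoids 0/0 when both masks are empty

    Soft intersection: sum(pred * target); soft union via inclusion-exclusion:
    sum(pred) + sum(target) - intersection. Loss is 1 - IoU, so a perfect
    prediction gives 0 and a fully disjoint one gives a value near 1.
    """
    intersection = np.sum(pred * target)
    union = np.sum(pred) + np.sum(target) - intersection
    return 1.0 - (intersection + eps) / (union + eps)
```

In a real training loop the same arithmetic would be written with the framework's tensor ops (e.g. PyTorch or TensorFlow) so gradients flow through `pred`; the NumPy version above only illustrates the computation.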
To define the term: in machine learning, IoU means Intersection over Union, a metric used to evaluate deep learning algorithms by estimating how well a predicted mask or bounding box matches the ground truth.
This measure of similarity is the Jaccard index or, in the colloquial language of computer vision, intersection over union: IoU = Area of Intersection / Area of Union. An IoU score ≥ 0.5 is commonly considered good.
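For the bounding-box case, the definition above reduces to a few lines of arithmetic on axis-aligned boxes. This is a minimal sketch; the `(x1, y1, x2, y2)` corner convention is an assumption, not something fixed by the text.

```python
def box_iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2) corners.

    The intersection rectangle is bounded by the larger of the two top-left
    corners and the smaller of the two bottom-right corners; if the boxes
    do not overlap, its width or height clamps to 0.
    """
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

For example, two identical boxes give an IoU of 1.0, while two 2x2 boxes offset by one unit in each direction overlap in a 1x1 square and give 1/7.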
Since IoU ranges from 0 to 1, it is often expressed as a percentage; however, what a given IoU score means in terms of visual error is not intuitive (to me at least).
Check out this paper, where they come up with a way to make IoU differentiable. I implemented their solution with great results!