Gradient calculation in Hamming loss for multi-label classification

Tags:

I am doing a multilabel classification using some recurrent neural network structure. My question is about the loss function: my output will be vectors of true/false (1/0) values to indicate each label's class. Many resources said the Hamming loss is the appropriate objective. However, the Hamming loss has a problem in the gradient calculation: H = average (y_true XOR y_pred),the XOR cannot derive the gradient of the loss. So is there other loss functions for training multilabel classification? I've tried MSE and binary cross-entropy with individual sigmoid input.

365

asked Feb 08 '17 23:02

William Chou

1 Answers

H = average(y_true*(1-y_pred)+(1-y_true)*y_pred)

is a continuous approximation of the hamming loss.

137

answered Oct 10 '22 18:10

Juan Wang

Related questions
                            
                                Object categories of pretrained imagenet model in caffe
                            
                                Spark MlLib linear regression (Linear least squares) giving random results
                            
                                Missing value error in the randomForest package of R
                            
                                Normalizing a list of restaurant dishes
                            
                                Neural Network Backpropagation implementation issues
                            
                                Tensorflow: List of Tensors for Cost
                            
                                How can I handle huge matrices?
                            
                                How to determine maximum batch size for a seq2seq tensorflow RNN training model
                            
                                Python keras how to change the size of input after convolution layer into lstm layer
                            
                                Function to determine a reasonable initial guess for scipy.optimize?
                            
                                Selecting the components showing the most variance in PCA
                            
                                How to use sklearn Pipeline with custom Features?
                            
                                Caffe sigmoid cross entropy loss
                            
                                How to obtain a confidence interval or a measure of prediction dispersion when using xgboost for classification?
                            
                                SVM - Difference between Energy vs Loss vs Regularization vs Cost function
                            
                                Keras RNN loss does not decrease over epoch
                            
                                Difference between LinearRegression() and Ridge(alpha=0)
                            
                                Image resizing method during preprocessing for neural network
                            
                                GridSearch with Keras Neural Networks
                            
                                Why is binary_crossentropy more accurate than categorical_crossentropy for multiclass classification in Keras?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Gradient calculation in Hamming loss for multi-label classification

Tags:

machine-learning

neural-network

gradient-descent

hamming-distance

multilabel-classification

William Chou

People also ask

1 Answers

Juan Wang

Recent Activity

Donate For Us