 

Keras: How is Accuracy Calculated for Multi-Label Classification?

I'm doing the Toxic Comment Text Classification Kaggle challenge. There are 6 classes: ['threat', 'severe_toxic', 'obscene', 'insult', 'identity_hate', 'toxic']. A comment can be multiple of these classes so it's a multi-label classification problem.

I built a basic neural network with Keras as follows:

from keras.models import Sequential
from keras.layers import Embedding, Flatten, Dense

model = Sequential()
model.add(Embedding(10000, 128, input_length=250))
model.add(Flatten())
model.add(Dense(100, activation='relu'))
model.add(Dense(len(classes), activation='sigmoid'))  # one sigmoid output per label
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

I run this line:

model.fit(X_train, train_y, validation_split=0.5, epochs=3)

and get 99.11% accuracy after 3 epochs.

However, 99.11% accuracy is a good bit higher than the best Kaggle submission. This makes me think I'm either a) overfitting or b) misusing Keras's accuracy metric (possibly both).

1) Seems a bit hard to overfit when I'm using 50% of my data as a validation split and only 3 epochs.

2) Is accuracy here just the percentage of the time the model gets each class correct?

So if I output [0, 0, 0, 0, 0, 1] and the correct output was [0, 0, 0, 0, 0, 0], my accuracy would be 5/6?

After a bit of thought, I sort of think the accuracy metric here is just looking at the class my model predicts with highest confidence and comparing vs. ground truth.

So if my model outputs [0, 0, 0.9, 0, 0, 0], it will compare the class at index 2 ('obscene') with the true value. Do you think this is what's happening?

Thanks for any help you can offer!

asked Jun 04 '18 17:06 by anon_swe


People also ask

How do you calculate accuracy for multi-label classification?

We can sum the per-class TP, TN, FP, and FN counts across classes to obtain global counts for the classifier as a whole (micro-averaging). This lets us compute a global accuracy score using the usual formula, Accuracy = (TP + TN) / (TP + TN + FP + FN). For example, with TP = 4, TN = 3, FP = 2, FN = 3: Accuracy = (4 + 3) / (4 + 3 + 2 + 3) = 7 / 12 ≈ 0.583 = 58.3%.
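The arithmetic above can be sketched in a few lines (the confusion counts are the hypothetical ones from the example, not from any real model):

```python
# Micro-averaged accuracy: sum TP/TN/FP/FN across all classes first,
# then apply the standard accuracy formula to the global counts.
tp, tn, fp, fn = 4, 3, 2, 3  # hypothetical global counts from the example

accuracy = (tp + tn) / (tp + tn + fp + fn)
print(round(accuracy, 3))  # 0.583
```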

How is accuracy calculated in keras?

Accuracy calculates the percentage of predicted values (yPred) that match with actual values (yTrue). For a record, if the predicted value is equal to the actual value, it is considered accurate. We then calculate Accuracy by dividing the number of accurately predicted records by the total number of records.

How do you calculate recall for multi-label classification?

Recall = (1/n) · Σ_{i=1}^{n} |Y_i ∩ h(x_i)| / |Y_i| — the fraction of the actual labels that were predicted, averaged over the n samples. For each sample, the numerator counts how many labels the predicted set h(x_i) has in common with the ground-truth set Y_i; dividing by the number of actual labels gives the fraction of true labels that were recovered.
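This example-based recall can be computed directly from binary indicator matrices. A minimal NumPy sketch (the two samples and four labels here are made up for illustration):

```python
import numpy as np

# Rows are samples, columns are labels; 1 marks a present label.
y_true = np.array([[1, 0, 1, 0],
                   [0, 1, 1, 0]])
y_pred = np.array([[1, 0, 0, 0],
                   [0, 1, 1, 1]])

# Per sample: |Y_i ∩ h(x_i)| / |Y_i|, then average over samples.
intersection = np.logical_and(y_true, y_pred).sum(axis=1)
recall = (intersection / y_true.sum(axis=1)).mean()
print(recall)  # (1/2 + 2/2) / 2 = 0.75
```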

What metric is used for multi-label classification?

The most common metrics that are used for Multi-Label Classification are as follows: Precision at k. Avg precision at k. Mean avg precision at k.


1 Answer

For multi-label classification, I think it is correct to use sigmoid as the output activation and binary_crossentropy as the loss.

If the labels are sparse multi-label — a few positive labels and a majority of negative ones — the Keras accuracy metric will be inflated by the correctly predicted negative labels. If I remember correctly, Keras does not choose the label with the highest probability. Instead, for binary outputs, each label is thresholded at 50%. So for an output like [0, 0, 0, 0, 0, 0.9], the prediction would be [0, 0, 0, 0, 0, 1], and if the actual labels were [0, 0, 0, 0, 0, 0], the accuracy would be 5/6. You can test this hypothesis by creating a model that always predicts all-negative labels and looking at the accuracy it reports.
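The 5/6 figure is easy to verify by hand: with binary_crossentropy, Keras's 'accuracy' thresholds each of the six outputs at 0.5 and averages the per-label comparisons. A small NumPy sketch of that computation (the probabilities are hypothetical):

```python
import numpy as np

# One sample, six labels. Ground truth is all-negative; the model
# (hypothetically) puts 0.9 on the last label.
y_true = np.array([[0, 0, 0, 0, 0, 0]], dtype=float)
y_pred = np.array([[0.1, 0.2, 0.1, 0.3, 0.2, 0.9]])

# Binary accuracy: threshold each output at 0.5, compare element-wise,
# then average over all label slots — not just the most confident one.
binary_acc = np.mean((y_pred > 0.5).astype(float) == y_true)
print(round(binary_acc, 3))  # 0.833, i.e. 5/6
```

This is why an all-negative baseline scores so well on sparse labels: five of the six comparisons are "correct" before the model has learned anything.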

If that's indeed the case, you may try a different metric such as top_k_categorical_accuracy.
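For intuition, top-k accuracy counts a sample as correct when the true class appears among the k highest-scoring outputs. A rough NumPy sketch of the idea (Keras's top_k_categorical_accuracy assumes one-hot y_true; the arrays below are made up):

```python
import numpy as np

def top_k_accuracy(y_true, y_pred, k=2):
    """Fraction of samples whose true class index is in the top-k predictions."""
    top_k = np.argsort(y_pred, axis=1)[:, -k:]  # indices of the k largest scores
    true_idx = np.argmax(y_true, axis=1)        # one-hot -> class index
    hits = [t in row for t, row in zip(true_idx, top_k)]
    return np.mean(hits)

y_true = np.array([[0, 0, 1], [1, 0, 0]])
y_pred = np.array([[0.2, 0.5, 0.3], [0.6, 0.3, 0.1]])
print(top_k_accuracy(y_true, y_pred))  # 1.0: both true classes land in the top 2
```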

Another remote possibility I can think of is your training data. Are the labels y somehow "leaked" into x? Just a wild guess.

answered Oct 04 '22 12:10 by neurite