Confusion between Binary_crossentropy and Categorical_crossentropy

I am doing binary classification using a deep neural network. Whenever I use binary_crossentropy, my model does not give good accuracy (it stays close to random prediction). But if I use categorical_crossentropy with an output layer of size 2, I get good accuracy (around 0.90) in only one epoch. Can anyone please explain what is happening here?

Avijit Dasgupta asked May 25 '16


People also ask

What is the difference between Sparse_categorical_crossentropy and Categorical_crossentropy?

Simply put: categorical_crossentropy (cce) expects the labels as one-hot arrays, with one slot per category, while sparse_categorical_crossentropy (scce) expects the labels as integer indices of the matching category.

What is the difference between categorical cross-entropy and sparse categorical cross-entropy loss functions?

The only difference between sparse categorical cross-entropy and categorical cross-entropy is the format of the true labels. When we have a single-label, multi-class classification problem, the labels are mutually exclusive, meaning each data entry can belong to only one class.
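A minimal sketch of this difference using the tf.keras API (the class count and probability values below are made up for illustration):

```python
import numpy as np
import tensorflow as tf

# Hypothetical 3-class problem: the same ground truth in both label formats.
y_true_int = np.array([0, 2, 1])  # integer indices -> sparse_categorical_crossentropy
y_true_onehot = tf.keras.utils.to_categorical(y_true_int, num_classes=3)  # one-hot -> categorical_crossentropy

# Made-up predicted probabilities (rows sum to 1, as a softmax would produce).
y_pred = np.array([[0.8, 0.1, 0.1],
                   [0.2, 0.2, 0.6],
                   [0.1, 0.7, 0.2]])

cce = tf.keras.losses.CategoricalCrossentropy()
scce = tf.keras.losses.SparseCategoricalCrossentropy()

# Identical loss values; only the expected label format differs.
print(cce(y_true_onehot, y_pred).numpy())  # ~0.3635
print(scce(y_true_int, y_pred).numpy())    # ~0.3635
```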

What is the purpose of Binary_crossentropy?

binary_crossentropy: used as the loss function for binary classification models. It computes the cross-entropy loss between true labels and predicted labels. categorical_crossentropy: used as the loss function for multi-class classification models where there are two or more output labels.
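The two setups side by side, as a minimal sketch using the modern tf.keras API (input size and class count are made-up placeholders):

```python
import tensorflow as tf

# Binary classification: a single sigmoid output node + binary_crossentropy.
binary_model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu", input_shape=(20,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
binary_model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Multi-class classification: one softmax node per class + categorical_crossentropy.
# Labels must be one-hot encoded (5 classes here, also a placeholder).
multi_model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu", input_shape=(20,)),
    tf.keras.layers.Dense(5, activation="softmax"),
])
multi_model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
```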

Can I use binary cross-entropy for multiclass classification?

Binary cross-entropy can be used for multi-class classification when an observation can belong to multiple classes at the same time (multi-label classification). In that case, belonging to one class doesn't inform the model about belonging to a different class, and each output node acts as an independent binary classifier.
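A short sketch of that multi-label case, with made-up targets and predictions:

```python
import numpy as np
import tensorflow as tf

# Multi-label targets: each sample may belong to several classes at once,
# so each output node answers an independent yes/no question.
y_true = np.array([[1., 0., 1.],     # this sample is in classes 0 and 2
                   [0., 1., 0.]])
y_pred = np.array([[0.9, 0.2, 0.8],  # independent sigmoid outputs, need not sum to 1
                   [0.1, 0.7, 0.3]])

bce = tf.keras.losses.BinaryCrossentropy()
print(bce(y_true, y_pred).numpy())   # mean of the per-node binary cross-entropies
```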


1 Answer

I also had this problem while trying to use binary_crossentropy with a softmax activation in the output layer. As far as I know, softmax gives the probability of each class, so with 2 output nodes you get p(x1) and p(x2) with p(x1) + p(x2) = 1. But if you have only 1 output node, softmax will always output 1.0 (100%) regardless of the input; that's why your accuracy is close to random prediction (honestly, it will be close to the category distribution of your evaluation set).

Try changing the output-layer activation to sigmoid instead (relu in the output layer would not produce a valid probability).
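Here is a short demonstration of the pitfall and the fix, a minimal sketch using the modern tf.keras API (which postdates the original question); layer sizes and inputs are made-up placeholders:

```python
import tensorflow as tf

# The pitfall: softmax over a single output node is always 1.0,
# so the model predicts the same "probability" for every input.
logits = tf.constant([[2.3], [-1.7]])  # arbitrary single-node pre-activations
print(tf.nn.softmax(logits).numpy())   # [[1.], [1.]] -- constant, whatever the input

# The fix: sigmoid on the single node, paired with binary_crossentropy.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu", input_shape=(10,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# This is mathematically equivalent to the asker's working setup:
# 2 softmax nodes + categorical_crossentropy with one-hot labels.
```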

Nova answered Sep 20 '22