In a CNN for binary classification of images, should the shape of the output be (number of images, 1) or (number of images, 2)? Specifically, here are 2 kinds of last layer in a CNN:
keras.layers.Dense(2, activation = 'softmax')(previousLayer)
or
keras.layers.Dense(1, activation = 'softmax')(previousLayer)
In the first case, for every image there are 2 output values (probability of belonging to group 1 and probability of belonging to group 2). In the second case, each image has only 1 output value, which is its label (0 or 1, label=1 means it belongs to group 1).
Which one is correct? Is there an intrinsic difference? I don't want to recognize any objects in those images, just divide them into 2 groups.
Thanks a lot!
A Dense layer is a simple layer of neurons in which each neuron receives input from all the neurons of the previous layer, which is why it is called dense. A Dense layer is used to classify an image based on the output of the convolutional layers. A layer contains many such neurons.
As is well known, the main difference between a Convolutional layer and a Dense layer is that the Convolutional layer uses fewer parameters by forcing input values to share them. The Dense layer uses a linear operation, meaning every output is a function of every input.
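For illustration, here is a minimal sketch (the layer sizes are arbitrary assumptions, not from the original post) showing that parameter-count difference: the convolutional layer shares its kernel weights across spatial positions, while the dense layer has one weight per input-output pair.
import tensorflow as tf
from tensorflow import keras

inputs = keras.Input(shape=(28, 28, 1))
x = keras.layers.Conv2D(32, kernel_size=3)(inputs)   # 3*3*1*32 + 32 = 320 parameters
x = keras.layers.Flatten()(x)                        # 26*26*32 = 21,632 values per image
x = keras.layers.Dense(32)(x)                        # 21,632*32 + 32 = 692,256 parameters

keras.Model(inputs, x).summary()                     # the Dense layer dominates the parameter count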
The CNN is composed of 2 batch-norm layers, 3 convolutional layers, 2 max-pooling layers, 3 hidden dense layers, 4 dropout layers (used only during training), and one output layer.
The Flatten layer converts the 28x28x32 output of the convolutional layer into a single one-dimensional vector that can be used as input for a dense layer. The last dense layer has the most parameters: it connects every single output 'pixel' from the convolutional layer to the 10 output classes.
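Here is a sketch of that flatten-then-classify head, assuming the 28x28x32 feature map and the 10 output classes mentioned above:
from tensorflow import keras

features = keras.Input(shape=(28, 28, 32))                  # output of the convolutional block
x = keras.layers.Flatten()(features)                        # 28*28*32 = 25,088 values per image
outputs = keras.layers.Dense(10, activation='softmax')(x)   # 25,088*10 + 10 = 250,890 parameters

keras.Model(features, outputs).summary()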
What really is the difference between a Dense layer and an Output layer in a CNN? In a CNN with this kind of architecture, should one say Fully-connected layer = Dense layer + Output layer, or Fully-connected layer = Dense layer alone? The convolutional part is used as a dimensionality-reduction technique to map the input vector X to a smaller one.
The last Dense layer of a CNN model uses 'softmax' activation to process the output, with the number of classes equal to the number of neurons in the final output layer, e.g. tf.keras.layers.Dense(6, activation='softmax'). A multi-class model uses the 'categorical_crossentropy' loss function to compute the loss value.
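As a hedged sketch of that multi-class setup (the backbone and input shape are assumptions), a 6-class softmax head compiled with categorical_crossentropy might look like this:
from tensorflow import keras

inputs = keras.Input(shape=(64, 64, 3))                     # input shape is an assumption
x = keras.layers.Conv2D(16, 3, activation='relu')(inputs)
x = keras.layers.GlobalAveragePooling2D()(x)
outputs = keras.layers.Dense(6, activation='softmax')(x)    # 6 neurons = 6 classes

model = keras.Model(inputs, outputs)
model.compile(optimizer='adam',
              loss='categorical_crossentropy',              # expects one-hot labels of shape (N, 6)
              metrics=['accuracy'])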
Dense layers are the ones most commonly used as output layers. The activation used is 'softmax', which gives a probability for each class, and the probabilities sum to 1. The model makes its prediction based on the class with the highest probability.
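For example (reusing the 6-class model from the sketch above, with random data purely as a placeholder), the predicted class is simply the argmax of the softmax probabilities:
import numpy as np

x_batch = np.random.rand(4, 64, 64, 3).astype('float32')   # 4 dummy images, just to make this runnable
probs = model.predict(x_batch)                              # shape (4, 6); each row sums to 1
predicted_class = np.argmax(probs, axis=1)                  # class with the highest probability
print(predicted_class)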
The first one is the correct solution:
keras.layers.Dense(2, activation = 'softmax')(previousLayer)
Usually, we use the softmax activation function for classification tasks, and the output width is the number of categories. This means that if you want to classify one object into three categories with the labels A, B, or C, you need the Dense layer to generate an output with a shape of (None, 3). Then you can use a cross-entropy loss function to compute the loss, automatically calculate the gradients, and run the back-propagation process.

If you only generate one value with the Dense layer, you get a tensor with a shape of (None, 1) - a single numeric value, as in a regression task. You would be using the value of that output to represent the category. That can work, but it does not behave like the general solution for a classification task.
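Here is a minimal sketch of the recommended (None, 2) softmax head; the convolutional backbone and the label format (integer labels 0/1 with sparse_categorical_crossentropy) are assumptions, not part of the original question:
from tensorflow import keras

inputs = keras.Input(shape=(64, 64, 3))                     # input shape is an assumption
x = keras.layers.Conv2D(16, 3, activation='relu')(inputs)
x = keras.layers.GlobalAveragePooling2D()(x)
outputs = keras.layers.Dense(2, activation='softmax')(x)    # output shape (None, 2)

model = keras.Model(inputs, outputs)
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',       # cross-entropy on integer labels 0/1
              metrics=['accuracy'])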
The difference is whether the class probabilities are independent of each other (multi-label classification) or not.
When there are 2 classes and you generally have P(c=1) + P(c=0) = 1, then
keras.layers.Dense(2, activation = 'softmax')
keras.layers.Dense(1, activation = 'sigmoid')
are both correct in terms of class probabilities. The only difference is how you supply the labels during training. But
keras.layers.Dense(2, activation = 'sigmoid')
is incorrect in that context. However, it is a correct implementation if you have P(c=1) + P(c=0) != 1. This is the case for multi-label classification, where an instance may belong to more than one correct class.
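As a sketch (the backbone and the label/loss pairing spelled out in the comments are assumptions), the two equivalent binary heads differ only in the activation and in how the labels are supplied:
from tensorflow import keras

def binary_head(units):
    inputs = keras.Input(shape=(64, 64, 3))                      # input shape is an assumption
    x = keras.layers.Conv2D(16, 3, activation='relu')(inputs)
    x = keras.layers.GlobalAveragePooling2D()(x)
    if units == 1:
        out = keras.layers.Dense(1, activation='sigmoid')(x)     # labels: 0/1, shape (N, 1)
        loss = 'binary_crossentropy'
    else:
        out = keras.layers.Dense(2, activation='softmax')(x)     # labels: one-hot, shape (N, 2)
        loss = 'categorical_crossentropy'
    model = keras.Model(inputs, out)
    model.compile(optimizer='adam', loss=loss, metrics=['accuracy'])
    return model

sigmoid_model = binary_head(1)   # P(c=1) from a single sigmoid unit
softmax_model = binary_head(2)   # [P(c=0), P(c=1)] from a two-unit softmax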