I am reading people's implementation of DCGAN, especially this one in tensorflow. In that implementation, the author draws the losses of the discriminator and of the generator, which is shown below (images come from https://github.com/carpedm20/DCGAN-tensorflow): <img src="https://i.stack.imgur.com/PZm29.png" alt="enter image description here"> <img src="https://i.stack.imgur.com/WRpsu.png" alt="enter image description here"> Both the losses of the discriminator and of the generator don't seem to follow any pattern. Unlike general neural networks, whose loss decreases along with the increase of training iteration. How to interpret the loss when training GANs?

Unfortunately, like you've said for GANs the losses are very non-intuitive. Mostly it happens down to the fact that generator and discriminator are competing against each other, hence improvement on the one means the higher loss on the other, until this other learns better on the received loss, which screws up its competitor, etc. Now one thing that should happen often enough (depending on your data and initialisation) is that both discriminator and generator losses are converging to some permanent numbers, like this: <img src="https://i.stack.imgur.com/2WU5Y.png" alt=""> (it's ok for loss to bounce around a bit - it's just the evidence of the model trying to improve itself) This loss convergence would normally signify that the GAN model found some optimum, where it can't improve more, which also should mean that it has learned well enough. (Also note, that the numbers themselves usually aren't very informative.) Here are a few side notes, that I hope would be of help: <ul> <li>if loss haven't converged very well, it doesn't necessarily mean that the model hasn't learned anything - check the generated examples, sometimes they come out good enough. Alternatively, can try changing learning rate and other parameters.</li> <li>if the model converged well, still check the generated examples - sometimes the generator finds one/few examples that discriminator can't distinguish from the genuine data. The trouble is it always gives out these few, not creating anything new, this is called mode collapse. Usually introducing some diversity to your data helps.</li> <li>as vanilla GANs are rather unstable, I'd suggest to use <a href="https://github.com/carpedm20/DCGAN-tensorflow" rel="noreferrer">some version of the DCGAN models</a>, as they contain some features like convolutional layers and batch normalisation, that are supposed to help with the stability of the convergence. (the picture above is a result of the DCGAN rather than vanilla GAN)</li> <li>This is some common sense but still: like with most neural net structures tweaking the model, i.e. changing its parameters or/and architecture to fit your certain needs/data can improve the model or screw it. </li> </ul>

How to interpret the discriminator's loss and the generator's loss in Generative Adversarial Nets?

Tags:

I am reading people's implementation of DCGAN, especially this one in tensorflow.

In that implementation, the author draws the losses of the discriminator and of the generator, which is shown below (images come from https://github.com/carpedm20/DCGAN-tensorflow):

enter image description here

Both the losses of the discriminator and of the generator don't seem to follow any pattern. Unlike general neural networks, whose loss decreases along with the increase of training iteration. How to interpret the loss when training GANs?

829

asked Mar 09 '17 08:03

shapeare

Video Answer

1 Answers

Unfortunately, like you've said for GANs the losses are very non-intuitive. Mostly it happens down to the fact that generator and discriminator are competing against each other, hence improvement on the one means the higher loss on the other, until this other learns better on the received loss, which screws up its competitor, etc.

Now one thing that should happen often enough (depending on your data and initialisation) is that both discriminator and generator losses are converging to some permanent numbers, like this: (it's ok for loss to bounce around a bit - it's just the evidence of the model trying to improve itself)

This loss convergence would normally signify that the GAN model found some optimum, where it can't improve more, which also should mean that it has learned well enough. (Also note, that the numbers themselves usually aren't very informative.)

Here are a few side notes, that I hope would be of help:

if loss haven't converged very well, it doesn't necessarily mean that the model hasn't learned anything - check the generated examples, sometimes they come out good enough. Alternatively, can try changing learning rate and other parameters.
if the model converged well, still check the generated examples - sometimes the generator finds one/few examples that discriminator can't distinguish from the genuine data. The trouble is it always gives out these few, not creating anything new, this is called mode collapse. Usually introducing some diversity to your data helps.
as vanilla GANs are rather unstable, I'd suggest to use some version of the DCGAN models, as they contain some features like convolutional layers and batch normalisation, that are supposed to help with the stability of the convergence. (the picture above is a result of the DCGAN rather than vanilla GAN)
This is some common sense but still: like with most neural net structures tweaking the model, i.e. changing its parameters or/and architecture to fit your certain needs/data can improve the model or screw it.

108

answered Oct 04 '22 20:10

Massyanya

Related questions
                            
                                React and Redux: redirect after action
                            
                                How to Run Multiple Test Files with Haskell Stack Project
                            
                                Why doesn't returning by ref work for elements of collections?
                            
                                Lombok builder to check non null and not empty
                            
                                Why does spread syntax convert my string into an array?
                            
                                DynamoDB FilterExpression with multiple condition javascript
                            
                                Haskell singletons: What do we gain with SNat
                            
                                Is there any way to access the current locale with React-Intl?
                            
                                How do I get the name of the pipeline from inside the jenkinsfile
                            
                                How can I support an HTTP Proxy using Spring 5 WebClient?
                            
                                IIS Express not stopping when debug session ends
                            
                                Prometheus pre build binary for Mac OS X

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With