I was wondering why, in most of the GAN models I've seen (on MNIST at least), the activation function (for the discriminator and the generator) is tanh. Isn't ReLU more efficient? (That's what I always read for predictive networks.)
Thanks!
This is because the generated images are typically normalized to lie either in the range [0, 1] or [-1, 1]. So if you want your output images to be in [0, 1] you can use a sigmoid on the output layer, and if you want them to be in [-1, 1] you can use tanh.
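For example, here's a minimal sketch (PyTorch, with made-up layer sizes and a random tensor standing in for MNIST, none of which come from the thread) of how the data normalization and the output activation are paired:

```python
import torch
import torch.nn as nn

# Pair the data normalization with the output activation: mapping pixels from
# [0, 1] to [-1, 1] matches a tanh output layer; keep the data in [0, 1] and
# use a sigmoid on the output instead if you prefer that range.
real_batch = torch.rand(16, 1, 28, 28)   # stand-in for MNIST pixels in [0, 1]
real_batch = real_batch * 2.0 - 1.0      # same effect as Normalize((0.5,), (0.5,))

latent_dim = 100                         # illustrative latent size
generator = nn.Sequential(
    nn.Linear(latent_dim, 256),
    nn.ReLU(),
    nn.Linear(256, 28 * 28),
    nn.Tanh(),                           # samples land in [-1, 1], same range as the data
)

fake_batch = generator(torch.randn(16, latent_dim)).view(16, 1, 28, 28)
print(real_batch.min().item(), real_batch.max().item())   # within [-1, 1]
print(fake_batch.min().item(), fake_batch.max().item())   # within (-1, 1)
```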
Mode collapse happens when the generator fails to achieve Goal #2 (producing diverse samples that cover the data distribution), and all of the generated samples are very similar or even identical. The generator may "win" by creating one realistic data sample that always fools the discriminator, achieving Goal #1 (realism) by sacrificing Goal #2.
GANs are difficult to train because the generator and the discriminator are trained simultaneously in a two-player game: improvements to one model come at the expense of the other.
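To make that concrete, here's a toy alternating training loop (PyTorch, on 1-D data; the sizes, learning rates, and losses are illustrative assumptions, not from any answer here). Each step first pushes the discriminator to separate real from fake, then pushes the generator to fool the discriminator it was just trained against:

```python
import torch
import torch.nn as nn

# Toy 1-D GAN illustrating the simultaneous (alternating) training game.
G = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 1), nn.Tanh())
D = nn.Sequential(nn.Linear(1, 16), nn.LeakyReLU(0.2), nn.Linear(16, 1), nn.Sigmoid())

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCELoss()

real_data = 0.5 + 0.1 * torch.randn(64, 1)      # pretend "real" distribution in [-1, 1]

for step in range(200):
    # Discriminator step: label real as 1, generated as 0.
    z = torch.randn(64, 8)
    fake = G(z).detach()                        # don't backprop into G here
    d_loss = bce(D(real_data), torch.ones(64, 1)) + bce(D(fake), torch.zeros(64, 1))
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()

    # Generator step: try to make the (just updated) discriminator output 1.
    z = torch.randn(64, 8)
    g_loss = bce(D(G(z)), torch.ones(64, 1))
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()
```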
As a result, GANs are locally convergent for small enough learning rates in this case. However, the assumption of absolute continuity is not true for common use cases of GANs, where both distributions may lie on lower dimensional manifolds (Sønderby et al., 2016; Arjovsky & Bottou, 2017).
From the DCGAN paper [Radford et al. https://arxiv.org/pdf/1511.06434.pdf]...
"The ReLU activation (Nair & Hinton, 2010) is used in the generator with the exception of the output layer which uses the Tanh function. We observed that using a bounded activation allowed the model to learn more quickly to saturate and cover the color space of the training distribution. Within the discriminator we found the leaky rectified activation (Maas et al., 2013) (Xu et al., 2015) to work well, especially for higher resolution modeling. This is in contrast to the original GAN paper, which used the maxout activation (Goodfellow et al., 2013)."
It could be that the symmetry of tanh is an advantage here, since the network should be treating darker colours and lighter colours in a symmetric way.
Sometimes it depends on the range you want the activations to fall into. Whenever you see "gates" in the ML literature, you'll probably see a sigmoid, which is bounded between 0 and 1. In this case they want activations between -1 and 1, so they use tanh. This page says to use tanh, but doesn't give an explanation. DCGAN uses ReLUs or leaky ReLUs everywhere except the output of the generator. That makes sense: what if half of your embedding became zeros? It might be better to have a smoothly varying embedding between -1 and 1.
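As a quick sanity check on the "half of the embedding becomes zeros" point (just a toy comparison I ran on random pre-activations, not from any reference):

```python
import torch

# A ReLU output zeroes out every unit whose pre-activation was negative
# (roughly half, for zero-mean inputs), while tanh keeps a smoothly varying
# value in (-1, 1) for every unit.
torch.manual_seed(0)
pre_activations = torch.randn(10_000)   # stand-in for a layer's pre-activations

relu_out = torch.relu(pre_activations)
tanh_out = torch.tanh(pre_activations)

print("zeros after ReLU:", (relu_out == 0).float().mean().item())   # ~0.5
print("zeros after tanh:", (tanh_out == 0).float().mean().item())   # ~0.0
print("tanh range:", tanh_out.min().item(), tanh_out.max().item())  # within (-1, 1)
```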
I'd love to hear someone else's input, as I'm not sure.