TensorFlow seems to have a large collection of optimizers. Is there any high-level guideline (or review paper) on which one is best suited to specific classes of loss functions?
In one reported comparison, the Adam optimizer achieved the best accuracy (99.2%) when training a CNN for classification and segmentation.
For most custom optimizer implementations, you do not rewrite apply_gradients() itself; rather, apply_gradients() delegates to hook methods that your new Optimizer subclass must implement: _create_slots(), _prepare(), _apply_dense(), and _apply_sparse(). A sketch follows.
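As a minimal sketch, here is a plain-SGD optimizer written against the TF1-style `tf.compat.v1.train.Optimizer` API that the hook names above belong to (the class name `PlainSGD` is my own; modern TF2/Keras optimizers instead override `_resource_apply_dense()`/`_resource_apply_sparse()`):

```python
import tensorflow as tf

class PlainSGD(tf.compat.v1.train.Optimizer):
    """Minimal custom optimizer implementing the TF1-style hook methods."""

    def __init__(self, learning_rate=0.01, use_locking=False, name="PlainSGD"):
        super().__init__(use_locking, name)
        self._lr = learning_rate

    def _create_slots(self, var_list):
        # Plain SGD keeps no per-variable state, so no slots are created.
        # Adam, for example, would create "m" and "v" slots here.
        pass

    def _prepare(self):
        # Convert hyperparameters to tensors once per apply_gradients() call.
        self._lr_t = tf.convert_to_tensor(self._lr, name="learning_rate")

    def _apply_dense(self, grad, var):
        # Dense update: var <- var - lr * grad.
        lr = tf.cast(self._lr_t, var.dtype.base_dtype)
        return tf.compat.v1.assign_sub(var, lr * grad,
                                       use_locking=self._use_locking)

    def _apply_sparse(self, grad, var):
        # Sparse update: grad is an IndexedSlices (e.g. from embeddings).
        lr = tf.cast(self._lr_t, var.dtype.base_dtype)
        return tf.compat.v1.scatter_sub(var, grad.indices, lr * grad.values,
                                        use_locking=self._use_locking)
```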
TensorFlow Keras optimizer classes include:

- `Adagrad`: implements the Adagrad algorithm.
- `Adam`: implements the Adam algorithm.
- `Adamax`: implements the Adamax algorithm.
- `Ftrl`: implements the FTRL algorithm.
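Any of these classes can be swapped in via `model.compile()`. A minimal sketch (the toy model here is my own, just to make the snippet runnable):

```python
import tensorflow as tf

# Toy model; the point is that the optimizer is interchangeable.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu", input_shape=(32,)),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Pick any of the optimizer classes listed above:
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3)
# optimizer = tf.keras.optimizers.Adagrad(learning_rate=0.01)
# optimizer = tf.keras.optimizers.Ftrl(learning_rate=0.01)

model.compile(optimizer=optimizer,
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```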
It depends on your dataset and NN model, but generally I would start with Adam. Figure 2 in the Adam paper (http://arxiv.org/abs/1412.6980) shows that it works well across several tasks.
Also, you can see some very nice animations comparing optimizers at http://www.denizyuret.com/2015/03/alec-radfords-animations-for.html.