I am trying to retrain the last layer of inception-resnet-v2. My plan was to collect the trainable variables of the final layer and build a train_op that minimises the loss with respect to only those variables. I implemented that as follows:
with slim.arg_scope(arg_scope):
    logits = model(images_ph, is_training=True, reuse=None)

loss = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(
    labels=labels_ph, logits=logits))
accuracy = tf.contrib.metrics.accuracy(tf.argmax(logits, 1), labels_ph)

# Train only the variables of the final layer.
train_list = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, 'InceptionResnetV2/Logits')
optimizer = tf.train.AdamOptimizer(learning_rate=FLAGS.learning_rate)
train_op = optimizer.minimize(loss, var_list=train_list)

# Restore all variables whose names don't contain 'Logits'.
restore_list = tf.get_collection(tf.GraphKeys.TRAINABLE_VARIABLES, scope='^((?!Logits).)*$')
saver = tf.train.Saver(restore_list, write_version=tf.train.SaverDef.V2)

with tf.Session() as session:
    init_op = tf.group(tf.local_variables_initializer(), tf.global_variables_initializer())
    session.run(init_op)
    saver.restore(session, '../models/inception_resnet_v2_2016_08_30.ckpt')
    # followed by code for running train_op
This doesn't seem to work (training loss and error don't improve much from their initial values). Is there a better or more elegant way to do this? It would be good learning for me if you could also tell me what's going wrong here.
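As a sanity check on the variable-filtering approach above, the negative-lookahead regex used for restore_list can be exercised on plain strings. This is a minimal sketch using Python's re module; the variable names below are hypothetical examples in the style of the slim checkpoint, not the actual graph contents:

```python
import re

# Same pattern passed as `scope` for restore_list:
# matches only names that do NOT contain 'Logits'.
pattern = re.compile(r'^((?!Logits).)*$')

# Hypothetical variable names for illustration.
names = [
    'InceptionResnetV2/Conv2d_1a_3x3/weights',
    'InceptionResnetV2/Logits/Logits/weights',
    'InceptionResnetV2/Logits/Logits/biases',
    'InceptionResnetV2/Repeat/block35_1/Conv2d_1x1/weights',
]

restore = [n for n in names if pattern.match(n)]   # backbone variables
train = [n for n in names if not pattern.match(n)]  # last-layer variables

print(restore)  # the two non-Logits names
print(train)    # the two Logits names
```

Printing both lists before training is a cheap way to confirm the split really separates the last layer from everything being restored.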
Inception-ResNet-v2 is a convolutional neural network that is trained on more than a million images from the ImageNet database [1]. The network is 164 layers deep and can classify images into 1000 object categories, such as keyboard, mouse, pencil, and many animals.
While Inception focuses on computational cost, ResNet focuses on accuracy. Intuitively, deeper networks should not perform worse than shallower ones, but in practice deeper networks did perform worse, caused not by overfitting but by an optimization problem.
● Inception-ResNet-v1: a hybrid Inception version with a computational cost similar to Inception-v3.
● Inception-ResNet-v2: a costlier hybrid Inception version with significantly improved recognition performance.
ResNet-v2 is a neural network architecture used for image classification, regression, and feature extraction. It uses skip connections to add the input of a group of convolutions to its output.
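The skip connection described above can be sketched numerically. This is a toy illustration with NumPy, where the group of convolutions is replaced by a simple linear map; none of the names below come from the actual ResNet code:

```python
import numpy as np

def residual_block(x, transform):
    """Skip connection: add the block input to the output of the transform."""
    return transform(x) + x

# Toy stand-in for a group of convolutions: a fixed linear map.
w = np.array([[0.5, 0.0],
              [0.0, 0.5]])
f = lambda x: w @ x

x = np.array([1.0, 2.0])
y = residual_block(x, f)  # f(x) + x = [1.5, 3.0]
```

Because the identity path is always present, the block only has to learn the residual f(x) = y - x, which is what makes very deep stacks easier to optimize.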
There are several things:
Is the scope correctly set? If you don't use L2 regularization and batch normalization of the gradients, you may fall into a local minimum very soon, and the network will be unable to learn:
from nets import inception_resnet_v2 as net

with slim.arg_scope(net.inception_resnet_v2_arg_scope()):
    logits, end_points = net.inception_resnet_v2(images_ph, num_classes=num_classes,
                                                 is_training=True)
you should add the regularization variables to the loss (or at least the ones of the last layer):
regularization_losses = tf.get_collection(tf.GraphKeys.REGULARIZATION_LOSSES)
all_losses = [loss] + regularization_losses
total_loss = tf.add_n(all_losses, name='total_loss')
Training only the fully connected layer might not be a good idea. I would train the whole network, as the features you need for your class aren't necessarily defined in the last layer but a few layers before, and you need to change them.
Double check that the train_op runs after the loss:

from tensorflow.python.framework import ops
from tensorflow.python.ops import control_flow_ops

with ops.name_scope('train_op'):
    train_op = control_flow_ops.with_dependencies([train_op], total_loss)