Which layers should I freeze for fine tuning a resnet model on keras?

I already know how to do it on VGG (fine-tuning the last conv block) and Inception (fine-tuning the top two blocks). I'd like to know which layers are recommended to freeze in order to fine-tune a ResNet model.

asked Nov 09 '17 by lebebop



1 Answer

I don't think there is a state-of-the-art strategy for this, but I can share my thoughts on the topic (names of stages are similar to those presented here):

  1. In case of having a lot of real-world photos: freeze all stages up to stage 4 (leave only the 5th trainable). If you overfit, make the 5th stage have fewer layers. If you underfit, unfreeze half of the 4th stage. Remember: the deeper into the network you go, the more ImageNet-specific the features become.

  2. In case of having only a few real-world photos: cut the 5th stage, leave half of the 4th stage trainable, and freeze the rest. If it overfits, keep cutting the 4th stage; if it underfits, keep extending it.

  3. In case of having a lot of simple photo data (e.g. medical images): cut the 4th and 5th stages, leave the 3rd trainable, and freeze the rest. If it overfits, keep cutting; if it underfits, try point 2.

  4. In case of having only a few simple photos (fewer than 10K): I would advise against using ResNet50. From my experience it overfits severely. I usually implement custom topologies similar to ResNet18. If you still want to try it, follow the instructions from point 3.
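The freezing and cutting described above can be sketched in Keras. This is a hedged sketch, not the answerer's exact code: the `conv4`/`conv5` stage prefixes match the layer names in current `tf.keras` ResNet50 builds (older Keras versions used names like `res5c_branch2a`), the 10-class head is a hypothetical example, and `weights=None` is used here only to keep the sketch self-contained (use `weights="imagenet"` for actual fine-tuning):

```python
# Sketch of point 1: freeze all ResNet50 stages up to stage 4,
# leaving only the 5th stage trainable.
from tensorflow.keras.applications import ResNet50
from tensorflow.keras import layers, models

# weights=None keeps this runnable offline; use weights="imagenet" in practice.
base = ResNet50(weights=None, include_top=False, input_shape=(224, 224, 3))

# tf.keras names ResNet50 layers "conv1_...", "conv2_...", ..., "conv5_...";
# mark only the 5th-stage layers as trainable.
for layer in base.layers:
    layer.trainable = layer.name.startswith("conv5")

# Attach a small classification head (hypothetical 10-class example).
model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy")

# Sketch of "cutting" a stage (points 2 and 3): truncate the network at the
# output of stage 4 instead of using the full backbone.
truncated = models.Model(base.input, base.get_layer("conv4_block6_out").output)
```

The `layer.name.startswith("conv5")` test relies on the stage prefix naming convention, so check `[l.name for l in base.layers]` for your Keras version before trusting it.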

answered Sep 20 '22 by Marcin Możejko