
How to do transfer learning for MNIST dataset?

I have been trying to use transfer learning for the MNIST dataset using VGG/Inception. But both of these networks accept images of at least 224x224x3 in size. How can I rescale the 28x28x1 MNIST images to 224x224x3 to do transfer learning?

asked Dec 17 '17 by user1159517

People also ask

How long does it take to train on MNIST?

A quick refresher on what the MNIST dataset comprises: it's a dataset of labelled handwritten-digit images in a 28 x 28 format. Long story short, I was able to build a classifier that detected the test set with an accuracy of ~98%. It took about twenty minutes to train on my MacBook.

What can I learn about MNIST dataset?

The MNIST dataset is an acronym that stands for the Modified National Institute of Standards and Technology dataset. It is a dataset of 60,000 small square 28×28 pixel grayscale images of handwritten single digits between 0 and 9.

What is transfer learning in AlexNet?

Transfer learning is commonly used in deep learning applications. You can take a pretrained network and use it as a starting point to learn a new task. Fine-tuning a network with transfer learning is usually much faster and easier than training a network with randomly initialized weights from scratch.


2 Answers

A common way to do what you're asking is to simply resize the images to the resolution required by the CNN's input layer. Since you've tagged your question with keras: Keras has a preprocessing module that lets you load images and optionally specify the target size to scale them to. If you look at the actual source of the method (https://github.com/keras-team/keras/blob/master/keras/preprocessing/image.py#L321), it internally uses Pillow's interpolation methods to rescale the image to the desired resolution.

In addition, because the MNIST digits are originally grayscale, you will need to replicate the single channel into three channels so that the image artificially becomes RGB. This means the red, green and blue channels are all identical copies of the original grayscale image. The load_img method has an additional flag called grayscale, and you can set it to False to load the image as RGB.
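Here is a minimal sketch of both steps (upsampling and channel replication) working directly on the MNIST arrays from keras.datasets, using NumPy and Pillow. The function name and the subset size are my own illustration, not from the answer:

```python
import numpy as np
from PIL import Image
from keras.datasets import mnist

# Load MNIST as 28x28 grayscale uint8 arrays
(x_train, y_train), (x_test, y_test) = mnist.load_data()

def to_vgg_input(img_28x28):
    """Upscale a single 28x28 grayscale digit to 224x224x3.

    Bilinear interpolation avoids the blocky artifacts that
    nearest-neighbour upsampling would introduce.
    """
    img = Image.fromarray(img_28x28)                 # 28x28, mode 'L'
    img = img.resize((224, 224), Image.BILINEAR)     # upsample
    arr = np.asarray(img, dtype=np.float32)          # 224x224
    return np.stack([arr, arr, arr], axis=-1)        # replicate to 224x224x3

# Only a subset here: 60,000 images at 224x224x3 float32 would not fit in memory
x_train_rgb = np.stack([to_vgg_input(im) for im in x_train[:1000]])
print(x_train_rgb.shape)  # (1000, 224, 224, 3)
```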

Once you have these images converted to RGB and rescaled, you can go ahead and perform transfer learning with VGG19. In fact, it has been done before. Consult this link: https://www.analyticsvidhya.com/blog/2017/06/transfer-learning-the-art-of-fine-tuning-a-pre-trained-model/ and look at Section 6: Use the pre-trained model for identifying digits.
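As a rough sketch of what that transfer-learning setup looks like in Keras (the size of the classification head is my own choice, not prescribed by the blog post):

```python
from keras.applications import VGG19
from keras.layers import Dense, Flatten
from keras.models import Model

# Pretrained convolutional base, without the ImageNet classifier head
base = VGG19(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

# Freeze the pretrained weights so only the new head is trained
for layer in base.layers:
    layer.trainable = False

# New classification head for the 10 MNIST digits
x = Flatten()(base.output)
x = Dense(256, activation='relu')(x)
out = Dense(10, activation='softmax')(x)

model = Model(inputs=base.input, outputs=out)
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# model.fit(x_train_rgb, y_train[:1000], epochs=3, batch_size=32)
```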

I'd like to give you fair warning that taking a 28 x 28 image and resizing it to 224 x 224 will produce severe interpolation artifacts. You would be performing transfer learning on image data that contains noise from the upsampling, but that is what was done in the blog post linked above. I would recommend changing the interpolation to something like bilinear or bicubic; the default is nearest neighbour, which is terrible for upsampling images.
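If you go through Keras's own loader, recent versions of load_img expose an interpolation argument that overrides that default (assuming your Keras version has it). A small sketch, where 'digit.png' is a hypothetical file path:

```python
from keras.preprocessing.image import load_img, img_to_array

# target_size rescales on load; interpolation replaces the default
# nearest-neighbour resampling with bilinear
img = load_img('digit.png', grayscale=False,
               target_size=(224, 224), interpolation='bilinear')
arr = img_to_array(img)   # shape (224, 224, 3)
```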

YMMV, so try resizing the images to the input layer's expected size, replicate them across three channels to make them RGB, and see what happens.

answered Sep 21 '22 by rayryeng

This greatly depends on the model you wish to use. In the case of VGGNet, you have to rescale the input to the expected target size, because the VGG network contains fully connected layers whose shape matches the feature-map dimensions after a fixed number of downsampling steps. Note that convolutional layers can take any image size thanks to parameter sharing.
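To make the constraint concrete: VGG halves the spatial resolution five times, so a 224x224 input leaves a 7x7x512 feature map, and the first fully connected layer is wired for exactly 7*7*512 = 25088 inputs; any other input size breaks that wiring. A quick shape check with Keras (a sketch, weights omitted so nothing is downloaded):

```python
from keras.applications import VGG19

model = VGG19(weights=None, include_top=True)        # classifier head included
print(model.get_layer('block5_pool').output_shape)   # (None, 7, 7, 512)
print(model.get_layer('flatten').output_shape)       # (None, 25088) -> what fc1 expects
```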

However, modern CNNs follow the trend of going fully convolutional, which removes this constraint and allows transfer learning on arbitrary input sizes. If you choose this path, take one of the latest Inception models; in that case an out-of-the-box model should be able to accept even small 28x28x1 images.
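A minimal sketch (my own illustration, not from the answer) of why a fully convolutional network is size-agnostic: with no Flatten/Dense layer tied to a fixed spatial size, global pooling collapses whatever spatial dimensions come in.

```python
from keras.layers import Conv2D, GlobalAveragePooling2D, Dense, Input
from keras.models import Model

# Spatial dimensions left as None: the network accepts any image size
inp = Input(shape=(None, None, 1))
x = Conv2D(32, 3, activation='relu', padding='same')(inp)
x = Conv2D(64, 3, activation='relu', padding='same')(x)
x = GlobalAveragePooling2D()(x)            # collapses H x W regardless of size
out = Dense(10, activation='softmax')(x)

model = Model(inp, out)
model.summary()  # works for 28x28x1 inputs as well as larger images
```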

answered Sep 23 '22 by Maxim