I am training an image classification CNN using Keras.
Using the ImageDataGenerator class, I apply some random transformations to the training images (e.g. rotation, shearing, zooming).
My understanding is that these transformations are applied randomly to each image before it is passed to the model.
But some things are not clear to me:
1) How can I make sure that specific rotations of an image (e.g. 90°, 180°, 270°) are ALL included during training?
2) The steps_per_epoch parameter of model.fit_generator should be set to the number of unique samples in the dataset divided by the batch size defined in the flow_from_directory method. Does this still apply when using the above-mentioned augmentation methods, since they increase the number of training images?
Thanks, Mario
In Keras, steps_per_epoch is an argument to the model's fit function. It is the total number of training samples divided by the chosen batch size, so as the batch size increases, the number of steps per epoch decreases, and vice versa.
An epoch consists of one full cycle through the training data, which usually means many steps. For example, if you have 2,000 images and use a batch size of 10, an epoch consists of 2,000 images / (10 images per step) = 200 steps.
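The same arithmetic as a quick code sketch (the sample count and batch size here are just illustrative numbers, not taken from the question):

import math

num_samples = 2000  # total number of unique training images (illustrative)
batch_size = 10     # batch size passed to the generator (illustrative)

# One epoch is one pass over the data, so the step count is the sample count
# divided by the batch size (rounded up when it does not divide evenly).
steps_per_epoch = math.ceil(num_samples / batch_size)
print(steps_per_epoch)  # 200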
Keras's ImageDataGenerator takes the original data as input, applies random transformations to it, and outputs only the newly transformed data.
Some time ago I asked myself the same questions, and I think a possible explanation is the following.
Consider this example:
from keras.preprocessing.image import ImageDataGenerator

aug = ImageDataGenerator(rotation_range=90, width_shift_range=0.1,
                         height_shift_range=0.1, shear_range=0.2,
                         zoom_range=0.2, horizontal_flip=True,
                         fill_mode="nearest")
For question 1): I specify rotation_range=90, which means that as you flow (retrieve) the data, the generator rotates each image by a random angle in the range of -90° to 90°. You cannot specify an exact angle, because that is exactly what ImageDataGenerator does: it generates the rotation randomly. This is also important for your second question.
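If you really need the exact 90°, 180°, and 270° orientations rather than arbitrary random angles, one possible workaround (a sketch, not part of the answer above) is to do the rotation yourself in a preprocessing_function, which ImageDataGenerator applies to each image, so over many epochs every orientation gets seen. This assumes square images, since np.rot90 would otherwise change the shape:

import numpy as np
from keras.preprocessing.image import ImageDataGenerator

def random_right_angle_rotation(image):
    # Rotate by 0, 90, 180 or 270 degrees, chosen uniformly at random.
    # Assumes square images; np.rot90 changes the shape otherwise.
    k = np.random.randint(4)
    return np.rot90(image, k)

aug = ImageDataGenerator(preprocessing_function=random_right_angle_rotation,
                         width_shift_range=0.1, height_shift_range=0.1,
                         shear_range=0.2, zoom_range=0.2,
                         horizontal_flip=True, fill_mode="nearest")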
For question 2): Yes, it still applies when using the data augmentation methods above. I was also confused at first. The reason is that, since the images are generated randomly, in each epoch the network sees images that all differ from those of the previous epoch. That is why the data is "augmented": the augmentation happens not within a single epoch but over the entire training process. However, I have seen other people set steps_per_epoch to twice the original value.
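Putting it together, a minimal sketch of how the generator and steps_per_epoch are usually wired up; the directory path, image size, batch size, and the tiny model below are placeholders I am assuming, not details from the question:

from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense
from keras.preprocessing.image import ImageDataGenerator

aug = ImageDataGenerator(rotation_range=90, width_shift_range=0.1,
                         height_shift_range=0.1, shear_range=0.2,
                         zoom_range=0.2, horizontal_flip=True,
                         fill_mode="nearest")

batch_size = 32  # illustrative
train_gen = aug.flow_from_directory("data/train",            # placeholder path
                                    target_size=(224, 224),  # placeholder size
                                    batch_size=batch_size,
                                    class_mode="categorical")

# A deliberately small placeholder model, just so the example runs end to end.
model = Sequential([
    Conv2D(16, 3, activation="relu", input_shape=(224, 224, 3)),
    MaxPooling2D(),
    Flatten(),
    Dense(train_gen.num_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])

# steps_per_epoch stays at (unique samples / batch size); augmentation does not
# change it, because the "extra" images show up over successive epochs instead.
model.fit_generator(train_gen,
                    steps_per_epoch=train_gen.samples // batch_size,
                    epochs=50)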
Hope this helps