Is data augmentation really needed in machine learning? [closed]

I would like to understand how important data augmentation (rotating images by various angles, flipping them) is when preparing a dataset for a machine learning problem.

Is it really needed? Or will a CNN cope regardless of how the data are transformed?

So I set up a classification task with 2 classes to draw some conclusions:

Arrow shapes
Circle shapes

The idea is to train on shapes in only one orientation (I used arrows pointing right) and then test the model on a different orientation (arrows pointing downwards) that was never seen during training.
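For reference, the kind of augmentation in question can be sketched in a few lines. This is a minimal, dependency-free illustration and not the code used for the experiment: images are plain nested lists, and the names `rotate90_cw`, `flip_horizontal`, `augment_orientations` and the tiny `RIGHT_ARROW` grid are all hypothetical. In practice a library such as NumPy or TensorFlow's image utilities would do the same work.

```python
def rotate90_cw(image):
    """Rotate a 2D grid (list of rows) 90 degrees clockwise."""
    # Reverse the row order, then transpose.
    return [list(row) for row in zip(*image[::-1])]


def flip_horizontal(image):
    """Mirror a 2D grid left to right."""
    return [row[::-1] for row in image]


def augment_orientations(image):
    """Return the four 90-degree rotations of an image plus their mirror images.

    Symmetric shapes will produce some duplicates among the eight results.
    """
    variants = []
    current = [list(row) for row in image]
    for _ in range(4):
        variants.append(current)
        variants.append(flip_horizontal(current))
        current = rotate90_cw(current)
    return variants


# Hypothetical 5x5 binary image of an arrow pointing right (1 = ink).
RIGHT_ARROW = [
    [0, 0, 1, 0, 0],
    [0, 0, 0, 1, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 0, 1, 0],
    [0, 0, 1, 0, 0],
]

# One clockwise rotation turns the right-pointing arrow into a downward one.
DOWN_ARROW = rotate90_cw(RIGHT_ARROW)
```

Rotating the right-pointing arrow once clockwise yields a downward-pointing one, which is the orientation withheld from training here, so `augment_orientations(RIGHT_ARROW)` would already contain a test-like sample.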

Some of the samples used in training:

[image: training samples]

Some of the samples used in testing:

[image: test samples]

This is the entire dataset I am using to create a TensorFlow model: https://bitbucket.org/akhileshmalviya/samples/src/bab50b85d826?at=master

The results I got leave me wondering:

(i) Except for a few downward arrows, all the others are predicted correctly as arrows. Does that mean data augmentation is not needed at all?

(ii) Or is this the right use case for understanding the importance of data augmentation?

Kindly share your thoughts. Any help would be much appreciated!

asked Jun 22 '17 09:06

Karthik

1 Answer

Whether data augmentation helps depends on the data.

In general, you need it when your training data is complex and you have only a few samples.

A neural network can easily learn to extract simple patterns like arcs or straight lines and these patterns are enough to classify your data.
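To make "simple patterns" concrete, here is a rough sketch of the kind of filter a first convolutional layer might end up with. It is hand-written rather than learned, and the names `correlate2d_valid`, `HORIZONTAL_LINE`, `STROKE` and `VERTICAL_STROKE` are made up for this illustration; a real network learns its own kernel values.

```python
def correlate2d_valid(image, kernel):
    """Slide `kernel` over `image` without padding and return the response map."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(kh)
                for j in range(kw)
            )
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


# Hand-written 3x3 filter that responds to horizontal strokes (an assumption,
# standing in for a learned kernel).
HORIZONTAL_LINE = [
    [-1, -1, -1],
    [2, 2, 2],
    [-1, -1, -1],
]

# 5x5 image containing a single horizontal stroke in the middle row.
STROKE = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

# The same stroke turned by 90 degrees (middle column instead of middle row).
VERTICAL_STROKE = [[0, 0, 1, 0, 0] for _ in range(5)]

response = correlate2d_valid(STROKE, HORIZONTAL_LINE)
vertical_response = correlate2d_valid(VERTICAL_STROKE, HORIZONTAL_LINE)
```

On `STROKE` the middle row of `response` is 6 and the rows above and below are -3, so the filter fires exactly where the stroke is. On `VERTICAL_STROKE` the response is 0 everywhere, because each column of the kernel sums to zero.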

In your case data augmentation can barely help: the features the network will learn to extract are simple and very different between the two classes.

When you have to deal with complex structures instead (cats, dogs, airplanes, ...), you can't rely on simple features like edges and arcs. You have to show your network that the instances you're trying to classify have high variance and that the extracted features can be combined in many different ways for the same subject.

Think about a cat: it can be of any color, the picture can be taken under different lighting conditions, its body can be in any position, the picture can be taken at any orientation... To correctly classify such different instances, the network must learn to extract robust features, which it can learn only after seeing a lot of different inputs.
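As a rough sketch of how that variety can be produced artificially, the snippet below randomly changes orientation and lighting for each training image. It uses only the standard library and treats an image as a list of rows of floats in [0, 1]; the function names, the `max_brightness_shift` parameter and its default of 0.2 are assumptions made for illustration, not settings from any particular library.

```python
import random


def random_augment(image, rng, max_brightness_shift=0.2):
    """Return a randomly flipped, rotated and brightness-shifted copy of `image`."""
    out = [list(row) for row in image]

    # Orientation: mirror left to right about half of the time.
    if rng.random() < 0.5:
        out = [row[::-1] for row in out]

    # Orientation: rotate clockwise by a random multiple of 90 degrees.
    for _ in range(rng.randrange(4)):
        out = [list(row) for row in zip(*out[::-1])]

    # Lighting: add one random offset to every pixel, clipped to [0, 1].
    shift = rng.uniform(-max_brightness_shift, max_brightness_shift)
    return [[min(1.0, max(0.0, p + shift)) for p in row] for row in out]


def augmented_batch(images, copies_per_image, seed=0):
    """Create `copies_per_image` random variants of every image, reproducibly."""
    rng = random.Random(seed)
    return [
        random_augment(image, rng)
        for image in images
        for _ in range(copies_per_image)
    ]
```

For example, `augmented_batch([image], 5)` returns five variants of one square image. Whether flips and rotations are appropriate depends on the task: they are harmless for cats, but would change the label if the orientation itself were the thing being classified.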

In your case, instead, simple features can completely discriminate your input, so any sort of data augmentation would help only a little.

answered Sep 25 '22 01:09

nessuno

Tags: machine-learning, tensorflow, conv-neural-network