How to avoid overfitting with genetic algorithm

Tags:

genetic-algorithm

I am facing the following problem. I have a system able to produce a ranking of some operations according to their anomaly score. To improve the performance I implemented a genetic algorithm to perform a features selection, such that the most anomalous operations appears in the first positions. What I am doing is not exactly feature selection, because I am not using binary variables, rather float variables between 0-1, which sum is equal to 1.

Currently, I have a population of 200 individuals for 50 generations. I am using as the evaluation function the system itself and I evaluate the quality of the solution by using the true positive rate, counting how many anomalous operations appears in the first N positions (where N is the number of anomalous operations). Then as operator the uniform crossover and I change a valueof a cell of the individual for the mutation. Of course, every time I make a check to fix the individual such that the sum is 1. Finally I use elitism to save the best-so-far solution over the time.

I observed that one feature has a very high value, which is often important, but not always, and this causes very low values for the other features. I suspect that my GA is overfitting. Can you help me to find a good stop criteria?

589

asked Jan 04 '15 11:01

user2297037

1 Answers

Overfitting in genetic algorithms and programming is a big issue which is currently under research focus of the GP community, including myself. Most of the research is aimed at genetic programming and evolution of classification/regression models but it might also relate to your problem. There are some papers which might help you (and which I am working with too):

Gonçalves, Ivo, and Sara Silva. "Experiments on controlling overfitting in genetic programming." Proceedings of the 15th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence, EPIA. Vol. 84. 2011.
Langdon, W. B. "Minimising testing in genetic programming." RN 11.10 (2011): 1.
Gonçalves, Ivo, et al. "Random sampling technique for overfitting control in genetic programming." Genetic Programming. Springer Berlin Heidelberg, 2012. 218-229.
Gonçalves, Ivo, and Sara Silva. Balancing learning and overfitting in genetic programming with interleaved sampling of training data. Springer Berlin Heidelberg, 2013.

You can find the papers (the first two directly in pdf) by searching for their titles in scholar.google.com.

Basically, what all the papers work with, is the idea of using only a subset of the training data for directing the evolution and (randomly) changing this subset every generation (using the same subset for all individuals in one generation). Interestingly, experiments show that the smaller this subset is, the less overfitting occurs, up to the extreme of using only a single-element subset. The papers work with this idea and extend it with some tweaks (like switching between full dataset and a subset). But as I said in the beginning, all this is aimed at symbolic regression (more or less) and not feature selection.

I personally once tried another approach (again for symbolic regression by genetic programming) - using a subset of training data (e.g. a half) to drive the evolution (i.e. for fitness), but the "best-so-far" solution was determined using results on the remaining training data. The overfitting was much less significant.

182

answered Sep 21 '22 22:09

zegkljan

Related questions
                            
                                What are the real differences between genetic algorithms and evolutionary algorithms?
                            
                                C# how to create functions that are interpreted at runtime
                            
                                how to tackle this combinatorial algorithm problem
                            
                                How to represent a schedule for Timetabler Problem in Genetic Algorithms?
                            
                                How to implement selection and crossover in using genetic algorithm to find square root of a number in C
                            
                                Multithreaded galib247 genetic algorithm stuck in local maxima
                            
                                Do you have genetic algorithm in production?
                            
                                Genetic Algorithm: Higher Mutation Rate leads to lower run time
                            
                                Cross-over two integers bitwise
                            
                                Whats the difference between Cross-Entropy and Genetic Algorithms?
                            
                                Haskell Stack Overflow
                            
                                Genetic Algorithm Optimization
                            
                                What's the importance of invalid fitness in DEAP?
                            
                                Kalman Filter vs Exponential Filter
                            
                                Sampling Permutations of [1,2,3,...,N] for large N
                            
                                Looking for a better evaluation method for a genetic algorithm
                            
                                What model best suits optimizing for a real-time strategy game?
                            
                                how can i avoid the compiler error: std::transform?
                            
                                What algorithm should I use for "genetic AI improvement"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With