Function approximation

I have a function,

P(x0, x1, ..., xn)

that takes 100 integers as input and gives an integer as output. P is slow to evaluate (a single call can take anywhere from 30 seconds to a couple of minutes).

I need to know which input values will maximize the value returned by P.

What techniques can I use to accomplish this? I know people generally use genetic algorithms for this, but I'm afraid they would take ages to compute: even with a small population and few generations (say, population = 50, generations = 50), P is so slow that it would take more than 40 hours (50 × 50 = 2,500 evaluations at roughly a minute each).

Is there any cheaper method of doing it? Maybe an iterative process? It doesn't need to be truly optimal, but I don't have any idea of how P behaves (I've tried fitting linear / quadratic / exponential models, but they don't seem to yield any good values; I know P can return values at least 5-10 times better than what I'm getting).

It should also be something that's relatively easy to implement, since I have to implement it myself.

Thanks

edit: P is a stochastic process.

asked Dec 11 '09 by devoured elysium

People also ask

What is function approximation in reinforcement learning?

In summary, function approximation helps estimate the value of a state or an action when similar circumstances occur, whereas computing the exact values of V and Q requires a full computation and does not learn from past experience. Function approximation also saves computation time and memory.

What is function approximation in machine learning?

Function approximation is a technique for estimating an unknown underlying function using historical or available observations from the domain. Artificial neural networks learn to approximate a function.

How do we approximate a function?

If one has the function value and n derivatives at one point, x0, then one can calculate a polynomial approximation using the Taylor expansion: f(x) ≈ f(x0) + (x − x0) f′(x0) + … + ((x − x0)^n / n!) f^(n)(x0).
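As a small worked example (my own, not taken from the text above), the general Taylor polynomial and its second-order truncation for f(x) = e^x at x0 = 0 are:

% nth-order Taylor polynomial about x_0, then the n = 2 case for e^x
f(x) \approx \sum_{k=0}^{n} \frac{f^{(k)}(x_0)}{k!}\,(x - x_0)^k,
\qquad
e^{x} \approx 1 + x + \frac{x^2}{2}

Plugging in x = 0.1 gives about 1.105, versus the true value e^0.1 ≈ 1.10517.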

Why is function approximation needed?

The need for function approximations arises in many branches of applied mathematics, and computer science in particular, such as predicting the growth of microbes in microbiology. Function approximations are used where theoretical models are unavailable or hard to compute.


1 Answer

Simulated annealing, closely related to Markov Chain Monte Carlo (MCMC). The variant you probably want is Metropolis-Hastings. When you get the hang of it, it's quite nice. Possibly there are some ways to optimize it because your inputs and result are all integers. It is compute-intensive and may require some tuning, but it is quite robust, and I'm not sure other methods could do any better.

Here's some brain-dead code to do it:

#include <stdbool.h>
#include <stdlib.h>

#define N 100                   // length of the vector to optimize

int a[N];                       // the vector to optimize

double P(const int a[N]);       // the function to optimize, treated as an
                                // (unnormalized) probability of vector a;
                                // assumed to be defined elsewhere

// uniform choice from {-2, -1, 1, 2}; this is the Proposal Distribution,
// and it has to be symmetric
int proposal_step(void) {
    static const int steps[] = { -2, -1, 1, 2 };
    return steps[rand() % 4];
}

// uniform random double in [0, 1)
double unif01(void) {
    return rand() / ((double)RAND_MAX + 1.0);
}

void sample(long num_samples) {
    // for a large number of samples
    for (long i = 0; i < num_samples; i++) {
        // get P(a)
        double p = P(a);
        // for each element of vector a
        for (int j = 0; j < N; j++) {
            // get an amount by which to change a[j]
            int step = proposal_step();
            // make the change to a[j], and get p1, the new value of p
            a[j] += step;
            double p1 = P(a);
            bool keep_the_step = true;
            // if p1 is better than p, keep the step;
            // if p1 is worse than p, then keep the step p1/p of the time
            if (p1 < p) {
                keep_the_step = (unif01() < p1 / p);
            }
            if (keep_the_step) p = p1;
            else a[j] -= step;
        }
        // now a is a sample, and p is its value: record a and p
    }
}
// what you have now is a large random sampling of vectors from distribution P
// now you can choose the best one, the average, the variance,
// any statistic you like
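If all you care about is the maximizer rather than the full distribution, one minimal way to implement the "record a and p" step (my own sketch, reusing N and P from the code above) is to keep the best vector seen so far:

#include <string.h>

int best_a[N];            // best vector seen so far
double best_p = -1.0;     // assumes P(a) is never negative

// call this once per outer iteration with the current sample
void record(const int a[N], double p) {
    if (p > best_p) {
        best_p = p;
        memcpy(best_a, a, sizeof(best_a));
    }
}

After the sampling loop finishes, best_a holds the highest-scoring input found.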

Ways to tweak it are to widen or narrow the proposal distribution so it takes larger or smaller steps, or to have it take larger steps initially and smaller steps later. What you're looking for is a fraction of accepted steps that is neither too high nor too low. You probably also want a "burn-in" phase: an initial 1k or so samples that you throw away while it hunts for the area of the mode.
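For instance, one possible way to take larger steps first and smaller steps later is to shrink the proposal width linearly over the run; the names and the schedule here are my own illustration, not something prescribed by Metropolis-Hastings:

// a proposal that starts wide and narrows as the run progresses
int proposal_step_scheduled(long i, long num_samples, int max_width, int min_width) {
    // half-width shrinks from max_width down to min_width as i grows
    int width = max_width - (int)((long)(max_width - min_width) * i / num_samples);
    int magnitude = rand() % width + 1;              // 1 .. width
    return (rand() % 2) ? magnitude : -magnitude;    // symmetric around zero
}

Use it in place of proposal_step() in the loop above, and simply skip recording samples while i is below your burn-in count (the first 1000 iterations or so).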

And by all means, profile P. It needs to be as fast as possible. Here's my favorite way to do that.
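As a very crude first check (my own sketch, reusing P and N from the code above; not the profiling method alluded to here), you can at least time each call to P:

#include <stdio.h>
#include <time.h>

// wrapper that reports how long one evaluation of P takes
double timed_P(const int a[N]) {
    clock_t t0 = clock();
    double p = P(a);
    fprintf(stderr, "P took %.1f s\n", (double)(clock() - t0) / CLOCKS_PER_SEC);
    return p;
}

Note that clock() measures CPU time for this process, which is fine for a rough per-call figure but won't tell you where inside P the time goes.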

answered Oct 07 '22 by Mike Dunlavey