
Integer step size in scipy optimize minimize

I have a computer vision algorithm I want to tune up using scipy.optimize.minimize. Right now I only want to tune two parameters, but the number of parameters might eventually grow, so I would like to use a technique that can handle high-dimensional gradient searches. The Nelder-Mead implementation in SciPy seemed like a good fit.

I got the code all set up, but it seems that the minimize function really wants to use floating point values with a step size that is less than one. The current parameters are both integers: one has a step size of one and the other has a step size of two (i.e. the value must be odd; if it isn't, the thing I am trying to optimize will convert it to an odd number). Roughly speaking, one parameter is a window size in pixels and the other is a threshold (a value from 0-255).

For what it is worth I am using a fresh build of scipy from the git repo. Does anyone know how to tell scipy to use a specific step size for each parameter? Is there some way I can roll my own gradient function? Is there a scipy flag that could help me out? I am aware that this could be done with a simple parameter sweep, but I would eventually like to apply this code to much larger sets of parameters.

The code itself is dead simple:

import numpy as np
from scipy.optimize import minimize
from ScannerUtil import straightenImg 
import bson

def doSingleIteration(parameters):
    # do some machine vision magic
    # return the difference between my value and the truth value

parameters = np.array([11,10])
res = minimize(doSingleIteration, parameters, method='Nelder-Mead',
               options={'xtol': 1e-2, 'disp': True, 'ftol': 1.0})  # not sure if these params do anything
print "~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~"
print res
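For reference, here is a simplified stand-in for doSingleIteration that shows only how the floats get coerced to valid integers (the real vision code is omitted; the dicts it prints match the ones in the output below):

def doSingleIteration(parameters):
    block_size = int(parameters[0])   # window size in pixels, forced to be odd
    if block_size % 2 == 0:
        block_size += 1
    degree = int(parameters[1])       # threshold-like value in 0-255
    print({'block_size': block_size, 'degree': degree})
    # ... machine vision magic runs here with block_size and degree ...
    # return the difference between my value and the truth value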

This is what my output looks like. As you can see, we repeat a lot of runs without getting anywhere in the minimization.

+++++++++++++++++++++++++++++++++++++++++
[ 11.  10.]  <-- Output from scipy minimize
{'block_size': 11, 'degree': 10} <-- input to my algorithm rounded and made int
+++++++++++++++++++++++++++++++++++++++++
120  <-- output of the function I am trying to minimize
+++++++++++++++++++++++++++++++++++++++++
[ 11.55  10.  ]
{'block_size': 11, 'degree': 10}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
[ 11.   10.5]
{'block_size': 11, 'degree': 10}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
[ 11.55   9.5 ]
{'block_size': 11, 'degree': 9}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
[ 11.1375  10.25  ]
{'block_size': 11, 'degree': 10}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
[ 11.275  10.   ]
{'block_size': 11, 'degree': 10}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
[ 11.    10.25]
{'block_size': 11, 'degree': 10}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
[ 11.275   9.75 ]
{'block_size': 11, 'degree': 9}
+++++++++++++++++++++++++++++++++++++++++
120
+++++++++++++++++++++++++++++++++++++++++
~~~
SNIP 
~~~
+++++++++++++++++++++++++++++++++++++++++
[ 11.         10.0078125]
{'block_size': 11, 'degree': 10}
+++++++++++++++++++++++++++++++++++++++++
120
Optimization terminated successfully.
         Current function value: 120.000000
         Iterations: 7
         Function evaluations: 27
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  status: 0
    nfev: 27
 success: True
     fun: 120.0
       x: array([ 11.,  10.])
 message: 'Optimization terminated successfully.'
     nit: 7
asked Aug 29 '12 by kscottz

1 Answer

Assuming that the function to minimize is arbitrarily complex (nonlinear), this is a very hard problem in general. It cannot be guaranteed to be solved optimally unless you try every possible option. I do not know of any integer-constrained nonlinear optimizers (I somewhat doubt any exist), and I will assume you know that Nelder-Mead should work fine if it were a continuous function.

Edit: Considering the comment from @Dougal, I will just add here: set up a coarse + fine grid search first; if you then feel like trying whether your Nelder-Mead works (and converges faster), the points below may help...
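For example, a coarse sweep over the two current parameters could look roughly like this (a sketch only; the ranges and step sizes are placeholders for whatever makes sense for your parameters, and doSingleIteration is the questioner's own function):

import itertools

best_score, best_params = float('inf'), None
# block_size must be odd (step of 2); the threshold-like parameter steps coarsely.
for block_size, degree in itertools.product(range(3, 51, 2), range(0, 256, 8)):
    score = doSingleIteration([block_size, degree])
    if score < best_score:
        best_score, best_params = score, (block_size, degree)

print(best_params, best_score)

A finer sweep can then be run in a small neighbourhood around best_params.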

But here are some points that may help:

  1. Considering how difficult the whole integer constraint is, maybe it would be an option to do some simple interpolation to help the optimizer. It should still converge to an integer solution. Of course this requires calculating extra points, but it might solve many other problems. (Even in linear integer programming it is common to solve the unconstrained system first, AFAIK.)
  2. Nelder-Mead starts with N+1 points; these are hard-wired in SciPy (at least older versions) to (1+0.05)*x0[j] (for j in all dimensions, unless x0[j] is 0), which you will see in your first evaluation steps. Maybe these can be supplied in newer versions; otherwise you could just change/copy the SciPy code (it is pure Python) and set it to something more reasonable. Or, if you feel that is simpler, scale all input variables down so that (1+0.05)*x0 is of a sensible size. (There is a rough sketch of points 2 and 3 after this list.)
  3. Maybe you should cache all function evaluations, since with Nelder-Mead I would guess you can always run into duplicate evaluations (at least at the end).
  4. You have to check how likely it is that Nelder-Mead will just shrink to a single value and give up, because it always finds the same result.
  5. You generally must check if your function is well behaved at all... This optimization is doomed if the function does not change smoothly over the parameter space, and even then it can easily run into local minima if there are any. (Since you cached all evaluations - see 3. - you could at least plot those and have a look at the error landscape without needing any extra evaluations.)
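A rough sketch of points 2 and 3 combined might look like this. It assumes a recent SciPy (the initial_simplex, xatol and fatol options do not exist in the 2012 build from the question), and doSingleIteration is the questioner's own function:

import numpy as np
from scipy.optimize import minimize

cache = {}  # maps (block_size, degree) -> objective value

def cached_objective(x):
    # Round to the integer grid the algorithm actually uses (block_size must be odd).
    block_size = int(round(x[0]))
    if block_size % 2 == 0:
        block_size += 1
    degree = int(round(x[1]))
    key = (block_size, degree)
    if key not in cache:
        cache[key] = doSingleIteration([block_size, degree])
    return cache[key]

x0 = np.array([11.0, 10.0])
# Starting simplex with steps of 2 (block_size) and 1 (degree) instead of the
# default 5% perturbation; the shape must be (N+1, N).
simplex = np.array([x0, x0 + [2.0, 0.0], x0 + [0.0, 1.0]])
res = minimize(cached_objective, x0, method='Nelder-Mead',
               options={'initial_simplex': simplex, 'xatol': 1.0,
                        'fatol': 0.5, 'disp': True})
print(res.x, res.fun)

Since cache ends up holding every distinct integer evaluation, you can also dump it afterwards and plot the error landscape mentioned in point 5.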
answered Oct 25 '22 by seberg