How does scipy.optimize.minimize handle NaNs?

Tags: python, optimization, nan, scipy, minimize

I'm using the SLSQP solver in scipy.optimize.minimize to solve a constrained optimization problem. Very often the solver tries parameter values that violate the constraints, and when that happens the objective function returns nan. This seems to cause problems, because my approximated Jacobian is full of nans nearly every time it is recalculated. More often than not, the optimization terminates with exit mode 8: Positive directional derivative for linesearch. I suspect the nans in the approximated Jacobian are the source of this. My question, then, is how scipy.optimize.minimize handles nans. Are they benign, or should they be converted to a large (or even infinite) number? To the best of my knowledge, this is not covered anywhere in the SciPy documentation.

asked Mar 24 '18 04:03

Peter

1 Answer

Whether scipy checks for nans depends on which search algorithm you use, so you'll have to check the source of each one. Nans generally don't affect minimisers (unless you use non-discriminatory methods), but they really mess up maximisation. In general, scipy ends up using numpy arrays. The best way to understand what happens is with the following simple example:

>>> import numpy as np
>>> x = [-np.nan, np.nan, 1, 2, 3, np.nan]  # some random sequence of numbers and nans
>>> np.sort(x)
array([ 1.,  2.,  3., nan, nan, nan])

When sorting, np.nan is always placed last, as if it were the largest number (the sign in -np.nan makes no difference here). You therefore have to account for nans explicitly in your search algorithm so that these solutions are rejected in future iterations. How +/- nans are interpreted is a separate question if the backend implementation is in Fortran, which is sometimes the case.
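One way to reject such solutions explicitly is to wrap the objective so the solver never sees a non-finite value. The following is only a minimal sketch, not how scipy handles nans internally: the objective, the `guard_nonfinite` helper, the `PENALTY` value and the bounds are all hypothetical names and choices made for illustration.

```python
import numpy as np
from scipy.optimize import minimize

# Assumed "large but finite" replacement value; tune it to the problem's scale.
PENALTY = 1e10


def guard_nonfinite(fun, penalty=PENALTY):
    """Wrap `fun` so that nan or infinite results become a finite penalty."""
    def wrapped(x, *args):
        value = fun(x, *args)
        return float(value) if np.isfinite(value) else penalty
    return wrapped


def objective(x):
    # Hypothetical objective: nan for negative inputs, inf at zero.
    with np.errstate(invalid="ignore", divide="ignore"):
        return ((x[0] - 2.0) ** 2 + (x[1] - 1.0) ** 2
                - np.log(x[0]) - np.log(x[1]))


guarded = guard_nonfinite(objective)

# Bounds are intended to keep the iterates inside the region where the
# objective is defined; the guard is a fallback for any trial point that
# still lands outside it.
result = minimize(
    guarded,
    x0=np.array([1.0, 1.0]),
    method="SLSQP",
    bounds=[(1e-6, None), (1e-6, None)],
)
print(result.status, result.message)
print(result.x)
```

A finite penalty makes the objective discontinuous at the edge of the valid region, so finite-difference gradients taken near that edge can still be misleading. Where possible, expressing the valid region through bounds or constraints is likely to behave better than relying on the penalty alone.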

answered Oct 23 '22 14:10

newkid
