desc.optimize.fmintr
- class desc.optimize.fmintr(fun, x0, grad, hess='bfgs', bounds=(-inf, inf), args=(), x_scale='hess', ftol=1e-06, xtol=1e-06, gtol=1e-06, verbose=1, maxiter=None, callback=None, options=None)
Minimize a scalar function using a (quasi)-Newton trust region method.
- Parameters:
fun (callable) – objective to be minimized. Should have a signature like fun(x, *args) -> float
x0 (array-like) – initial guess
grad (callable) – function to compute gradient, df/dx. Should take the same arguments as fun
hess (callable or 'bfgs', optional) – function to compute the Hessian matrix of fun, or 'bfgs', in which case the BFGS method will be used to approximate the Hessian.
bounds (tuple of array-like) – Lower and upper bounds on independent variables. Defaults to no bounds. Each array must match the size of x0 or be a scalar; in the latter case the bound will be the same for all variables. Use np.inf with an appropriate sign to disable bounds on all or some variables.
args (tuple) – additional arguments passed to fun, grad, and hess
x_scale (array_like or 'hess', optional) – Characteristic scale of each variable. Setting x_scale is equivalent to reformulating the problem in scaled variables xs = x / x_scale. An alternative view is that the size of the trust region along the jth dimension is proportional to x_scale[j]. Improved convergence may be achieved by setting x_scale such that a step of a given size along any of the scaled variables has a similar effect on the cost function. If set to 'hess', the scale is iteratively updated using the inverse norms of the columns of the Hessian matrix.
ftol (float or None, optional) – Tolerance for termination by the change of the cost function. The optimization process is stopped when dF < ftol * F, and there was an adequate agreement between a local quadratic model and the true model in the last step. If None, termination by this condition is disabled.
xtol (float or None, optional) – Tolerance for termination by the change of the independent variables. Optimization is stopped when norm(dx) < xtol * (xtol + norm(x)). If None, termination by this condition is disabled.
gtol (float or None, optional) – Absolute tolerance for termination by the norm of the gradient. The optimizer terminates when max(abs(g)) < gtol. If None, termination by this condition is disabled.
verbose ({0, 1, 2}, optional) –
0 : work silently.
1 (default) : display a termination report.
2 : display progress during iterations.
maxiter (int, optional) – maximum number of iterations. Defaults to size(x)*100.
callback (callable, optional) – Called after each iteration. Should be a callable with the signature callback(xk, *args) -> bool, where xk is the current parameter vector and args are the same arguments passed to fun and grad. If callback returns True, the algorithm execution is terminated.
options (dict, optional) – Dictionary of optional keyword arguments to override default solver settings:
"max_nfev" : (int > 0) Maximum number of function evaluations (each iteration may take more than one function evaluation). Default is 5*maxiter + 1.
"max_dx" : (float > 0) Maximum allowed change in the norm of x from its starting point. Default np.inf.
"initial_trust_radius" : ("scipy", "conngould", "mix" or float > 0) Initial trust region radius. "scipy" uses the scaled norm of x0, which is the default behavior in scipy.optimize.least_squares. "conngould" uses the norm of the Cauchy point, as recommended in ch. 17 of [1]. "mix" uses the geometric mean of the previous two options. A float can also be passed to specify the trust radius directly. Default is "scipy".
"initial_trust_ratio" : (float > 0) An extra scaling factor that is applied after one of the previous heuristics to determine the initial trust radius. Default 1.
"max_trust_radius" : (float > 0) Maximum allowable trust region radius. Default np.inf.
"min_trust_radius" : (float >= 0) Minimum allowable trust region radius. Optimization is terminated if the trust region falls below this value. Default np.finfo(x0.dtype).eps.
"tr_increase_threshold" : (0 < float < 1) Increase the trust region radius when the ratio of actual to predicted reduction exceeds this threshold. Default 0.75.
"tr_decrease_threshold" : (0 < float < 1) Decrease the trust region radius when the ratio of actual to predicted reduction is less than this threshold. Default 0.25.
"tr_increase_ratio" : (float > 1) Factor to increase the trust region radius by when the ratio of actual to predicted reduction exceeds the threshold. Default 2.
"tr_decrease_ratio" : (0 < float < 1) Factor to decrease the trust region radius by when the ratio of actual to predicted reduction falls below the threshold. Default 0.25.
"tr_method" : ("exact", "dogleg", "subspace") Method to use for the trust region subproblem. "exact" uses a series of Cholesky factorizations (usually 2-3) to find the optimal step. "dogleg" approximates the optimal step using Powell's dogleg method. "subspace" solves a reduced subproblem over the space spanned by the gradient and Newton direction. Default "exact".
"hessian_exception_strategy" : ("skip_update", "damp_update") If BFGS is used, defines how to proceed when the curvature condition is violated. Set it to "skip_update" to just skip the update, or to "damp_update" to interpolate between the actual BFGS result and the unmodified matrix. Both exception strategies are explained in [2], p. 536-537. Default is "damp_update".
"hessian_min_curvature" : (float) If BFGS is used, this number, scaled by a normalization factor, defines the minimum curvature dot(delta_grad, delta_x) allowed to go unaffected by the exception strategy. By default it is equal to 1e-8 when exception_strategy = "skip_update" and equal to 0.2 when exception_strategy = "damp_update".
"hessian_init_scale" : (float, "auto") If BFGS is used, the matrix scale at the first iteration. At the first iteration the Hessian matrix or its inverse will be initialized with init_scale*np.eye(n), where n is the problem dimension. Set it to "auto" in order to use an automatic heuristic for choosing the initial scale. The heuristic is described in [2], p. 143. Default is "auto".
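The "tr_increase_threshold"/"tr_decrease_threshold" and "tr_increase_ratio"/"tr_decrease_ratio" options control the standard trust region radius update from [1]. A minimal NumPy sketch of that logic (the function name and the exact condition for growing the radius are illustrative, not DESC's internal code):

```python
import numpy as np

def update_trust_radius(radius, ratio, step_norm,
                        increase_threshold=0.75, increase_ratio=2.0,
                        decrease_threshold=0.25, decrease_ratio=0.25,
                        max_radius=np.inf):
    """Update the trust region radius after one step.

    ratio is the actual-to-predicted reduction of the cost function;
    step_norm is the (scaled) norm of the step just taken.
    """
    if ratio < decrease_threshold:
        # Quadratic model was a poor fit: shrink the region.
        radius = decrease_ratio * radius
    elif ratio > increase_threshold and np.isclose(step_norm, radius):
        # Model fit well and the step hit the boundary: grow the region.
        radius = min(increase_ratio * radius, max_radius)
    return radius
```

With the default thresholds, a step with reduction ratio 0.5 leaves the radius unchanged, so steps that are merely adequate neither grow nor shrink the region.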
- Returns:
res (OptimizeResult) – The optimization result represented as an OptimizeResult object. Important attributes are: x, the solution array, and success, a Boolean flag indicating if the optimizer exited successfully.
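The "damp_update" exception strategy referenced in the options is the damped BFGS update of [2], p. 536-537. The following NumPy function is an illustrative reimplementation of that update, not DESC's internal code; min_curvature plays the role of the (normalized) "hessian_min_curvature" option:

```python
import numpy as np

def damped_bfgs_update(B, s, y, min_curvature=0.2):
    """One damped BFGS update of the Hessian approximation B.

    s = x_new - x_old, y = grad_new - grad_old. When the curvature
    s @ y falls below min_curvature * (s @ B @ s), y is interpolated
    toward B @ s so the updated matrix stays positive definite.
    """
    Bs = B @ s
    sBs = s @ Bs
    sy = s @ y
    if sy >= min_curvature * sBs:
        theta = 1.0  # curvature condition holds: plain BFGS update
    else:
        theta = (1.0 - min_curvature) * sBs / (sBs - sy)  # damp toward B
    r = theta * y + (1.0 - theta) * Bs  # damped gradient difference
    return B - np.outer(Bs, Bs) / sBs + np.outer(r, r) / (s @ r)
```

With theta chosen this way, s @ r = min_curvature * (s @ B @ s) > 0 whenever the raw curvature is too small, which is what keeps the approximation positive definite even on steps with negative curvature.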
References