Subproblem Solvers

This module provides the machinery to solve 1- and N-dimensional trust-region subproblems.

Functions Summary

`dsecular`(lam, w, eigvals, eigvecs, delta)	Derivative of the secular equation
`dslam`(lam, w, eigvals, eigvecs)	Computes the derivative of the solution $s(\lambda)$ with respect to lambda, where $s$ is the subproblem solution according to
`get_1d_trust_region_boundary_solution`(B, g, ...)
`quadratic_form`(Q, p, x)	Computes the quadratic form $x^TQx + x^Tp$
`secular`(lam, w, eigvals, eigvecs, delta)	Secular equation
`slam`(lam, w, eigvals, eigvecs)	Computes the solution $s(\lambda)$ as subproblem solution according to
`solve_1d_trust_region_subproblem`(B, g, s, ...)	Solves the special case of a one-dimensional subproblem
`solve_nd_trust_region_subproblem`(B, g, delta)	This function exactly solves the n-dimensional subproblem.

Functions

fides.subproblem.dsecular(lam, w, eigvals, eigvecs, delta)[source]

Derivative of the secular equation

$\phi(\lambda) = \frac{1}{||s||} - \frac{1}{\Delta}$

with respect to $\lambda$

Parameters:

lam (float) – $\lambda$
w (numpy.ndarray) – precomputed eigenvector coefficients for -g
eigvals (numpy.ndarray) – precomputed eigenvalues of B
eigvecs (numpy.ndarray) – precomputed eigenvectors of B
delta (float) – trust region radius $\Delta$

Returns:

$\frac{\partial \phi(\lambda)}{\partial \lambda}$

fides.subproblem.dslam(lam, w, eigvals, eigvecs)[source]

Computes the derivative of the solution $s(\lambda)$ with respect to lambda, where $s$ is the subproblem solution according to

$-(B + \lambda I)s = g$

Parameters:

lam (float) – $\lambda$
w (numpy.ndarray) – precomputed eigenvector coefficients for -g
eigvals (numpy.ndarray) – precomputed eigenvalues of B
eigvecs (numpy.ndarray) – precomputed eigenvectors of B

Returns:

$\frac{\partial s(\lambda)}{\partial \lambda}$

fides.subproblem.get_1d_trust_region_boundary_solution(B, g, s, s0, delta)[source]

fides.subproblem.quadratic_form(Q, p, x)[source]

Computes the quadratic form $x^TQx + x^Tp$

Parameters:

Q (numpy.ndarray) – Matrix
p (numpy.ndarray) – Vector
x (numpy.ndarray) – Input

Return type:

float

Returns:

Value of form

fides.subproblem.secular(lam, w, eigvals, eigvecs, delta)[source]

Secular equation

$\phi(\lambda) = \frac{1}{||s||} - \frac{1}{\Delta}$

Subproblem solutions are given by the roots of this equation

Parameters:

lam (float) – $\lambda$
w (numpy.ndarray) – precomputed eigenvector coefficients for -g
eigvals (numpy.ndarray) – precomputed eigenvalues of B
eigvecs (numpy.ndarray) – precomputed eigenvectors of B
delta (float) – trust region radius $\Delta$

Returns:

$\phi(\lambda)$

fides.subproblem.slam(lam, w, eigvals, eigvecs)[source]

Computes the solution $s(\lambda)$ as subproblem solution according to

$-(B + \lambda I)s = g$

Parameters:

lam (float) – $\lambda$
w (numpy.ndarray) – precomputed eigenvector coefficients for -g
eigvals (numpy.ndarray) – precomputed eigenvalues of B
eigvecs (numpy.ndarray) – precomputed eigenvectors of B

Return type:

numpy.ndarray

Returns:

$s(\lambda)$

fides.subproblem.solve_1d_trust_region_subproblem(B, g, s, delta, s0)[source]

Solves the special case of a one-dimensional subproblem

Parameters:

B (numpy.ndarray) – Hessian of the quadratic subproblem
g (numpy.ndarray) – Gradient of the quadratic subproblem
s (numpy.ndarray) – Vector defining the one-dimensional search direction
delta (float) – Norm boundary for the solution of the quadratic subproblem
s0 (numpy.ndarray) – reference point from where search is started, also counts towards norm of step

Return type:

numpy.ndarray

Returns:

Proposed step-length

fides.subproblem.solve_nd_trust_region_subproblem(B, g, delta, logger=None)[source]

This function exactly solves the n-dimensional subproblem.

$argmin_s\{s^T B s + s^T g = 0: ||s|| <= \Delta, s \in \mathbb{ R}^n\}$

The solution is characterized by the equation $-(B + \lambda I)s = g$. If B is positive definite, the solution can be obtained by $\lambda = 0`$$ if $Bs = -g$ satisfies $||s|| <= \Delta$. If B is indefinite or $Bs = -g$ satisfies $||s|| > \Delta$ and an appropriate $\lambda$ has to be identified via 1D rootfinding of the secular equation

$\phi(\lambda) = \frac{1}{||s(\lambda)||} - \frac{1}{\Delta} = 0$

with $s(\lambda)$ computed according to an eigenvalue decomposition of B. The eigenvalue decomposition, although being more expensive than a cholesky decomposition, has the advantage that eigenvectors are invariant to changes in $\lambda$ and eigenvalues are linear in $\lambda$, so factorization only has to be performed once. We perform the linesearch via Newton’s algorithm and Brent-Q as fallback. The hard case is treated separately and serves as general fallback.

Parameters:

B (numpy.ndarray) – Hessian of the quadratic subproblem
g (numpy.ndarray) – Gradient of the quadratic subproblem
delta (float) – Norm boundary for the solution of the quadratic subproblem
logger (typing.Optional[logging.Logger]) – Logger instance to be used for logging

Return type:

typing.Tuple[numpy.ndarray, str]

Returns:

s: Selected step, step_type: Type of solution that was obtained