doc/manual/snes.md

7f296bb3SBarry Smith(ch_snes)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith# SNES: Nonlinear Solvers
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe solution of large-scale nonlinear problems pervades many facets of
7f296bb3SBarry Smithcomputational science and demands robust and flexible solution
7f296bb3SBarry Smithstrategies. The `SNES` library of PETSc provides a powerful suite of
7f296bb3SBarry Smithdata-structure-neutral numerical routines for such problems. Built on
7f296bb3SBarry Smithtop of the linear solvers and data structures discussed in preceding
7f296bb3SBarry Smithchapters, `SNES` enables the user to easily customize the nonlinear
7f296bb3SBarry Smithsolvers according to the application at hand. Also, the `SNES`
7f296bb3SBarry Smithinterface is *identical* for the uniprocess and parallel cases; the only
7f296bb3SBarry Smithdifference in the parallel version is that each process typically forms
7f296bb3SBarry Smithonly its local contribution to various matrices and vectors.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe `SNES` class includes methods for solving systems of nonlinear
7f296bb3SBarry Smithequations of the form
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{F}(\mathbf{x}) = 0,
7f296bb3SBarry Smith$$ (fx0)
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $\mathbf{F}: \, \Re^n \to \Re^n$. Newton-like methods provide the
7f296bb3SBarry Smithcore of the package, including both line search and trust region
7f296bb3SBarry Smithtechniques. A suite of nonlinear Krylov methods and methods based upon
7f296bb3SBarry Smithproblem decomposition are also included. The solvers are discussed
7f296bb3SBarry Smithfurther in {any}`sec_nlsolvers`. Following the PETSc design
7f296bb3SBarry Smithphilosophy, the interfaces to the various solvers are all virtually
7f296bb3SBarry Smithidentical. In addition, the `SNES` software is completely flexible, so
7f296bb3SBarry Smiththat the user can at runtime change any facet of the solution process.
7f296bb3SBarry Smith
7f296bb3SBarry SmithPETSc’s default method for solving the nonlinear equation is Newton’s
7f296bb3SBarry Smithmethod with line search, `SNESNEWTONLS`. The general form of the $n$-dimensional Newton’s method
7f296bb3SBarry Smithfor solving {math:numref}`fx0` is
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{x}_{k+1} = \mathbf{x}_k - \mathbf{J}(\mathbf{x}_k)^{-1} \mathbf{F}(\mathbf{x}_k), \;\; k=0,1, \ldots,
7f296bb3SBarry Smith$$ (newton)
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $\mathbf{x}_0$ is an initial approximation to the solution and
7f296bb3SBarry Smith$\mathbf{J}(\mathbf{x}_k) = \mathbf{F}'(\mathbf{x}_k)$, the Jacobian, is nonsingular at each
7f296bb3SBarry Smithiteration. In practice, the Newton iteration {math:numref}`newton` is
7f296bb3SBarry Smithimplemented by the following two steps:
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\begin{aligned}
7f296bb3SBarry Smith1. & \text{(Approximately) solve} & \mathbf{J}(\mathbf{x}_k) \Delta \mathbf{x}_k &= -\mathbf{F}(\mathbf{x}_k). \\
7f296bb3SBarry Smith2. & \text{Update} & \mathbf{x}_{k+1} &\gets \mathbf{x}_k + \Delta \mathbf{x}_k.
7f296bb3SBarry Smith\end{aligned}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithOther defect-correction algorithms can be implemented by using different
7f296bb3SBarry Smithchoices for $J(\mathbf{x}_k)$.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snesusage)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith## Basic SNES Usage
7f296bb3SBarry Smith
7f296bb3SBarry SmithIn the simplest usage of the nonlinear solvers, the user must merely
7f296bb3SBarry Smithprovide a C, C++, Fortran, or Python routine to evaluate the nonlinear function
7f296bb3SBarry Smith{math:numref}`fx0`. The corresponding Jacobian matrix
7f296bb3SBarry Smithcan be approximated with finite differences. For codes that are
7f296bb3SBarry Smithtypically more efficient and accurate, the user can provide a routine to
7f296bb3SBarry Smithcompute the Jacobian; details regarding these application-provided
7f296bb3SBarry Smithroutines are discussed below. To provide an overview of the use of the
7f296bb3SBarry Smithnonlinear solvers, browse the concrete example in {ref}`ex1.c <snes-ex1>` or skip ahead to the discussion.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(snes_ex1)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith:::{admonition} Listing: `src/snes/tutorials/ex1.c`
7f296bb3SBarry Smith```{literalinclude} /../src/snes/tutorials/ex1.c
7f296bb3SBarry Smith:end-before: /*TEST
7f296bb3SBarry Smith```
7f296bb3SBarry Smith:::
7f296bb3SBarry Smith
7f296bb3SBarry SmithTo create a `SNES` solver, one must first call `SNESCreate()` as
7f296bb3SBarry Smithfollows:
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESCreate(MPI_Comm comm, SNES *snes);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe user must then set routines for evaluating the residual function {math:numref}`fx0`
7f296bb3SBarry Smithand, *possibly*, its associated Jacobian matrix, as
7f296bb3SBarry Smithdiscussed in the following sections.
7f296bb3SBarry Smith
7f296bb3SBarry SmithTo choose a nonlinear solution method, the user can either call
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESSetType(SNES snes, SNESType method);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithor use the option `-snes_type <method>`, where details regarding the
7f296bb3SBarry Smithavailable methods are presented in {any}`sec_nlsolvers`. The
7f296bb3SBarry Smithapplication code can take complete control of the linear and nonlinear
7f296bb3SBarry Smithtechniques used in the Newton-like method by calling
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESSetFromOptions(snes);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis routine provides an interface to the PETSc options database, so
7f296bb3SBarry Smiththat at runtime the user can select a particular nonlinear solver, set
7f296bb3SBarry Smithvarious parameters and customized routines (e.g., specialized line
7f296bb3SBarry Smithsearch variants), prescribe the convergence tolerance, and set
7f296bb3SBarry Smithmonitoring routines. With this routine the user can also control all
7f296bb3SBarry Smithlinear solver options in the `KSP`, and `PC` modules, as discussed
7f296bb3SBarry Smithin {any}`ch_ksp`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithAfter having set these routines and options, the user solves the problem
7f296bb3SBarry Smithby calling
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESSolve(SNES snes, Vec b, Vec x);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere `x` should be initialized to the initial guess before calling and contains the solution on return.
7f296bb3SBarry SmithIn particular, to employ an initial guess of
7f296bb3SBarry Smithzero, the user should explicitly set this vector to zero by calling
7f296bb3SBarry Smith`VecZeroEntries(x)`. Finally, after solving the nonlinear system (or several
7f296bb3SBarry Smithsystems), the user should destroy the `SNES` context with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESDestroy(SNES *snes);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snesfunction)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Nonlinear Function Evaluation
7f296bb3SBarry Smith
7f296bb3SBarry SmithWhen solving a system of nonlinear equations, the user must provide a
7f296bb3SBarry Smitha residual function {math:numref}`fx0`, which is set using
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
*2a8381b2SBarry SmithSNESSetFunction(SNES snes, Vec f, PetscErrorCode (*FormFunction)(SNES snes, Vec x, Vec f, PetscCtx ctx), PetscCtx ctx);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe argument `f` is an optional vector for storing the solution; pass `NULL` to have the `SNES` allocate it for you.
7f296bb3SBarry SmithThe argument `ctx` is an optional user-defined context, which can
7f296bb3SBarry Smithstore any private, application-specific data required by the function
7f296bb3SBarry Smithevaluation routine; `NULL` should be used if such information is not
7f296bb3SBarry Smithneeded. In C and C++, a user-defined context is merely a structure in
7f296bb3SBarry Smithwhich various objects can be stashed; in Fortran a user context can be
7f296bb3SBarry Smithan integer array that contains both parameters and pointers to PETSc
7f296bb3SBarry Smithobjects.
7f296bb3SBarry Smith<a href="PETSC_DOC_OUT_ROOT_PLACEHOLDER/src/snes/tutorials/ex5.c.html">SNES Tutorial ex5</a>
7f296bb3SBarry Smithand
7f296bb3SBarry Smith<a href="PETSC_DOC_OUT_ROOT_PLACEHOLDER/src/snes/tutorials/ex5f90.F90.html">SNES Tutorial ex5f90</a>
7f296bb3SBarry Smithgive examples of user-defined application contexts in C and Fortran,
7f296bb3SBarry Smithrespectively.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snesjacobian)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Jacobian Evaluation
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe user may also specify a routine to form some approximation of the
7f296bb3SBarry SmithJacobian matrix, `A`, at the current iterate, `x`, as is typically
7f296bb3SBarry Smithdone with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
*2a8381b2SBarry SmithSNESSetJacobian(SNES snes, Mat Amat, Mat Pmat, PetscErrorCode (*FormJacobian)(SNES snes, Vec x, Mat A, Mat B, PetscCtx ctx), PetscCtx ctx);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe arguments of the routine `FormJacobian()` are the current iterate,
7f296bb3SBarry Smith`x`; the (approximate) Jacobian matrix, `Amat`; the matrix from
7f296bb3SBarry Smithwhich the preconditioner is constructed, `Pmat` (which is usually the
7f296bb3SBarry Smithsame as `Amat`); and an optional user-defined Jacobian context,
7f296bb3SBarry Smith`ctx`, for application-specific data. The `FormJacobian()`
7f296bb3SBarry Smithcallback is only invoked if the solver requires it, always
7f296bb3SBarry Smith*after* `FormFunction()` has been called at the current iterate.
7f296bb3SBarry Smith
7f296bb3SBarry SmithNote that the `SNES` solvers
7f296bb3SBarry Smithare all data-structure neutral, so the full range of PETSc matrix
7f296bb3SBarry Smithformats (including “matrix-free” methods) can be used.
7f296bb3SBarry Smith{any}`ch_matrices` discusses information regarding
7f296bb3SBarry Smithavailable matrix formats and options, while {any}`sec_nlmatrixfree` focuses on matrix-free methods in
7f296bb3SBarry Smith`SNES`. We briefly touch on a few details of matrix usage that are
7f296bb3SBarry Smithparticularly important for efficient use of the nonlinear solvers.
7f296bb3SBarry Smith
7f296bb3SBarry SmithA common usage paradigm is to assemble the problem Jacobian in the
7f296bb3SBarry Smithpreconditioner storage `B`, rather than `A`. In the case where they
7f296bb3SBarry Smithare identical, as in many simulations, this makes no difference.
7f296bb3SBarry SmithHowever, it allows us to check the analytic Jacobian we construct in
7f296bb3SBarry Smith`FormJacobian()` by passing the `-snes_mf_operator` flag. This
7f296bb3SBarry Smithcauses PETSc to approximate the Jacobian using finite differencing of
7f296bb3SBarry Smiththe function evaluation (discussed in {any}`sec_fdmatrix`),
7f296bb3SBarry Smithand the analytic Jacobian becomes merely the preconditioner. Even if the
7f296bb3SBarry Smithanalytic Jacobian is incorrect, it is likely that the finite difference
7f296bb3SBarry Smithapproximation will converge, and thus this is an excellent method to
7f296bb3SBarry Smithverify the analytic Jacobian. Moreover, if the analytic Jacobian is
7f296bb3SBarry Smithincomplete (some terms are missing or approximate),
7f296bb3SBarry Smith`-snes_mf_operator` may be used to obtain the exact solution, where
7f296bb3SBarry Smiththe Jacobian approximation has been transferred to the preconditioner.
7f296bb3SBarry Smith
7f296bb3SBarry SmithOne such approximate Jacobian comes from “Picard linearization”, use `SNESSetPicard()`, which
7f296bb3SBarry Smithwrites the nonlinear system as
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{F}(\mathbf{x}) := \mathbf{A}(\mathbf{x}) \mathbf{x} - \mathbf{b} = 0
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $\mathbf{A}(\mathbf{x})$ usually contains the lower-derivative parts of the
7f296bb3SBarry Smithequation. For example, the nonlinear diffusion problem
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith- \nabla\cdot(\kappa(u) \nabla u) = 0
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwould be linearized as
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry SmithA(u) v \simeq -\nabla\cdot(\kappa(u) \nabla v).
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithUsually this linearization is simpler to implement than Newton and the
7f296bb3SBarry Smithlinear problems are somewhat easier to solve. In addition to using
7f296bb3SBarry Smith`-snes_mf_operator` with this approximation to the Jacobian, the
7f296bb3SBarry SmithPicard iterative procedure can be performed by defining $\mathbf{J}(\mathbf{x})$
7f296bb3SBarry Smithto be $\mathbf{A}(\mathbf{x})$. Sometimes this iteration exhibits better global
7f296bb3SBarry Smithconvergence than Newton linearization.
7f296bb3SBarry Smith
7f296bb3SBarry SmithDuring successive calls to `FormJacobian()`, the user can either
7f296bb3SBarry Smithinsert new matrix contexts or reuse old ones, depending on the
7f296bb3SBarry Smithapplication requirements. For many sparse matrix formats, reusing the
7f296bb3SBarry Smithold space (and merely changing the matrix elements) is more efficient;
7f296bb3SBarry Smithhowever, if the matrix nonzero structure completely changes, creating an
7f296bb3SBarry Smithentirely new matrix context may be preferable. Upon subsequent calls to
7f296bb3SBarry Smiththe `FormJacobian()` routine, the user may wish to reinitialize the
7f296bb3SBarry Smithmatrix entries to zero by calling `MatZeroEntries()`. See
7f296bb3SBarry Smith{any}`sec_othermat` for details on the reuse of the matrix
7f296bb3SBarry Smithcontext.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe directory `$PETSC_DIR/src/snes/tutorials` provides a variety of
7f296bb3SBarry Smithexamples.
7f296bb3SBarry Smith
7f296bb3SBarry SmithSometimes a nonlinear solver may produce a step that is not within the domain
7f296bb3SBarry Smithof a given function, for example one with a negative pressure. When this occurs
7f296bb3SBarry Smithone can call `SNESSetFunctionDomainError()` or `SNESSetJacobianDomainError()`
7f296bb3SBarry Smithto indicate to `SNES` the step is not valid. One must also use `SNESGetConvergedReason()`
7f296bb3SBarry Smithand check the reason to confirm if the solver succeeded. See {any}`sec_vi` for how to
7f296bb3SBarry Smithprovide `SNES` with bounds on the variables to solve (differential) variational inequalities
7f296bb3SBarry Smithand how to control properties of the line step computed.
7f296bb3SBarry Smith
76c63389SBarry Smith## Function Domain Errors and infinity or NaN
76c63389SBarry Smith
76c63389SBarry SmithOccasionally nonlinear solvers will propose solutions $u$, where the function value (or the objective function set with `SNESSetObjective()`) contains infinity or NaN.
76c63389SBarry SmithThis can be due to bugs in the application code or because the proposed solution is not in the domain of the function. The application function can call `SNESSetFunctionDomainError()` or
76c63389SBarry Smith`SNESSetObjectiveDomainError()` to indicate $u$ is not in the function's domain.
76c63389SBarry Smith
76c63389SBarry SmithSome `SNESSolve()` implementations (and related `SNESLineSearchApply()` routines) attempt to recover from the infinity or NaN; generally by shrinking the step size.
76c63389SBarry SmithIf they are unable to recover the `SNESConvergedReason` returned will be `SNES_DIVERGED_FUNCTION_DOMAIN`, `SNES_DIVERGED_OJECTIVE_DOMAIN`, `SNES_DIVERGED_FUNCTION_NANORINF`, or `SNES_DIVERGED_OJECTIVE_NANORINF`.
76c63389SBarry Smith
76c63389SBarry Smith
7f296bb3SBarry Smith(sec_nlsolvers)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith## The Nonlinear Solvers
7f296bb3SBarry Smith
7f296bb3SBarry SmithAs summarized in Table {any}`tab-snesdefaults`, `SNES` includes
7f296bb3SBarry Smithseveral Newton-like nonlinear solvers based on line search techniques
7f296bb3SBarry Smithand trust region methods. Also provided are several nonlinear Krylov
7f296bb3SBarry Smithmethods, as well as nonlinear methods involving decompositions of the
7f296bb3SBarry Smithproblem.
7f296bb3SBarry Smith
7f296bb3SBarry SmithEach solver may have associated with it a set of options, which can be
7f296bb3SBarry Smithset with routines and options database commands provided for this
7f296bb3SBarry Smithpurpose. A complete list can be found by consulting the manual pages or
7f296bb3SBarry Smithby running a program with the `-help` option; we discuss just a few in
7f296bb3SBarry Smiththe sections below.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```{eval-rst}
7f296bb3SBarry Smith.. list-table:: PETSc Nonlinear Solvers
7f296bb3SBarry Smith   :name: tab-snesdefaults
7f296bb3SBarry Smith   :header-rows: 1
7f296bb3SBarry Smith
7f296bb3SBarry Smith   * - Method
7f296bb3SBarry Smith     - SNESType
7f296bb3SBarry Smith     - Options Name
7f296bb3SBarry Smith     - Default Line Search
7f296bb3SBarry Smith   * - Line Search Newton
7f296bb3SBarry Smith     - ``SNESNEWTONLS``
7f296bb3SBarry Smith     - ``newtonls``
7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
7f296bb3SBarry Smith   * - Trust region Newton
7f296bb3SBarry Smith     - ``SNESNEWTONTR``
7f296bb3SBarry Smith     - ``newtontr``
7f296bb3SBarry Smith     - —
7f296bb3SBarry Smith   * - Newton with Arc Length Continuation
7f296bb3SBarry Smith     - ``SNESNEWTONAL``
7f296bb3SBarry Smith     - ``newtonal``
7f296bb3SBarry Smith     - —
7f296bb3SBarry Smith   * - Nonlinear Richardson
7f296bb3SBarry Smith     - ``SNESNRICHARDSON``
7f296bb3SBarry Smith     - ``nrichardson``
a99ef635SJonas Heinzmann     - ``SNESLINESEARCHSECANT``
7f296bb3SBarry Smith   * - Nonlinear CG
7f296bb3SBarry Smith     - ``SNESNCG``
7f296bb3SBarry Smith     - ``ncg``
7f296bb3SBarry Smith     - ``SNESLINESEARCHCP``
7f296bb3SBarry Smith   * - Nonlinear GMRES
7f296bb3SBarry Smith     - ``SNESNGMRES``
7f296bb3SBarry Smith     - ``ngmres``
a99ef635SJonas Heinzmann     - ``SNESLINESEARCHSECANT``
7f296bb3SBarry Smith   * - Quasi-Newton
7f296bb3SBarry Smith     - ``SNESQN``
7f296bb3SBarry Smith     - ``qn``
7f296bb3SBarry Smith     - see :any:`tab-qndefaults`
7f296bb3SBarry Smith   * - Full Approximation Scheme
7f296bb3SBarry Smith     - ``SNESFAS``
7f296bb3SBarry Smith     - ``fas``
7f296bb3SBarry Smith     - —
7f296bb3SBarry Smith   * - Nonlinear ASM
7f296bb3SBarry Smith     - ``SNESNASM``
7f296bb3SBarry Smith     - ``nasm``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * - ASPIN
7f296bb3SBarry Smith     - ``SNESASPIN``
7f296bb3SBarry Smith     - ``aspin``
7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
7f296bb3SBarry Smith   * - Nonlinear Gauss-Seidel
7f296bb3SBarry Smith     - ``SNESNGS``
7f296bb3SBarry Smith     - ``ngs``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * - Anderson Mixing
7f296bb3SBarry Smith     - ``SNESANDERSON``
7f296bb3SBarry Smith     - ``anderson``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * -  Newton with constraints (1)
7f296bb3SBarry Smith     - ``SNESVINEWTONRSLS``
7f296bb3SBarry Smith     - ``vinewtonrsls``
7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
7f296bb3SBarry Smith   * -  Newton with constraints (2)
7f296bb3SBarry Smith     - ``SNESVINEWTONSSLS``
7f296bb3SBarry Smith     - ``vinewtonssls``
7f296bb3SBarry Smith     - ``SNESLINESEARCHBT``
7f296bb3SBarry Smith   * - Multi-stage Smoothers
7f296bb3SBarry Smith     - ``SNESMS``
7f296bb3SBarry Smith     - ``ms``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * - Composite
7f296bb3SBarry Smith     - ``SNESCOMPOSITE``
7f296bb3SBarry Smith     - ``composite``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * - Linear solve only
7f296bb3SBarry Smith     - ``SNESKSPONLY``
7f296bb3SBarry Smith     - ``ksponly``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * - Python Shell
7f296bb3SBarry Smith     - ``SNESPYTHON``
7f296bb3SBarry Smith     - ``python``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith   * - Shell (user-defined)
7f296bb3SBarry Smith     - ``SNESSHELL``
7f296bb3SBarry Smith     - ``shell``
7f296bb3SBarry Smith     - –
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Line Search Newton
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe method `SNESNEWTONLS` (`-snes_type newtonls`) provides a
7f296bb3SBarry Smithline search Newton method for solving systems of nonlinear equations. By
7f296bb3SBarry Smithdefault, this technique employs cubic backtracking
7f296bb3SBarry Smith{cite}`dennis:83`. Alternative line search techniques are
7f296bb3SBarry Smithlisted in Table {any}`tab-linesearches`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```{eval-rst}
7f296bb3SBarry Smith.. table:: PETSc Line Search Methods
7f296bb3SBarry Smith   :name: tab-linesearches
7f296bb3SBarry Smith
7f296bb3SBarry Smith   ==================== =========================== ================
7f296bb3SBarry Smith   **Line Search**      **SNESLineSearchType**      **Options Name**
7f296bb3SBarry Smith   ==================== =========================== ================
7f296bb3SBarry Smith   Backtracking         ``SNESLINESEARCHBT``        ``bt``
7f296bb3SBarry Smith   (damped) step        ``SNESLINESEARCHBASIC``     ``basic``
7f296bb3SBarry Smith   identical to above   ``SNESLINESEARCHNONE``      ``none``
a99ef635SJonas Heinzmann   Secant method        ``SNESLINESEARCHSECANT``    ``secant``
7f296bb3SBarry Smith   Critical point       ``SNESLINESEARCHCP``        ``cp``
a99ef635SJonas Heinzmann   Error-oriented       ``SNESLINESEARCHNLEQERR``   ``nleqerr``
7f296bb3SBarry Smith   Bisection            ``SNESLINESEARCHBISECTION`` ``bisection``
7f296bb3SBarry Smith   Shell                ``SNESLINESEARCHSHELL``     ``shell``
7f296bb3SBarry Smith   ==================== =========================== ================
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithEvery `SNES` has a line search context of type `SNESLineSearch` that
7f296bb3SBarry Smithmay be retrieved using
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESGetLineSearch(SNES snes, SNESLineSearch *ls);.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThere are several default options for the line searches. The order of
7f296bb3SBarry Smithpolynomial approximation may be set with `-snes_linesearch_order` or
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESLineSearchSetOrder(SNESLineSearch ls, PetscInt order);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithfor instance, 2 for quadratic or 3 for cubic. Sometimes, it may not be
7f296bb3SBarry Smithnecessary to monitor the progress of the nonlinear iteration. In this
7f296bb3SBarry Smithcase, `-snes_linesearch_norms` or
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESLineSearchSetComputeNorms(SNESLineSearch ls, PetscBool norms);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithmay be used to turn off function, step, and solution norm computation at
7f296bb3SBarry Smiththe end of the linesearch.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe default line search for the line search Newton method,
7f296bb3SBarry Smith`SNESLINESEARCHBT` involves several parameters, which are set to
7f296bb3SBarry Smithdefaults that are reasonable for many applications. The user can
7f296bb3SBarry Smithoverride the defaults by using the following options:
7f296bb3SBarry Smith
7f296bb3SBarry Smith- `-snes_linesearch_alpha <alpha>`
7f296bb3SBarry Smith- `-snes_linesearch_maxstep <max>`
7f296bb3SBarry Smith- `-snes_linesearch_minlambda <tol>`
7f296bb3SBarry Smith
a99ef635SJonas HeinzmannBesides the backtracking linesearch, there are `SNESLINESEARCHSECANT`,
a99ef635SJonas Heinzmannwhich uses a polynomial secant minimization of $||F(x)||_2$ or an objective function
a99ef635SJonas Heinzmannif set, and `SNESLINESEARCHCP`, which minimizes $F(x) \cdot Y$ where
7f296bb3SBarry Smith$Y$ is the search direction. These are both potentially iterative
7f296bb3SBarry Smithline searches, which may be used to find a better-fitted steplength in
7f296bb3SBarry Smiththe case where a single secant search is not sufficient. The number of
7f296bb3SBarry Smithiterations may be set with `-snes_linesearch_max_it`. In addition, the
7f296bb3SBarry Smithconvergence criteria of the iterative line searches may be set using
7f296bb3SBarry Smithfunction tolerances `-snes_linesearch_rtol` and
7f296bb3SBarry Smith`-snes_linesearch_atol`, and steplength tolerance
7f296bb3SBarry Smith`snes_linesearch_ltol`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithFor highly non-linear problems, the bisection line search `SNESLINESEARCHBISECTION`
7f296bb3SBarry Smithmay prove useful due to its robustness. Similar to the critical point line search
7f296bb3SBarry Smith`SNESLINESEARCHCP`, it seeks to find the root of $F(x) \cdot Y$.
7f296bb3SBarry SmithWhile the latter does so through a secant method, the bisection line search
7f296bb3SBarry Smithdoes so by iteratively bisecting the step length interval.
7f296bb3SBarry SmithIt works as follows (with $f(\lambda)=F(x-\lambda Y) \cdot Y / ||Y||$ for brevity):
7f296bb3SBarry Smith
7f296bb3SBarry Smith1. initialize: $j=1$, $\lambda_0 = \lambda_{\text{left}} = 0.0$, $\lambda_j = \lambda_{\text{right}} = \alpha$, compute $f(\lambda_0)$ and $f(\lambda_j)$
7f296bb3SBarry Smith
7f296bb3SBarry Smith2. check whether there is a change of sign in the interval: $f(\lambda_{\text{left}}) f(\lambda_j) \leq 0$; if not accept the full step length $\lambda_1$
7f296bb3SBarry Smith
7f296bb3SBarry Smith3. if there is a change of sign, enter iterative bisection procedure
7f296bb3SBarry Smith
7f296bb3SBarry Smith   1. check convergence/ exit criteria:
7f296bb3SBarry Smith
7f296bb3SBarry Smith      - absolute tolerance $f(\lambda_j) < \mathtt{atol}$
7f296bb3SBarry Smith      - relative tolerance $f(\lambda_j) < \mathtt{rtol} \cdot f(\lambda_0)$
7f296bb3SBarry Smith      - change of step length $\lambda_j - \lambda_{j-1} < \mathtt{ltol}$
7f296bb3SBarry Smith      - number of iterations $j < \mathtt{max\_it}$
7f296bb3SBarry Smith
7f296bb3SBarry Smith   2. if $j > 1$, determine direction of bisection
7f296bb3SBarry Smith
7f296bb3SBarry Smith   $$
7f296bb3SBarry Smith   \begin{aligned}\lambda_{\text{left}} &= \begin{cases}\lambda_{\text{left}} &f(\lambda_{\text{left}}) f(\lambda_j) \leq 0\\\lambda_{j} &\text{else}\\ \end{cases}\\ \lambda_{\text{right}} &= \begin{cases} \lambda_j &f(\lambda_{\text{left}}) f(\lambda_j) \leq 0\\\lambda_{\text{right}} &\text{else}\\ \end{cases}\\\end{aligned}
7f296bb3SBarry Smith   $$
7f296bb3SBarry Smith
7f296bb3SBarry Smith   3. bisect the interval: $\lambda_{j+1} = (\lambda_{\text{left}} + \lambda_{\text{right}})/2$, compute $f(\lambda_{j+1})$
7f296bb3SBarry Smith   4. update variables for the next iteration: $\lambda_j \gets \lambda_{j+1}$, $f(\lambda_j) \gets f(\lambda_{j+1})$, $j \gets j+1$
7f296bb3SBarry Smith
7f296bb3SBarry SmithCustom line search types may either be defined using
7f296bb3SBarry Smith`SNESLineSearchShell`, or by creating a custom user line search type
7f296bb3SBarry Smithin the model of the preexisting ones and register it using
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESLineSearchRegister(const char sname[], PetscErrorCode (*function)(SNESLineSearch));.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Trust Region Methods
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe trust region method in `SNES` for solving systems of nonlinear
7f296bb3SBarry Smithequations, `SNESNEWTONTR` (`-snes_type newtontr`), is similar to the one developed in the
7f296bb3SBarry SmithMINPACK project {cite}`more84`. Several parameters can be
7f296bb3SBarry Smithset to control the variation of the trust region size during the
7f296bb3SBarry Smithsolution process. In particular, the user can control the initial trust
7f296bb3SBarry Smithregion radius, computed by
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\Delta = \Delta_0 \| F_0 \|_2,
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithby setting $\Delta_0$ via the option `-snes_tr_delta0 <delta0>`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Newton with Arc Length Continuation
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe Newton method with arc length continuation reformulates the linearized system
7f296bb3SBarry Smith$K\delta \mathbf x = -\mathbf F(\mathbf x)$ by introducing the load parameter
7f296bb3SBarry Smith$\lambda$ and splitting the residual into two components, commonly
7f296bb3SBarry Smithcorresponding to internal and external forces:
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf F(x, \lambda) = \mathbf F^{\mathrm{int}}(\mathbf x) - \mathbf F^{\mathrm{ext}}(\mathbf x, \lambda)
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithOften, $\mathbf F^{\mathrm{ext}}(\mathbf x, \lambda)$ is linear in $\lambda$,
7f296bb3SBarry Smithwhich can be thought of as applying the external force in proportional load
7f296bb3SBarry Smithincrements. By default, this is how the right-hand side vector is handled in the
7f296bb3SBarry Smithimplemented method. Generally, however, $\mathbf F^{\mathrm{ext}}(\mathbf x, \lambda)$
7f296bb3SBarry Smithmay depend non-linearly on $\lambda$ or $\mathbf x$, or both.
7f296bb3SBarry SmithTo accommodate this possibility, we provide the `SNESNewtonALGetLoadParameter()`
7f296bb3SBarry Smithfunction, which allows for the current value of $\lambda$ to be queried in the
7f296bb3SBarry Smithfunctions provided to `SNESSetFunction()` and `SNESSetJacobian()`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithAdditionally, we split the solution update into two components:
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\delta \mathbf x = \delta s\delta\mathbf x^F + \delta\lambda\delta\mathbf x^Q,
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $\delta s = 1$ unless partial corrections are used (discussed more
7f296bb3SBarry Smithbelow). Each of $\delta \mathbf x^F$ and $\delta \mathbf x^Q$ are found via
7f296bb3SBarry Smithsolving a linear system with the Jacobian $K$:
7f296bb3SBarry Smith
7f296bb3SBarry Smith- $\delta \mathbf x^F$ is the full Newton step for a given value of $\lambda$: $K \delta \mathbf x^F = -\mathbf F(\mathbf x, \lambda)$
7f296bb3SBarry Smith- $\delta \mathbf x^Q$ is the variation in $\mathbf x$ with respect to $\lambda$, computed by $K \delta\mathbf x^Q = \mathbf Q(\mathbf x, \lambda)$, where $\mathbf Q(\mathbf x, \lambda) = -\partial \mathbf F (\mathbf x, \lambda) / \partial \lambda$ is the tangent load vector.
7f296bb3SBarry Smith
7f296bb3SBarry SmithOften, the tangent load vector $\mathbf Q$ is constant within a load increment,
7f296bb3SBarry Smithwhich corresponds to the case of proportional loading discussed above. By default,
7f296bb3SBarry Smith$\mathbf Q$ is the full right-hand-side vector, if one was provided.
7f296bb3SBarry SmithThe user can also provide a function which computes $\mathbf Q$ to
7f296bb3SBarry Smith`SNESNewtonALSetFunction()`. This function should have the same signature as for
7f296bb3SBarry Smith`SNESSetFunction`, and the user should use `SNESNewtonALGetLoadParameter()` to get
7f296bb3SBarry Smith$\lambda$ if it is needed.
7f296bb3SBarry Smith
7f296bb3SBarry Smith**The Constraint Surface.** Considering the $n+1$ dimensional space of
7f296bb3SBarry Smith$\mathbf x$ and $\lambda$, we define the linearized equilibrium line to be
7f296bb3SBarry Smiththe set of points for which the linearized equilibrium equations are satisfied.
7f296bb3SBarry SmithGiven the previous iterative solution
7f296bb3SBarry Smith$\mathbf t^{(j-1)} = [\mathbf x^{(j-1)}, \lambda^{(j-1)}]$,
7f296bb3SBarry Smiththis line is defined by the point $\mathbf t^{(j-1)} + [\delta\mathbf x^F, 0]$ and
7f296bb3SBarry Smiththe vector $\mathbf t^Q [\delta\mathbf x^Q, 1]$.
7f296bb3SBarry SmithThe arc length method seeks the intersection of this linearized equilibrium line
7f296bb3SBarry Smithwith a quadratic constraint surface, defined by
7f296bb3SBarry Smith
7f296bb3SBarry Smith% math::L^2 = \|\Delta x\|^2 + \psi^2 (\Delta\lambda)^2,
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $L$ is a user-provided step size corresponding to the radius of the
7f296bb3SBarry Smithconstraint surface, $\Delta\mathbf x$ and $\Delta\lambda$ are the
7f296bb3SBarry Smithaccumulated updates over the current load step, and $\psi^2$ is a
7f296bb3SBarry Smithuser-provided consistency parameter determining the shape of the constraint surface.
7f296bb3SBarry SmithGenerally, $\psi^2 > 0$ leads to a hyper-sphere constraint surface, while
7f296bb3SBarry Smith$\psi^2 = 0$ leads to a hyper-cylinder constraint surface.
7f296bb3SBarry Smith
7f296bb3SBarry SmithSince the solution will always fall on the constraint surface, the method will often
7f296bb3SBarry Smithrequire multiple incremental steps to fully solve the non-linear problem.
7f296bb3SBarry SmithThis is necessary to accurately trace the equilibrium path.
7f296bb3SBarry SmithImportantly, this is fundamentally different from time stepping.
7f296bb3SBarry SmithWhile a similar process could be implemented as a `TS`, this method is
7f296bb3SBarry Smithparticularly designed to be used as a SNES, either standalone or within a `TS`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithTo this end, by default, the load parameter is used such that the full external
7f296bb3SBarry Smithforces are applied at $\lambda = 1$, although we allow for the user to specify
7f296bb3SBarry Smitha different value via `-snes_newtonal_lambda_max`.
7f296bb3SBarry SmithTo ensure that the solution corresponds exactly to the external force prescribed by
7f296bb3SBarry Smiththe user, i.e. that the load parameter is exactly $\lambda_{max}$ at the end
7f296bb3SBarry Smithof the SNES solve, we clamp the value before computing the solution update.
7f296bb3SBarry SmithAs such, the final increment will likely be a hybrid of arc length continuation and
7f296bb3SBarry Smithnormal Newton iterations.
7f296bb3SBarry Smith
7f296bb3SBarry Smith**Choosing the Continuation Step.** For the first iteration from an equilibrium
7f296bb3SBarry Smithpoint, there is a single correct way to choose $\delta\lambda$, which follows
7f296bb3SBarry Smithfrom the constraint equations. Specifically the constraint equations yield the
7f296bb3SBarry Smithquadratic equation $a\delta\lambda^2 + b\delta\lambda + c = 0$, where
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\begin{aligned}
7f296bb3SBarry Smitha &= \|\delta\mathbf x^Q\|^2 + \psi^2,\\
7f296bb3SBarry Smithb &= 2\delta\mathbf x^Q\cdot (\Delta\mathbf x + \delta s\delta\mathbf x^F) + 2\psi^2 \Delta\lambda,\\
7f296bb3SBarry Smithc &= \|\Delta\mathbf x + \delta s\delta\mathbf x^F\|^2 + \psi^2 \Delta\lambda^2 - L^2.
7f296bb3SBarry Smith\end{aligned}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithSince in the first iteration, $\Delta\mathbf x = \delta\mathbf x^F = \mathbf 0$ and
7f296bb3SBarry Smith$\Delta\lambda = 0$, $b = 0$ and the equation simplifies to a pair of
7f296bb3SBarry Smithreal roots:
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\delta\lambda = \pm\frac{L}{\sqrt{\|\delta\mathbf x^Q\|^2 + \psi^2}},
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere the sign is positive for the first increment and is determined by the previous
7f296bb3SBarry Smithincrement otherwise as
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\text{sign}(\delta\lambda) = \text{sign}\big(\delta\mathbf x^Q \cdot (\Delta\mathbf x)_{i-1} + \psi^2(\Delta\lambda)_{i-1}\big),
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $(\Delta\mathbf x)_{i-1}$ and $(\Delta\lambda)_{i-1}$ are the
7f296bb3SBarry Smithaccumulated updates over the previous load step.
7f296bb3SBarry Smith
7f296bb3SBarry SmithIn subsequent iterations, there are different approaches to selecting
7f296bb3SBarry Smith$\delta\lambda$, all of which have trade-offs.
7f296bb3SBarry SmithThe main difference is whether the iterative solution falls on the constraint
7f296bb3SBarry Smithsurface at every iteration, or only when fully converged.
10999371SStefano ZampiniPETSc implements two approaches, set via
7f296bb3SBarry Smith`SNESNewtonALSetCorrectionType()` or
7f296bb3SBarry Smith`-snes_newtonal_correction_type <normal|exact>` on the command line.
7f296bb3SBarry Smith
7f296bb3SBarry Smith**Corrections in the Normal Hyperplane.** The `SNES_NEWTONAL_CORRECTION_NORMAL`
7f296bb3SBarry Smithoption is simpler and computationally less expensive, but may fail to converge, as
7f296bb3SBarry Smiththe constraint equation is not satisfied at every iteration.
7f296bb3SBarry SmithThe update $\delta \lambda$ is chosen such that the update is within the
7f296bb3SBarry Smithnormal hyper-surface to the quadratic constraint surface.
7f296bb3SBarry SmithMathematically, that is
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\delta \lambda = -\frac{\Delta \mathbf x \cdot \delta \mathbf x^F}{\Delta\mathbf x \cdot \delta\mathbf x^Q + \psi^2 \Delta\lambda}.
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis implementation is based on {cite}`LeonPaulinoPereiraMenezesLages_2011`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith**Exact Corrections.** The `SNES_NEWTONAL_CORRECTION_EXACT` option is far more
7f296bb3SBarry Smithcomplex, but ensures that the constraint is exactly satisfied at every Newton
7f296bb3SBarry Smithiteration. As such, it is generally more robust.
7f296bb3SBarry SmithBy evaluating the intersection of constraint surface and equilibrium line at each
7f296bb3SBarry Smithiteration, $\delta\lambda$ is chosen as one of the roots of the above
7f296bb3SBarry Smithquadratic equation $a\delta\lambda^2 + b\delta\lambda + c = 0$.
7f296bb3SBarry SmithThis method encounters issues, however, if the linearized equilibrium line and
7f296bb3SBarry Smithconstraint surface do not intersect due to particularly large linearized error.
7f296bb3SBarry SmithIn this case, the roots are complex.
7f296bb3SBarry SmithTo continue progressing toward a solution, this method uses a partial correction by
7f296bb3SBarry Smithchoosing $\delta s$ such that the quadratic equation has a single real root.
7f296bb3SBarry SmithGeometrically, this is selecting the point on the constraint surface closest to the
7f296bb3SBarry Smithlinearized equilibrium line. See the code or {cite}`Ritto-CorreaCamotim2008` for a
7f296bb3SBarry Smithmathematical description of these partial corrections.
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Nonlinear Krylov Methods
7f296bb3SBarry Smith
7f296bb3SBarry SmithA number of nonlinear Krylov methods are provided, including Nonlinear
7f296bb3SBarry SmithRichardson (`SNESNRICHARDSON`), nonlinear conjugate gradient (`SNESNCG`), nonlinear GMRES (`SNESNGMRES`), and Anderson Mixing (`SNESANDERSON`). These
7f296bb3SBarry Smithmethods are described individually below. They are all instrumental to
7f296bb3SBarry SmithPETSc’s nonlinear preconditioning.
7f296bb3SBarry Smith
7f296bb3SBarry Smith**Nonlinear Richardson.** The nonlinear Richardson iteration, `SNESNRICHARDSON`, merely
7f296bb3SBarry Smithtakes the form of a line search-damped fixed-point iteration of the form
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{x}_{k+1} = \mathbf{x}_k - \lambda \mathbf{F}(\mathbf{x}_k), \;\; k=0,1, \ldots,
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
a99ef635SJonas Heinzmannwhere the default linesearch is `SNESLINESEARCHSECANT`. This simple solver
7f296bb3SBarry Smithis mostly useful as a nonlinear smoother, or to provide line search
7f296bb3SBarry Smithstabilization to an inner method.
7f296bb3SBarry Smith
7f296bb3SBarry Smith**Nonlinear Conjugate Gradients.** Nonlinear CG, `SNESNCG`, is equivalent to linear
7f296bb3SBarry SmithCG, but with the steplength determined by line search
7f296bb3SBarry Smith(`SNESLINESEARCHCP` by default). Five variants (Fletcher-Reed,
7f296bb3SBarry SmithHestenes-Steifel, Polak-Ribiere-Polyak, Dai-Yuan, and Conjugate Descent)
7f296bb3SBarry Smithare implemented in PETSc and may be chosen using
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESNCGSetType(SNES snes, SNESNCGType btype);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith**Anderson Mixing and Nonlinear GMRES Methods.** Nonlinear GMRES (`SNESNGMRES`), and
7f296bb3SBarry SmithAnderson Mixing (`SNESANDERSON`) methods combine the last $m$ iterates, plus a new
7f296bb3SBarry Smithfixed-point iteration iterate, into an approximate residual-minimizing new iterate.
7f296bb3SBarry Smith
7f296bb3SBarry SmithAll of the above methods have support for using a nonlinear preconditioner to compute the preliminary update step, rather than the default
7f296bb3SBarry Smithwhich is the nonlinear function's residual, \$ mathbf\{F}(mathbf\{x}\_k)\$. The different update is obtained by solving a nonlinear preconditioner nonlinear problem, which has its own
7f296bb3SBarry Smith`SNES` object that may be obtained with `SNESGetNPC()`.
7f296bb3SBarry SmithQuasi-Newton Methods
7f296bb3SBarry Smith^^^^^^^^^^^^^^^^^^^^
7f296bb3SBarry Smith
7f296bb3SBarry SmithQuasi-Newton methods store iterative rank-one updates to the Jacobian
7f296bb3SBarry Smithinstead of computing the Jacobian directly. Three limited-memory quasi-Newton
7f296bb3SBarry Smithmethods are provided, L-BFGS, which are described in
7f296bb3SBarry SmithTable {any}`tab-qndefaults`. These all are encapsulated under
7f296bb3SBarry Smith`-snes_type qn` and may be changed with `snes_qn_type`. The default
7f296bb3SBarry Smithis L-BFGS, which provides symmetric updates to an approximate Jacobian.
7f296bb3SBarry SmithThis iteration is similar to the line search Newton methods.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe quasi-Newton methods support the use of a nonlinear preconditioner that can be obtained with `SNESGetNPC()` and then configured; or that can be configured with
7f296bb3SBarry Smith`SNES`, `KSP`, and `PC` options using the options database prefix `-npc_`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```{eval-rst}
7f296bb3SBarry Smith.. list-table:: PETSc quasi-Newton solvers
7f296bb3SBarry Smith   :name: tab-qndefaults
7f296bb3SBarry Smith   :header-rows: 1
7f296bb3SBarry Smith
7f296bb3SBarry Smith   * - QN Method
7f296bb3SBarry Smith     - ``SNESQNType``
7f296bb3SBarry Smith     - Options Name
7f296bb3SBarry Smith     - Default Line Search
7f296bb3SBarry Smith   * - L-BFGS
7f296bb3SBarry Smith     - ``SNES_QN_LBFGS``
7f296bb3SBarry Smith     - ``lbfgs``
7f296bb3SBarry Smith     - ``SNESLINESEARCHCP``
7f296bb3SBarry Smith   * - “Good” Broyden
7f296bb3SBarry Smith     - ``SNES_QN_BROYDEN``
7f296bb3SBarry Smith     - ``broyden``
7f296bb3SBarry Smith     - ``SNESLINESEARCHBASIC`` (or equivalently ``SNESLINESEARCHNONE``
7f296bb3SBarry Smith   * - “Bad” Broyden
7f296bb3SBarry Smith     - ``SNES_QN_BADBROYDEN``
7f296bb3SBarry Smith     - ``badbroyden``
a99ef635SJonas Heinzmann     - ``SNESLINESEARCHSECANT``
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithOne may also control the form of the initial Jacobian approximation with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESQNSetScaleType(SNES snes, SNESQNScaleType stype);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithand the restart type with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESQNSetRestartType(SNES snes, SNESQNRestartType rtype);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith### The Full Approximation Scheme
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe Nonlinear Full Approximation Scheme (FAS) `SNESFAS`, is a nonlinear multigrid method. At
7f296bb3SBarry Smitheach level, there is a recursive cycle control `SNES` instance, and
7f296bb3SBarry Smitheither one or two nonlinear solvers that act as smoothers (up and down). Problems
7f296bb3SBarry Smithset up using the `SNES` `DMDA` interface are automatically
7f296bb3SBarry Smithcoarsened. FAS, `SNESFAS`, differs slightly from linear multigrid `PCMG`, in that the hierarchy is
7f296bb3SBarry Smithconstructed recursively. However, much of the interface is a one-to-one
7f296bb3SBarry Smithmap. We describe the “get” operations here, and it can be assumed that
7f296bb3SBarry Smitheach has a corresponding “set” operation. For instance, the number of
7f296bb3SBarry Smithlevels in the hierarchy may be retrieved using
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASGetLevels(SNES snes, PetscInt *levels);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThere are four `SNESFAS` cycle types, `SNES_FAS_MULTIPLICATIVE`,
7f296bb3SBarry Smith`SNES_FAS_ADDITIVE`, `SNES_FAS_FULL`, and `SNES_FAS_KASKADE`. The
7f296bb3SBarry Smithtype may be set with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASSetType(SNES snes, SNESFASType fastype);.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithand the cycle type, 1 for V, 2 for W, may be set with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASSetCycles(SNES snes, PetscInt cycles);.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithMuch like the interface to `PCMG` described in {any}`sec_mg`, there are interfaces to recover the
7f296bb3SBarry Smithvarious levels’ cycles and smoothers. The level smoothers may be
7f296bb3SBarry Smithaccessed with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASGetSmoother(SNES snes, PetscInt level, SNES *smooth);
7f296bb3SBarry SmithSNESFASGetSmootherUp(SNES snes, PetscInt level, SNES *smooth);
7f296bb3SBarry SmithSNESFASGetSmootherDown(SNES snes, PetscInt level, SNES *smooth);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithand the level cycles with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASGetCycleSNES(SNES snes, PetscInt level, SNES *lsnes);.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithAlso akin to `PCMG`, the restriction and prolongation at a level may
7f296bb3SBarry Smithbe acquired with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASGetInterpolation(SNES snes, PetscInt level, Mat *mat);
7f296bb3SBarry SmithSNESFASGetRestriction(SNES snes, PetscInt level, Mat *mat);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithIn addition, FAS requires special restriction for solution-like
7f296bb3SBarry Smithvariables, called injection. This may be set with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASGetInjection(SNES snes, PetscInt level, Mat *mat);.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe coarse solve context may be acquired with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESFASGetCoarseSolve(SNES snes, SNES *smooth);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Nonlinear Additive Schwarz
7f296bb3SBarry Smith
7f296bb3SBarry SmithNonlinear Additive Schwarz methods (NASM) take a number of local
7f296bb3SBarry Smithnonlinear subproblems, solves them independently in parallel, and
7f296bb3SBarry Smithcombines those solutions into a new approximate solution.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESNASMSetSubdomains(SNES snes, PetscInt n, SNES subsnes[], VecScatter iscatter[], VecScatter oscatter[], VecScatter gscatter[]);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithallows for the user to create these local subdomains. Problems set up
7f296bb3SBarry Smithusing the `SNES` `DMDA` interface are automatically decomposed. To
7f296bb3SBarry Smithbegin, the type of subdomain updates to the whole solution are limited
7f296bb3SBarry Smithto two types borrowed from `PCASM`: `PC_ASM_BASIC`, in which the
7f296bb3SBarry Smithoverlapping updates added. `PC_ASM_RESTRICT` updates in a
7f296bb3SBarry Smithnonoverlapping fashion. This may be set with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESNASMSetType(SNES snes, PCASMType type);.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith`SNESASPIN` is a helper `SNES` type that sets up a nonlinearly
7f296bb3SBarry Smithpreconditioned Newton’s method using NASM as the preconditioner.
7f296bb3SBarry Smith
7f296bb3SBarry Smith## General Options
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis section discusses options and routines that apply to all `SNES`
7f296bb3SBarry Smithsolvers and problem classes. In particular, we focus on convergence
7f296bb3SBarry Smithtests, monitoring routines, and tools for checking derivative
7f296bb3SBarry Smithcomputations.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snesconvergence)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Convergence Tests
7f296bb3SBarry Smith
7f296bb3SBarry SmithConvergence of the nonlinear solvers can be detected in a variety of
7f296bb3SBarry Smithways; the user can even specify a customized test, as discussed below.
7f296bb3SBarry SmithMost of the nonlinear solvers use `SNESConvergenceTestDefault()`,
7f296bb3SBarry Smithhowever, `SNESNEWTONTR` uses a method-specific additional convergence
7f296bb3SBarry Smithtest as well. The convergence tests involves several parameters, which
7f296bb3SBarry Smithare set by default to values that should be reasonable for a wide range
7f296bb3SBarry Smithof problems. The user can customize the parameters to the problem at
7f296bb3SBarry Smithhand by using some of the following routines and options.
7f296bb3SBarry Smith
7f296bb3SBarry SmithOne method of convergence testing is to declare convergence when the
7f296bb3SBarry Smithnorm of the change in the solution between successive iterations is less
7f296bb3SBarry Smiththan some tolerance, `stol`. Convergence can also be determined based
7f296bb3SBarry Smithon the norm of the function. Such a test can use either the absolute
7f296bb3SBarry Smithsize of the norm, `atol`, or its relative decrease, `rtol`, from an
7f296bb3SBarry Smithinitial guess. The following routine sets these parameters, which are
7f296bb3SBarry Smithused in many of the default `SNES` convergence tests:
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESSetTolerances(SNES snes, PetscReal atol, PetscReal rtol, PetscReal stol, PetscInt its, PetscInt fcts);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis routine also sets the maximum numbers of allowable nonlinear
7f296bb3SBarry Smithiterations, `its`, and function evaluations, `fcts`. The
7f296bb3SBarry Smithcorresponding options database commands for setting these parameters are:
7f296bb3SBarry Smith
7f296bb3SBarry Smith- `-snes_atol <atol>`
7f296bb3SBarry Smith- `-snes_rtol <rtol>`
7f296bb3SBarry Smith- `-snes_stol <stol>`
7f296bb3SBarry Smith- `-snes_max_it <its>`
7f296bb3SBarry Smith- `-snes_max_funcs <fcts>` (use `unlimited` for no maximum)
7f296bb3SBarry Smith
7f296bb3SBarry SmithA related routine is `SNESGetTolerances()`. `PETSC_CURRENT` may be used
7f296bb3SBarry Smithfor any parameter to indicate the current value should be retained; use `PETSC_DETERMINE` to restore to the default value from when the object was created.
7f296bb3SBarry Smith
7f296bb3SBarry SmithUsers can set their own customized convergence tests in `SNES` by
7f296bb3SBarry Smithusing the command
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
*2a8381b2SBarry SmithSNESSetConvergenceTest(SNES snes, PetscErrorCode (*test)(SNES snes, PetscInt it, PetscReal xnorm, PetscReal gnorm, PetscReal f, SNESConvergedReason reason, PetscCtx cctx), PetscCtx cctx, PetscCtxDestroyFn *destroy);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe final argument of the convergence test routine, `cctx`, denotes an
7f296bb3SBarry Smithoptional user-defined context for private data. When solving systems of
7f296bb3SBarry Smithnonlinear equations, the arguments `xnorm`, `gnorm`, and `f` are
7f296bb3SBarry Smiththe current iterate norm, current step norm, and function norm,
7f296bb3SBarry Smithrespectively. `SNESConvergedReason` should be set positive for
7f296bb3SBarry Smithconvergence and negative for divergence. See `include/petscsnes.h` for
7f296bb3SBarry Smitha list of values for `SNESConvergedReason`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snesmonitor)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Convergence Monitoring
7f296bb3SBarry Smith
7f296bb3SBarry SmithBy default the `SNES` solvers run silently without displaying
7f296bb3SBarry Smithinformation about the iterations. The user can initiate monitoring with
7f296bb3SBarry Smiththe command
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
*2a8381b2SBarry SmithSNESMonitorSet(SNES snes, PetscErrorCode (*mon)(SNES snes, PetscInt its, PetscReal norm, PetscCtx mctx), PetscCtx mctx, (PetscCtxDestroyFn *)*monitordestroy);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe routine, `mon`, indicates a user-defined monitoring routine, where
7f296bb3SBarry Smith`its` and `mctx` respectively denote the iteration number and an
7f296bb3SBarry Smithoptional user-defined context for private data for the monitor routine.
7f296bb3SBarry SmithThe argument `norm` is the function norm.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe routine set by `SNESMonitorSet()` is called once after every
7f296bb3SBarry Smithsuccessful step computation within the nonlinear solver. Hence, the user
7f296bb3SBarry Smithcan employ this routine for any application-specific computations that
7f296bb3SBarry Smithshould be done after the solution update. The option `-snes_monitor`
7f296bb3SBarry Smithactivates the default `SNES` monitor routine,
7f296bb3SBarry Smith`SNESMonitorDefault()`, while `-snes_monitor_lg_residualnorm` draws
7f296bb3SBarry Smitha simple line graph of the residual norm’s convergence.
7f296bb3SBarry Smith
7f296bb3SBarry SmithOne can cancel hardwired monitoring routines for `SNES` at runtime
7f296bb3SBarry Smithwith `-snes_monitor_cancel`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithAs the Newton method converges so that the residual norm is small, say
7f296bb3SBarry Smith$10^{-10}$, many of the final digits printed with the
7f296bb3SBarry Smith`-snes_monitor` option are meaningless. Worse, they are different on
7f296bb3SBarry Smithdifferent machines; due to different round-off rules used by, say, the
7f296bb3SBarry SmithIBM RS6000 and the Sun SPARC. This makes testing between different
7f296bb3SBarry Smithmachines difficult. The option `-snes_monitor_short` causes PETSc to
7f296bb3SBarry Smithprint fewer of the digits of the residual norm as it gets smaller; thus
7f296bb3SBarry Smithon most of the machines it will always print the same numbers making
7f296bb3SBarry Smithcross-process testing easier.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe routines
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESGetSolution(SNES snes, Vec *x);
*2a8381b2SBarry SmithSNESGetFunction(SNES snes, Vec *r, PetscCtxRt ctx, int(**func)(SNES, Vec, Vec, PetscCtx));
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithreturn the solution vector and function vector from a `SNES` context.
7f296bb3SBarry SmithThese routines are useful, for instance, if the convergence test
7f296bb3SBarry Smithrequires some property of the solution or function other than those
7f296bb3SBarry Smithpassed with routine arguments.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snesderivs)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith### Checking Accuracy of Derivatives
7f296bb3SBarry Smith
7f296bb3SBarry SmithSince hand-coding routines for Jacobian matrix evaluation can be error
7f296bb3SBarry Smithprone, `SNES` provides easy-to-use support for checking these matrices
7f296bb3SBarry Smithagainst finite difference versions. In the simplest form of comparison,
7f296bb3SBarry Smithusers can employ the option `-snes_test_jacobian` to compare the
7f296bb3SBarry Smithmatrices at several points. Although not exhaustive, this test will
7f296bb3SBarry Smithgenerally catch obvious problems. One can compare the elements of the
7f296bb3SBarry Smithtwo matrices by using the option `-snes_test_jacobian_view` , which
7f296bb3SBarry Smithcauses the two matrices to be printed to the screen.
7f296bb3SBarry Smith
7f296bb3SBarry SmithAnother means for verifying the correctness of a code for Jacobian
7f296bb3SBarry Smithcomputation is running the problem with either the finite difference or
7f296bb3SBarry Smithmatrix-free variant, `-snes_fd` or `-snes_mf`; see {any}`sec_fdmatrix` or {any}`sec_nlmatrixfree`.
7f296bb3SBarry SmithIf a
7f296bb3SBarry Smithproblem converges well with these matrix approximations but not with a
7f296bb3SBarry Smithuser-provided routine, the problem probably lies with the hand-coded
7f296bb3SBarry Smithmatrix. See the note in {any}`sec_snesjacobian` about
7f296bb3SBarry Smithassembling your Jabobian in the "preconditioner" slot `Pmat`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe correctness of user provided `MATSHELL` Jacobians in general can be
7f296bb3SBarry Smithchecked with `MatShellTestMultTranspose()` and `MatShellTestMult()`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe correctness of user provided `MATSHELL` Jacobians via `TSSetRHSJacobian()`
7f296bb3SBarry Smithcan be checked with `TSRHSJacobianTestTranspose()` and `TSRHSJacobianTest()`
7f296bb3SBarry Smiththat check the correction of the matrix-transpose vector product and the
7f296bb3SBarry Smithmatrix-product. From the command line, these can be checked with
7f296bb3SBarry Smith
7f296bb3SBarry Smith- `-ts_rhs_jacobian_test_mult_transpose`
7f296bb3SBarry Smith- `-mat_shell_test_mult_transpose_view`
7f296bb3SBarry Smith- `-ts_rhs_jacobian_test_mult`
7f296bb3SBarry Smith- `-mat_shell_test_mult_view`
7f296bb3SBarry Smith
7f296bb3SBarry Smith## Inexact Newton-like Methods
7f296bb3SBarry Smith
7f296bb3SBarry SmithSince exact solution of the linear Newton systems within {math:numref}`newton`
7f296bb3SBarry Smithat each iteration can be costly, modifications
7f296bb3SBarry Smithare often introduced that significantly reduce these expenses and yet
7f296bb3SBarry Smithretain the rapid convergence of Newton’s method. Inexact or truncated
7f296bb3SBarry SmithNewton techniques approximately solve the linear systems using an
7f296bb3SBarry Smithiterative scheme. In comparison with using direct methods for solving
7f296bb3SBarry Smiththe Newton systems, iterative methods have the virtue of requiring
7f296bb3SBarry Smithlittle space for matrix storage and potentially saving significant
7f296bb3SBarry Smithcomputational work. Within the class of inexact Newton methods, of
7f296bb3SBarry Smithparticular interest are Newton-Krylov methods, where the subsidiary
7f296bb3SBarry Smithiterative technique for solving the Newton system is chosen from the
7f296bb3SBarry Smithclass of Krylov subspace projection methods. Note that at runtime the
7f296bb3SBarry Smithuser can set any of the linear solver options discussed in {any}`ch_ksp`,
7f296bb3SBarry Smithsuch as `-ksp_type <ksp_method>` and
7f296bb3SBarry Smith`-pc_type <pc_method>`, to set the Krylov subspace and preconditioner
7f296bb3SBarry Smithmethods.
7f296bb3SBarry Smith
7f296bb3SBarry SmithTwo levels of iterations occur for the inexact techniques, where during
7f296bb3SBarry Smitheach global or outer Newton iteration a sequence of subsidiary inner
7f296bb3SBarry Smithiterations of a linear solver is performed. Appropriate control of the
7f296bb3SBarry Smithaccuracy to which the subsidiary iterative method solves the Newton
7f296bb3SBarry Smithsystem at each global iteration is critical, since these inner
7f296bb3SBarry Smithiterations determine the asymptotic convergence rate for inexact Newton
7f296bb3SBarry Smithtechniques. While the Newton systems must be solved well enough to
7f296bb3SBarry Smithretain fast local convergence of the Newton’s iterates, use of excessive
7f296bb3SBarry Smithinner iterations, particularly when $\| \mathbf{x}_k - \mathbf{x}_* \|$ is large,
7f296bb3SBarry Smithis neither necessary nor economical. Thus, the number of required inner
7f296bb3SBarry Smithiterations typically increases as the Newton process progresses, so that
7f296bb3SBarry Smiththe truncated iterates approach the true Newton iterates.
7f296bb3SBarry Smith
7f296bb3SBarry SmithA sequence of nonnegative numbers $\{\eta_k\}$ can be used to
7f296bb3SBarry Smithindicate the variable convergence criterion. In this case, when solving
7f296bb3SBarry Smitha system of nonlinear equations, the update step of the Newton process
7f296bb3SBarry Smithremains unchanged, and direct solution of the linear system is replaced
7f296bb3SBarry Smithby iteration on the system until the residuals
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{r}_k^{(i)} =  \mathbf{F}'(\mathbf{x}_k) \Delta \mathbf{x}_k + \mathbf{F}(\mathbf{x}_k)
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithsatisfy
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\frac{ \| \mathbf{r}_k^{(i)} \| }{ \| \mathbf{F}(\mathbf{x}_k) \| } \leq \eta_k \leq \eta < 1.
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithHere $\mathbf{x}_0$ is an initial approximation of the solution, and
7f296bb3SBarry Smith$\| \cdot \|$ denotes an arbitrary norm in $\Re^n$ .
7f296bb3SBarry Smith
7f296bb3SBarry SmithBy default a constant relative convergence tolerance is used for solving
7f296bb3SBarry Smiththe subsidiary linear systems within the Newton-like methods of
7f296bb3SBarry Smith`SNES`. When solving a system of nonlinear equations, one can instead
7f296bb3SBarry Smithemploy the techniques of Eisenstat and Walker {cite}`ew96`
7f296bb3SBarry Smithto compute $\eta_k$ at each step of the nonlinear solver by using
7f296bb3SBarry Smiththe option `-snes_ksp_ew` . In addition, by adding one’s own
7f296bb3SBarry Smith`KSP` convergence test (see {any}`sec_convergencetests`), one can easily create one’s own,
7f296bb3SBarry Smithproblem-dependent, inner convergence tests.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_nlmatrixfree)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith## Matrix-Free Methods
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe `SNES` class fully supports matrix-free methods. The matrices
7f296bb3SBarry Smithspecified in the Jacobian evaluation routine need not be conventional
7f296bb3SBarry Smithmatrices; instead, they can point to the data required to implement a
7f296bb3SBarry Smithparticular matrix-free method. The matrix-free variant is allowed *only*
7f296bb3SBarry Smithwhen the linear systems are solved by an iterative method in combination
7f296bb3SBarry Smithwith no preconditioning (`PCNONE` or `-pc_type` `none`), a
7addb90fSBarry Smithuser-provided matrix from which to construct the preconditioner, or a user-provided preconditioner
7f296bb3SBarry Smithshell (`PCSHELL`, discussed in {any}`sec_pc`); that
7f296bb3SBarry Smithis, obviously matrix-free methods cannot be used with a direct solver,
7f296bb3SBarry Smithapproximate factorization, or other preconditioner which requires access
7f296bb3SBarry Smithto explicit matrix entries.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe user can create a matrix-free context for use within `SNES` with
7f296bb3SBarry Smiththe routine
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithMatCreateSNESMF(SNES snes, Mat *mat);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis routine creates the data structures needed for the matrix-vector
7f296bb3SBarry Smithproducts that arise within Krylov space iterative
7f296bb3SBarry Smithmethods {cite}`brownsaad:90`.
7f296bb3SBarry SmithThe default `SNES`
7f296bb3SBarry Smithmatrix-free approximations can also be invoked with the command
7f296bb3SBarry Smith`-snes_mf`. Or, one can retain the user-provided Jacobian
7f296bb3SBarry Smithpreconditioner, but replace the user-provided Jacobian matrix with the
7f296bb3SBarry Smithdefault matrix-free variant with the option `-snes_mf_operator`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith`MatCreateSNESMF()` uses
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithMatCreateMFFD(Vec x, Mat *mat);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhich can also be used directly for users who need a matrix-free matrix but are not using `SNES`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe user can set one parameter to control the Jacobian-vector product
7f296bb3SBarry Smithapproximation with the command
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithMatMFFDSetFunctionError(Mat mat, PetscReal rerror);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe parameter `rerror` should be set to the square root of the
7f296bb3SBarry Smithrelative error in the function evaluations, $e_{rel}$; the default
7f296bb3SBarry Smithis the square root of machine epsilon (about $10^{-8}$ in double
7f296bb3SBarry Smithprecision), which assumes that the functions are evaluated to full
7f296bb3SBarry Smithfloating-point precision accuracy. This parameter can also be set from
7f296bb3SBarry Smiththe options database with `-mat_mffd_err <err>`
7f296bb3SBarry Smith
7f296bb3SBarry SmithIn addition, PETSc provides ways to register new routines to compute
7f296bb3SBarry Smiththe differencing parameter ($h$); see the manual page for
7f296bb3SBarry Smith`MatMFFDSetType()` and `MatMFFDRegister()`. We currently provide two
7f296bb3SBarry Smithdefault routines accessible via `-mat_mffd_type <ds or wp>`. For
7f296bb3SBarry Smiththe default approach there is one “tuning” parameter, set with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithMatMFFDDSSetUmin(Mat mat, PetscReal umin);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis parameter, `umin` (or $u_{min}$), is a bit involved; its
7f296bb3SBarry Smithdefault is $10^{-6}$ . Its command line form is `-mat_mffd_umin <umin>`.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe Jacobian-vector product is approximated
7f296bb3SBarry Smithvia the formula
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry SmithF'(u) a \approx \frac{F(u + h*a) - F(u)}{h}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $h$ is computed via
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smithh = e_{\text{rel}} \cdot \begin{cases}
7f296bb3SBarry Smithu^{T}a/\lVert a \rVert^2_2                                 & \text{if $|u^T a| > u_{\min} \lVert a \rVert_{1}$} \\
7f296bb3SBarry Smithu_{\min} \operatorname{sign}(u^{T}a) \lVert a \rVert_{1}/\lVert a\rVert^2_2  & \text{otherwise}.
7f296bb3SBarry Smith\end{cases}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis approach is taken from Brown and Saad
7f296bb3SBarry Smith{cite}`brownsaad:90`. The second approach, taken from Walker and Pernice,
7f296bb3SBarry Smith{cite}`pw98`, computes $h$ via
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\begin{aligned}
7f296bb3SBarry Smith        h = \frac{\sqrt{1 + ||u||}e_{rel}}{||a||}\end{aligned}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithThis has no tunable parameters, but note that inside the nonlinear solve
7f296bb3SBarry Smithfor the entire *linear* iterative process $u$ does not change
7f296bb3SBarry Smithhence $\sqrt{1 + ||u||}$ need be computed only once. This
7f296bb3SBarry Smithinformation may be set with the options
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
4558fef0SBarry SmithMatMFFDWPSetComputeNormU(Mat, PetscBool);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithor `-mat_mffd_compute_normu <true or false>`. This information is used
7f296bb3SBarry Smithto eliminate the redundant computation of these parameters, therefore
7f296bb3SBarry Smithreducing the number of collective operations and improving the
7f296bb3SBarry Smithefficiency of the application code. This takes place automatically for the PETSc GMRES solver with left preconditioning.
7f296bb3SBarry Smith
7f296bb3SBarry SmithIt is also possible to monitor the differencing parameters h that are
7f296bb3SBarry Smithcomputed via the routines
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithMatMFFDSetHHistory(Mat, PetscScalar *, int);
7f296bb3SBarry SmithMatMFFDResetHHistory(Mat, PetscScalar *, int);
7f296bb3SBarry SmithMatMFFDGetH(Mat, PetscScalar *);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithWe include an explicit example of using matrix-free methods in {any}`ex3.c <snes_ex3>`.
7f296bb3SBarry SmithNote that by using the option `-snes_mf` one can
7f296bb3SBarry Smitheasily convert any `SNES` code to use a matrix-free Newton-Krylov
7f296bb3SBarry Smithmethod without a preconditioner. As shown in this example,
7f296bb3SBarry Smith`SNESSetFromOptions()` must be called *after* `SNESSetJacobian()` to
7f296bb3SBarry Smithenable runtime switching between the user-specified Jacobian and the
7f296bb3SBarry Smithdefault `SNES` matrix-free form.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(snes_ex3)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith:::{admonition} Listing: `src/snes/tutorials/ex3.c`
7f296bb3SBarry Smith```{literalinclude} /../src/snes/tutorials/ex3.c
7f296bb3SBarry Smith:end-before: /*TEST
7f296bb3SBarry Smith```
7f296bb3SBarry Smith:::
7f296bb3SBarry Smith
7f296bb3SBarry SmithTable {any}`tab-jacobians` summarizes the various matrix situations
7f296bb3SBarry Smiththat `SNES` supports. In particular, different linear system matrices
7f296bb3SBarry Smithand preconditioning matrices are allowed, as well as both matrix-free
7f296bb3SBarry Smithand application-provided preconditioners. If {any}`ex3.c <snes_ex3>` is run with
7f296bb3SBarry Smiththe options `-snes_mf` and `-user_precond` then it uses a
7f296bb3SBarry Smithmatrix-free application of the matrix-vector multiple and a user
7f296bb3SBarry Smithprovided matrix-free Jacobian.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```{eval-rst}
7f296bb3SBarry Smith.. list-table:: Jacobian Options
7f296bb3SBarry Smith   :name: tab-jacobians
7f296bb3SBarry Smith
7f296bb3SBarry Smith   * - Matrix Use
7f296bb3SBarry Smith     - Conventional Matrix Formats
7f296bb3SBarry Smith     - Matrix-free versions
7f296bb3SBarry Smith   * - Jacobian Matrix
7f296bb3SBarry Smith     - Create matrix with ``MatCreate()``:math:`^*`.  Assemble matrix with user-defined routine :math:`^\dagger`
7f296bb3SBarry Smith     - Create matrix with ``MatCreateShell()``.  Use ``MatShellSetOperation()`` to set various matrix actions, or use ``MatCreateMFFD()`` or ``MatCreateSNESMF()``.
7addb90fSBarry Smith   * - Matrix used to construct the preconditioner
7f296bb3SBarry Smith     - Create matrix with ``MatCreate()``:math:`^*`.  Assemble matrix with user-defined routine :math:`^\dagger`
7f296bb3SBarry Smith     - Use ``SNESGetKSP()`` and ``KSPGetPC()`` to access the ``PC``, then use ``PCSetType(pc, PCSHELL)`` followed by ``PCShellSetApply()``.
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith$^*$ Use either the generic `MatCreate()` or a format-specific variant such as `MatCreateAIJ()`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith$^\dagger$ Set user-defined matrix formation routine with `SNESSetJacobian()` or with a `DM` variant such as `DMDASNESSetJacobianLocal()`
7f296bb3SBarry Smith
7f296bb3SBarry SmithSNES also provides some less well-integrated code to apply matrix-free finite differencing using an automatically computed measurement of the
7f296bb3SBarry Smithnoise of the functions. This can be selected with `-snes_mf_version 2`; it does not use `MatCreateMFFD()` but has similar options that start with
7f296bb3SBarry Smith`-snes_mf_` instead of `-mat_mffd_`. Note that this alternative prefix **only** works for version 2 differencing.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_fdmatrix)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith## Finite Difference Jacobian Approximations
7f296bb3SBarry Smith
7f296bb3SBarry SmithPETSc provides some tools to help approximate the Jacobian matrices
7f296bb3SBarry Smithefficiently via finite differences. These tools are intended for use in
7f296bb3SBarry Smithcertain situations where one is unable to compute Jacobian matrices
7f296bb3SBarry Smithanalytically, and matrix-free methods do not work well without a
7f296bb3SBarry Smithpreconditioner, due to very poor conditioning. The approximation
7f296bb3SBarry Smithrequires several steps:
7f296bb3SBarry Smith
7f296bb3SBarry Smith- First, one colors the columns of the (not yet built) Jacobian matrix,
7f296bb3SBarry Smith  so that columns of the same color do not share any common rows.
7f296bb3SBarry Smith- Next, one creates a `MatFDColoring` data structure that will be
7f296bb3SBarry Smith  used later in actually computing the Jacobian.
7f296bb3SBarry Smith- Finally, one tells the nonlinear solvers of `SNES` to use the
7f296bb3SBarry Smith  `SNESComputeJacobianDefaultColor()` routine to compute the
7f296bb3SBarry Smith  Jacobians.
7f296bb3SBarry Smith
7f296bb3SBarry SmithA code fragment that demonstrates this process is given below.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithISColoring    iscoloring;
7f296bb3SBarry SmithMatFDColoring fdcoloring;
7f296bb3SBarry SmithMatColoring   coloring;
7f296bb3SBarry Smith
7f296bb3SBarry Smith/*
7f296bb3SBarry Smith  This initializes the nonzero structure of the Jacobian. This is artificial
7f296bb3SBarry Smith  because clearly if we had a routine to compute the Jacobian we wouldn't
7f296bb3SBarry Smith  need to use finite differences.
7f296bb3SBarry Smith*/
7f296bb3SBarry SmithFormJacobian(snes, x, &J, &J, &user);
7f296bb3SBarry Smith
7f296bb3SBarry Smith/*
7f296bb3SBarry Smith   Color the matrix, i.e. determine groups of columns that share no common
7f296bb3SBarry Smith  rows. These columns in the Jacobian can all be computed simultaneously.
7f296bb3SBarry Smith*/
7f296bb3SBarry SmithMatColoringCreate(J, &coloring);
7f296bb3SBarry SmithMatColoringSetType(coloring, MATCOLORINGSL);
7f296bb3SBarry SmithMatColoringSetFromOptions(coloring);
7f296bb3SBarry SmithMatColoringApply(coloring, &iscoloring);
7f296bb3SBarry SmithMatColoringDestroy(&coloring);
7f296bb3SBarry Smith/*
7f296bb3SBarry Smith   Create the data structure that SNESComputeJacobianDefaultColor() uses
7f296bb3SBarry Smith   to compute the actual Jacobians via finite differences.
7f296bb3SBarry Smith*/
7f296bb3SBarry SmithMatFDColoringCreate(J, iscoloring, &fdcoloring);
7f296bb3SBarry SmithISColoringDestroy(&iscoloring);
2ba42892SBarry SmithMatFDColoringSetFunction(fdcoloring, (MatFDColoringFn *)FormFunction, &user);
7f296bb3SBarry SmithMatFDColoringSetFromOptions(fdcoloring);
7f296bb3SBarry Smith
7f296bb3SBarry Smith/*
7f296bb3SBarry Smith  Tell SNES to use the routine SNESComputeJacobianDefaultColor()
7f296bb3SBarry Smith  to compute Jacobians.
7f296bb3SBarry Smith*/
7f296bb3SBarry SmithSNESSetJacobian(snes, J, J, SNESComputeJacobianDefaultColor, fdcoloring);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithOf course, we are cheating a bit. If we do not have an analytic formula
7f296bb3SBarry Smithfor computing the Jacobian, then how do we know what its nonzero
7f296bb3SBarry Smithstructure is so that it may be colored? Determining the structure is
7f296bb3SBarry Smithproblem dependent, but fortunately, for most structured grid problems
7f296bb3SBarry Smith(the class of problems for which PETSc was originally designed) if one
7f296bb3SBarry Smithknows the stencil used for the nonlinear function one can usually fairly
7f296bb3SBarry Smitheasily obtain an estimate of the location of nonzeros in the matrix.
7f296bb3SBarry SmithThis is harder in the unstructured case, but one typically knows where the nonzero entries are from the mesh topology and distribution of degrees of freedom.
7f296bb3SBarry SmithIf using `DMPlex` ({any}`ch_unstructured`) for unstructured meshes, the nonzero locations will be identified in `DMCreateMatrix()` and the procedure above can be used.
7f296bb3SBarry SmithMost external packages for unstructured meshes have similar functionality.
7f296bb3SBarry Smith
7f296bb3SBarry SmithOne need not necessarily use a `MatColoring` object to determine a
7f296bb3SBarry Smithcoloring. For example, if a grid can be colored directly (without using
7f296bb3SBarry Smiththe associated matrix), then that coloring can be provided to
7f296bb3SBarry Smith`MatFDColoringCreate()`. Note that the user must always preset the
7f296bb3SBarry Smithnonzero structure in the matrix regardless of which coloring routine is
7f296bb3SBarry Smithused.
7f296bb3SBarry Smith
7f296bb3SBarry SmithPETSc provides the following coloring algorithms, which can be selected using `MatColoringSetType()` or via the command line argument `-mat_coloring_type`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```{eval-rst}
7f296bb3SBarry Smith.. list-table::
7f296bb3SBarry Smith   :header-rows: 1
7f296bb3SBarry Smith
7f296bb3SBarry Smith   * - Algorithm
7f296bb3SBarry Smith     - ``MatColoringType``
7f296bb3SBarry Smith     - ``-mat_coloring_type``
7f296bb3SBarry Smith     - Parallel
7f296bb3SBarry Smith   * - smallest-last :cite:`more84`
7f296bb3SBarry Smith     - ``MATCOLORINGSL``
7f296bb3SBarry Smith     - ``sl``
7f296bb3SBarry Smith     - No
7f296bb3SBarry Smith   * - largest-first :cite:`more84`
7f296bb3SBarry Smith     - ``MATCOLORINGLF``
7f296bb3SBarry Smith     - ``lf``
7f296bb3SBarry Smith     - No
7f296bb3SBarry Smith   * - incidence-degree :cite:`more84`
7f296bb3SBarry Smith     - ``MATCOLORINGID``
7f296bb3SBarry Smith     - ``id``
7f296bb3SBarry Smith     - No
7f296bb3SBarry Smith   * - Jones-Plassmann :cite:`jp:pcolor`
7f296bb3SBarry Smith     - ``MATCOLORINGJP``
7f296bb3SBarry Smith     - ``jp``
7f296bb3SBarry Smith     - Yes
7f296bb3SBarry Smith   * - Greedy
7f296bb3SBarry Smith     - ``MATCOLORINGGREEDY``
7f296bb3SBarry Smith     - ``greedy``
7f296bb3SBarry Smith     - Yes
7f296bb3SBarry Smith   * - Natural (1 color per column)
7f296bb3SBarry Smith     - ``MATCOLORINGNATURAL``
7f296bb3SBarry Smith     - ``natural``
7f296bb3SBarry Smith     - Yes
7f296bb3SBarry Smith   * - Power (:math:`A^k` followed by 1-coloring)
7f296bb3SBarry Smith     - ``MATCOLORINGPOWER``
7f296bb3SBarry Smith     - ``power``
7f296bb3SBarry Smith     - Yes
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithAs for the matrix-free computation of Jacobians ({any}`sec_nlmatrixfree`), two parameters affect the accuracy of the
7f296bb3SBarry Smithfinite difference Jacobian approximation. These are set with the command
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithMatFDColoringSetParameters(MatFDColoring fdcoloring, PetscReal rerror, PetscReal umin);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe parameter `rerror` is the square root of the relative error in the
7f296bb3SBarry Smithfunction evaluations, $e_{rel}$; the default is the square root of
7f296bb3SBarry Smithmachine epsilon (about $10^{-8}$ in double precision), which
7f296bb3SBarry Smithassumes that the functions are evaluated approximately to floating-point
7f296bb3SBarry Smithprecision accuracy. The second parameter, `umin`, is a bit more
3ce01cb7SJose E. Romaninvolved; its default is $10^{-6}$. Column $i$ of the
7f296bb3SBarry SmithJacobian matrix (denoted by $F_{:i}$) is approximated by the
7f296bb3SBarry Smithformula
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry SmithF'_{:i} \approx \frac{F(u + h*dx_{i}) - F(u)}{h}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhere $h$ is computed via:
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smithh = e_{\text{rel}} \cdot \begin{cases}
7f296bb3SBarry Smithu_{i}             &    \text{if $|u_{i}| > u_{\min}$} \\
7f296bb3SBarry Smithu_{\min} \cdot \operatorname{sign}(u_{i})  & \text{otherwise}.
7f296bb3SBarry Smith\end{cases}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithfor `MATMFFD_DS` or:
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
3ce01cb7SJose E. Romanh = e_{\text{rel}} \sqrt{\|u\|}
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithfor `MATMFFD_WP` (default). These parameters may be set from the options
7f296bb3SBarry Smithdatabase with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry Smith-mat_fd_coloring_err <err>
7f296bb3SBarry Smith-mat_fd_coloring_umin <umin>
7f296bb3SBarry Smith-mat_fd_type <htype>
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithNote that `MatColoring` type `MATCOLORINGSL`, `MATCOLORINGLF`, and
7f296bb3SBarry Smith`MATCOLORINGID` are sequential algorithms. `MATCOLORINGJP` and
7f296bb3SBarry Smith`MATCOLORINGGREEDY` are parallel algorithms, although in practice they
7f296bb3SBarry Smithmay create more colors than the sequential algorithms. If one computes
7f296bb3SBarry Smiththe coloring `iscoloring` reasonably with a parallel algorithm or by
7f296bb3SBarry Smithknowledge of the discretization, the routine `MatFDColoringCreate()`
7f296bb3SBarry Smithis scalable. An example of this for 2D distributed arrays is given below
7f296bb3SBarry Smiththat uses the utility routine `DMCreateColoring()`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
b5ef2b50SBarry SmithDMCreateColoring(dm, IS_COLORING_GHOSTED, &iscoloring);
7f296bb3SBarry SmithMatFDColoringCreate(J, iscoloring, &fdcoloring);
7f296bb3SBarry SmithMatFDColoringSetFromOptions(fdcoloring);
7f296bb3SBarry SmithISColoringDestroy(&iscoloring);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithNote that the routine `MatFDColoringCreate()` currently is only
7f296bb3SBarry Smithsupported for the AIJ and BAIJ matrix formats.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_vi)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith## Variational Inequalities
7f296bb3SBarry Smith
7f296bb3SBarry Smith`SNES` can also solve (differential) variational inequalities with box (bound) constraints.
7f296bb3SBarry SmithThese are nonlinear algebraic systems with additional inequality
7f296bb3SBarry Smithconstraints on some or all of the variables:
7f296bb3SBarry Smith$L_i \le u_i \le H_i$. For example, the pressure variable cannot be negative.
7f296bb3SBarry SmithSome, or all, of the lower bounds may be
7f296bb3SBarry Smithnegative infinity (indicated to PETSc with `SNES_VI_NINF`) and some, or
7f296bb3SBarry Smithall, of the upper bounds may be infinity (indicated by `SNES_VI_INF`).
7f296bb3SBarry SmithThe commands
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
4558fef0SBarry SmithSNESVISetVariableBounds(SNES snes, Vec L, Vec H);
7f296bb3SBarry SmithSNESVISetComputeVariableBounds(SNES snes, PetscErrorCode (*compute)(SNES, Vec, Vec))
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithare used to indicate that one is solving a variational inequality. Problems with box constraints can be solved with
7f296bb3SBarry Smiththe reduced space, `SNESVINEWTONRSLS`, and semi-smooth `SNESVINEWTONSSLS` solvers.
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe
7f296bb3SBarry Smithoption `-snes_vi_monitor` turns on extra monitoring of the active set
7f296bb3SBarry Smithassociated with the bounds and `-snes_vi_type` allows selecting from
7f296bb3SBarry Smithseveral VI solvers, the default is preferred.
7f296bb3SBarry Smith
7f296bb3SBarry Smith`SNESLineSearchSetPreCheck()` and `SNESLineSearchSetPostCheck()` can also be used to control properties
7f296bb3SBarry Smithof the steps selected by `SNES`.
7f296bb3SBarry Smith
7f296bb3SBarry Smith(sec_snespc)=
7f296bb3SBarry Smith
7f296bb3SBarry Smith## Nonlinear Preconditioning
7f296bb3SBarry Smith
7f296bb3SBarry SmithThe mathematical framework of nonlinear preconditioning is explained in detail in {cite}`bruneknepleysmithtu15`.
7f296bb3SBarry SmithNonlinear preconditioning in PETSc involves the use of an inner `SNES`
7f296bb3SBarry Smithinstance to define the step for an outer `SNES` instance. The inner
7f296bb3SBarry Smithinstance may be extracted using
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESGetNPC(SNES snes, SNES *npc);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithand passed run-time options using the `-npc_` prefix. Nonlinear
7f296bb3SBarry Smithpreconditioning comes in two flavors: left and right. The side may be
7f296bb3SBarry Smithchanged using `-snes_npc_side` or `SNESSetNPCSide()`. Left nonlinear
7f296bb3SBarry Smithpreconditioning redefines the nonlinear function as the action of the
7f296bb3SBarry Smithnonlinear preconditioner $\mathbf{M}$;
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{F}_{M}(x) = \mathbf{M}(\mathbf{x},\mathbf{b}) - \mathbf{x}.
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry SmithRight nonlinear preconditioning redefines the nonlinear function as the
7f296bb3SBarry Smithfunction on the action of the nonlinear preconditioner;
7f296bb3SBarry Smith
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith\mathbf{F}(\mathbf{M}(\mathbf{x},\mathbf{b})) = \mathbf{b},
7f296bb3SBarry Smith$$
7f296bb3SBarry Smith
7f296bb3SBarry Smithwhich can be interpreted as putting the preconditioner into “striking
7f296bb3SBarry Smithdistance” of the solution by outer acceleration.
7f296bb3SBarry Smith
7f296bb3SBarry SmithIn addition, basic patterns of solver composition are available with the
7f296bb3SBarry Smith`SNESType` `SNESCOMPOSITE`. This allows for two or more `SNES`
7f296bb3SBarry Smithinstances to be combined additively or multiplicatively. By command
7f296bb3SBarry Smithline, a set of `SNES` types may be given by comma separated list
7f296bb3SBarry Smithargument to `-snes_composite_sneses`. There are additive
7f296bb3SBarry Smith(`SNES_COMPOSITE_ADDITIVE`), additive with optimal damping
7f296bb3SBarry Smith(`SNES_COMPOSITE_ADDITIVEOPTIMAL`), and multiplicative
7f296bb3SBarry Smith(`SNES_COMPOSITE_MULTIPLICATIVE`) variants which may be set with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESCompositeSetType(SNES, SNESCompositeType);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry SmithNew subsolvers may be added to the composite solver with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESCompositeAddSNES(SNES, SNESType);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smithand accessed with
7f296bb3SBarry Smith
7f296bb3SBarry Smith```
7f296bb3SBarry SmithSNESCompositeGetSNES(SNES, PetscInt, SNES *);
7f296bb3SBarry Smith```
7f296bb3SBarry Smith
7f296bb3SBarry Smith```{eval-rst}
7f296bb3SBarry Smith.. bibliography:: /petsc.bib
7f296bb3SBarry Smith   :filter: docname in docnames
7f296bb3SBarry Smith```