
A review of the adjoint‐state method for computing the gradient of a functional with geophysical applications
Author(s) -
Plessix R.E.
Publication year - 2006
Publication title -
geophysical journal international
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.302
H-Index - 168
eISSN - 1365-246X
pISSN - 0956-540X
DOI - 10.1111/j.1365-246x.2006.02978.x
Subject(s) - jacobian matrix and determinant , adjoint equation , mathematics , state variable , minification , gradient method , data assimilation , state (computer science) , automatic differentiation , mathematical optimization , computer science , algorithm , mathematical analysis , partial differential equation , physics , meteorology , computation , thermodynamics
SUMMARY Estimating the model parameters from measured data generally consists of minimizing an error functional. A classic technique to solve a minimization problem is to successively determine the minimum of a series of linearized problems. This formulation requires the Fréchet derivatives (the Jacobian matrix), which can be expensive to compute. If the minimization is viewed as a non‐linear optimization problem, only the gradient of the error functional is needed. This gradient can be computed without the Fréchet derivatives. In the 1970s, the adjoint‐state method was developed to efficiently compute the gradient. It is now a well‐known method in the numerical community for computing the gradient of a functional with respect to the model parameters when this functional depends on those model parameters through state variables, which are solutions of the forward problem. However, this method is less well understood in the geophysical community. The goal of this paper is to review the adjoint‐state method. The idea is to define some adjoint‐state variables that are solutions of a linear system. The adjoint‐state variables are independent of the model parameter perturbations and in a way gather the perturbations with respect to the state variables. The adjoint‐state method is efficient because only one extra linear system needs to be solved. Several applications are presented. When applied to the computation of the derivatives of the ray trajectories, the link with the propagator of the perturbed ray equation is established.