Nonlinear Inverse Problems
Beyond Linearity — When the Forward Model Is Nonlinear
So far we have treated the forward operator as linear. Many RF imaging problems, however, involve nonlinear forward models:
- Microwave tomography: The scattered field depends nonlinearly on the permittivity distribution (multiple scattering beyond the Born approximation).
- Full-waveform inversion: The wave equation maps medium parameters to recorded waveforms nonlinearly via multiple reflections.
- Phase retrieval: Only is measured (intensity-only), and the phase is lost — a fundamentally nonlinear problem.
The linearization-based approach — compute the Fréchet derivative, apply regularized linear inversion, update — is the workhorse of practical nonlinear imaging. Born iterative and distorted-Born iterative methods are the radar-specific instances.
This section is intentionally brief: most of our forward models in Parts II–III use the linearized (Born) approximation, and the full nonlinear theory is treated in Chapters 5–6 when we derive the scattering equations.
Definition: Nonlinear Inverse Problems
Nonlinear Inverse Problems
A nonlinear inverse problem seeks from the equation
where is a nonlinear operator. The Fréchet derivative at a point is the bounded linear operator satisfying
is the Jacobian of the forward operator and plays the role of in the linear theory.
Nonlinear ill-posedness adds to the linear difficulties:
- Non-uniqueness from nonlinearity (multiple solutions, local minima).
- Non-convexity of the residual functional.
- Dependence of the linearization on the current iterate.
Definition: The Gauss–Newton Method and IRGNM
The Gauss–Newton Method and IRGNM
The Gauss–Newton method linearises at the current iterate and solves the resulting linear inverse problem:
where solves the linearised problem
Since is typically ill-posed (a compact operator), the linearised problem must itself be regularised. Using Tikhonov regularisation on each linearised step gives the iteratively regularised Gauss–Newton method (IRGNM):
where is a decreasing sequence of regularisation parameters and is an initial guess (e.g., the matched-filter image).
The IRGNM is the standard method for nonlinear ill-posed problems. The decreasing allows progressively finer resolution as the iterate approaches the solution. A typical choice is with and chosen via the discrepancy principle at the first iteration.
Definition: The Levenberg–Marquardt Method
The Levenberg–Marquardt Method
The Levenberg–Marquardt method is closely related to Gauss–Newton but uses the discrepancy principle to select adaptively at each step:
Choose such that
where . This ensures that each step reduces the residual by a fixed fraction. The update is:
Levenberg–Marquardt interpolates between Gauss–Newton () and gradient descent (). Far from the solution (large residual), is large and the step is short and robust. Near the solution (small residual), is small and the method converges rapidly.
Theorem: Convergence of the IRGNM
Let be Fréchet differentiable with a Lipschitz-continuous derivative in a ball . Assume a nonlinear source condition:
for some . If with appropriate and the iteration is stopped by the discrepancy principle at , then
The convergence rate matches the linear theory. The nonlinearity is handled by the fact that the linearisation error is of higher order: , which becomes negligible as .
Error decomposition at each step
The error satisfies a recurrence involving the linearisation residual. The linearisation error is bounded by the Lipschitz constant of times .
Induction and convergence
Under the source condition and with appropriate , an induction argument shows decreases geometrically until the discrepancy stopping criterion is triggered, after which the error is .
Example: Born Iterative Method for Microwave Tomography
In microwave tomography, the forward model maps the contrast function (relative permittivity minus 1) to the scattered field via the Lippmann–Schwinger integral equation:
where is the free-space Green's function and is the total field. The nonlinearity arises because depends on through the same equation.
Describe the Born iterative method (BIM) as a special case of IRGNM.
Linearisation (Born approximation)
At each iteration, approximate the total field by the field from the previous iterate: . This linearises the forward operator:
This is a linear integral operator (compact, hence ill-posed) — exactly the type of operator from Sections 2.1–2.4.
IRGNM update
Solve the Tikhonov-regularised linear problem:
Then recompute the forward field using Maxwell's equations (or the Lippmann–Schwinger equation) with .
Convergence considerations
BIM converges when the contrast is small (weak scattering) so the Born approximation is accurate. For strong scatterers (metals, high-permittivity media), the distorted-Born iterative method (DBIM) uses the Green's function of the current background rather than the free-space , improving convergence significantly.
Computational Cost of Nonlinear Imaging
Each IRGNM iteration requires: (1) A forward solve: compute by solving Maxwell's equations (e.g., FDTD or FEM) — for 3D problems. (2) A Jacobian solve: compute and apply it — another solve per measurement. (3) A regularised linear solve: the Tikhonov normal equation at the current linearisation point.
For a grid (modest 3D imaging domain), a single forward solve takes seconds to minutes on current hardware. The IRGNM typically needs – outer iterations, each with multiple inner CG iterations.
This cost is why most operational RF imaging systems use the Born approximation (single linearisation, no iteration) or limit iteration to 3–5 steps. Full nonlinear inversion is feasible only offline (post-processing) or with GPU-accelerated solvers and reduced-order models.
Common Mistake: Nonlinear Imaging Can Converge to a Local Minimum
Mistake:
Initialising the IRGNM from a zero starting point (no prior information) for a strongly nonlinear problem, expecting global convergence.
Correction:
The IRGNM convergence theorem (TConvergence of the IRGNM) assumes that the initial guess is sufficiently close to (within the ball ). For strongly scattering media, can be very small and may not satisfy this condition.
Practical mitigation strategies:
-
Use the matched-filter (backprojection) image as — it is often close enough to the true solution for the linearisation to be valid.
-
Perform a frequency continuation sweep: start at low frequencies (large , mild nonlinearity, large convergence basin) and progressively increase frequency.
-
Use a multi-scale approach: solve a coarsely discretised problem first, then refine the grid and warm-start from the coarse solution.
Methods for Nonlinear Inverse Problems
| Method | Key Idea | Convergence Rate | Computational Cost | Best For |
|---|---|---|---|---|
| Born approximation (single) | Linearise once, no iteration | Not convergent (one step) | One forward solve + one linear solve | Weak scattering; real-time |
| Born iterative (BIM) | Iterate linearisation with free-space Green's function | Geometric (slow, weak scattering) | forward solves | Moderate contrast |
| Distorted-Born iterative (DBIM) | Iterate with background Green's function | Faster than BIM | More expensive per step (Jacobian with background) | Higher contrast |
| IRGNM (Tikhonov inner) | Tikhonov at each linearisation | optimal | forward + adjoint solves | General nonlinear ill-posed problems |
| Levenberg–Marquardt | Adaptive via discrepancy | Same as IRGNM | Similar to IRGNM | Robust to poor initialisation |
Key Takeaway
Nonlinear inverse problems arise whenever the forward model involves multiple scattering, nonlinear constitutive relations, or intensity-only measurements. The iteratively regularised Gauss–Newton method (IRGNM) is the standard approach: linearise at each step and apply Tikhonov regularisation to the linearised problem. Convergence rates match the linear theory under appropriate source conditions — the nonlinearity becomes negligible near the solution. In RF imaging, the Born iterative method is the specific instantiation of IRGNM for electromagnetic scattering, using the Lippmann–Schwinger equation as the forward model. Computational cost limits full nonlinear inversion to offline processing or GPU-accelerated implementations.