Gaussian Measures on Function Spaces
Why Infinite-Dimensional Priors?
The discretized Bayesian formulation of the previous sections works on a fixed grid of pixels. But consider what happens as we refine the grid: . A prior with assigns variance per pixel, so the total prior energy . The prior is not discretization-invariant — its properties depend on the grid.
Gaussian measures on function spaces provide the rigorous framework for priors that are consistent across grid refinements. This section provides the theoretical foundation; it can be skimmed on first reading and returned to when working with continuum imaging problems.
Definition: Gaussian Measure on a Hilbert Space
Gaussian Measure on a Hilbert Space
A Gaussian measure on a separable Hilbert space is a probability measure such that every continuous linear functional has a Gaussian distribution:
where is the mean and is a symmetric, positive, trace-class operator (the covariance).
The trace-class requirement ensures that draws from have finite expected norm:
This is the infinite-dimensional analogue of requiring a finite covariance matrix.
Trace Class Means Finite Expected Norm
An operator is trace class if and only if where are its eigenvalues (in decreasing order). For the Whittle-Matérn covariance operator on ,
the eigenvalues decay as . The trace-class condition requires , i.e., . In 2D (), we need ; the Matérn-3/2 kernel () is the minimal choice satisfying this.
Naive covariances like (constant times identity) are not trace class in infinite dimensions — this is why the naive fails as .
Definition: Karhunen-Loève Expansion
Karhunen-Loève Expansion
Let with eigenpairs satisfying . The Karhunen-Loève (KL) expansion represents draws from as
This series converges in (in mean-square sense) precisely when is trace class: .
In practice, the KL expansion is truncated to the leading terms: giving a low-dimensional representation that captures most of the prior energy (since rapidly for smooth priors).
Theorem: Cameron-Martin Theorem
Let be a Gaussian measure on and let . The translated measure is absolutely continuous with respect to (written ) if and only if belongs to the Cameron-Martin space
In that case, the Radon-Nikodym derivative is
Sketch via Karhunen-Loève expansion
In the KL basis, with . The translation corresponds to shifting by where .
By the Cameron-Martin theorem for sequences of Gaussians, this shift yields an absolutely continuous measure if and only if , which is exactly .
The Radon-Nikodym derivative follows from the product formula for Gaussian likelihood ratios: .
Theorem: Stuart's Well-Posedness Theorem
Under the following conditions:
- The prior is with trace class.
- The forward operator is bounded.
- The noise .
The posterior measure (as a measure on ) satisfies:
- Existence: is well-defined and absolutely continuous with respect to .
- Uniqueness: The posterior is the unique measure with Radon-Nikodym derivative proportional to the likelihood: .
- Stability: depends continuously on in the Hellinger metric: for any there exists such that .
Existence via Bayes formula for measures
The unnormalized posterior density with respect to is , which is bounded by 1. Therefore .
Positivity: since is bounded and has full support on , . Hence is a well-defined Radon-Nikodym derivative.
Stability via Hellinger metric
The Hellinger distance satisfies . Continuity of in and dominated convergence give the result.
Discretization-Invariant Algorithms
The Gaussian measure framework motivates discretization-invariant algorithms: methods whose performance does not degrade as the mesh is refined.
The key insight: naive random-walk Metropolis proposals , achieve optimal acceptance rate only when — making effective step sizes vanish as .
The preconditioned Crank-Nicolson (pCN) proposal respects the prior covariance:
This proposal preserves : if then . The acceptance probability in pCN involves only the likelihood ratio (the prior cancels), giving dimension-independent acceptance rates of .
Choosing the Prior Covariance for RF Imaging
In practice, the Gaussian prior covariance must be chosen based on domain knowledge about the scene. For RF imaging:
- Whittle-Matérn: with length scale and smoothness . For point-target scenes, (exponential covariance, draws) is appropriate; for extended objects, (Matérn-3/2, draws) or higher.
- Length scale : Should match the expected spatial extent of reflectors. For radar at 77 GHz, is calibrated to the range/cross-range resolution.
- Tensor-product structure: For a 2D scene, the KL expansion of can be computed independently in each dimension, reducing to .
Key constraint: For realistic scenes (), computing the full posterior covariance requires operations — infeasible. Low-rank approximations (§Uncertainty Quantification) or MCMC ([?s05:alg-pcn]) are required.
- •
scene: full posterior covariance computation requires TB memory
- •
Low-rank approximation with eigenmodes reduces to MB and FLOP
- •
pCN sampler scales to at per sample when has fast matrix-vector products
Common Mistake: Grid-Dependent Prior Hyperparameters
Mistake:
When refining the pixel grid from to , keeping the prior variance fixed gives a different effective prior — the coarser grid has fewer pixels so less total prior energy.
Correction:
Use a continuum-consistent prior (e.g., Whittle-Matérn) where the length scale and smoothness are physical parameters independent of grid resolution. When discretizing, scale the covariance matrix by the grid spacing to maintain the continuous-limit behavior: where is the spatial dimension.
Historical Note: From Wiener Measure to Modern Bayesian Imaging
1923-2010The theory of Gaussian measures on infinite-dimensional spaces traces back to Norbert Wiener's 1923 construction of Brownian motion as a measure on the space of continuous functions — the first rigorous infinite-dimensional Gaussian measure. Irving Segal, Leonard Gross, and others developed the abstract framework through the 1950s-70s.
The systematic application of Gaussian measures to Bayesian inverse problems was synthesized by Andrew Stuart's landmark 2010 paper "Inverse Problems: A Bayesian Perspective" in Acta Numerica. Stuart unified the finite-dimensional and infinite-dimensional theories, proving well-posedness and stability results that provided the theoretical foundation for the now-thriving field of Bayesian imaging. The Cameron-Martin theorem — originally proved by Robert H. Cameron and William T. Martin in 1944 for Wiener measure — plays a central role in characterizing which MAP estimates are meaningful as elements of function space.
Key Takeaway
-
Gaussian measures on Hilbert spaces provide discretization-invariant priors for infinite-dimensional Bayesian inverse problems.
-
The covariance operator must be trace class () for the prior to assign finite expected norm to draws — the Whittle-Matérn family satisfies this for .
-
The Karhunen-Loève expansion provides a countable representation of draws from a Gaussian measure in terms of i.i.d. standard Gaussians.
-
The Cameron-Martin theorem characterizes when translated Gaussian measures remain absolutely continuous: the shift must lie in .
-
Stuart's theorem guarantees existence, uniqueness, and stability of the posterior measure under mild conditions on the forward operator.
-
Discretization-invariant algorithms (pCN sampler) exploit the Gaussian measure structure to achieve mesh-independent performance.
Trace-class operator
A compact operator is trace class if , where are its eigenvalues in decreasing order. Trace-class covariance operators define valid Gaussian measures on infinite-dimensional Hilbert spaces: they ensure draws from the measure have finite expected norm .
Related: Gaussian Measure on a Hilbert Space, Cameron-Martin space
Cameron-Martin space
Given a Gaussian measure on a Hilbert space , the Cameron-Martin space is equipped with norm . It characterizes which translations of remain absolutely continuous with respect to : a shift preserves absolute continuity if and only if . For the Whittle-Matérn prior , the Cameron-Martin space is the Sobolev space .
Related: Trace-class operator, Gaussian Measure on a Hilbert Space, Sobolev Spaces