Uncertainty Quantification
A Reconstruction Without Uncertainty Is Incomplete
A point estimate (MAP or MMSE) tells us what we think the scene looks like, but not how confident we are. In safety-critical applications β autonomous driving radar, non-destructive testing, medical imaging β decision-makers need to know which features of the reconstruction are reliable and which are uncertain. Declaring a scatterer present when the posterior assigns only 60% probability to that pixel is fundamentally different from 99% probability.
Uncertainty quantification (UQ) extracts this confidence information from the posterior distribution, turning a Bayesian model into actionable guidance for the engineer or clinician. A reconstruction without uncertainty bars is, in this sense, scientifically incomplete.
Definition: Credible Intervals and Credible Regions
Credible Intervals and Credible Regions
A credible interval for a scalar quantity (e.g., a single pixel value) is an interval such that
Common choices:
- Highest posterior density (HPD): The shortest interval containing probability . For unimodal posteriors, it is symmetric about the mode.
- Equal-tailed: .
For the Gaussian posterior , the credible interval for pixel is
In higher dimensions, the credible region (ellipsoidal) is
Definition: Posterior Variance Map
Posterior Variance Map
The posterior variance map displays the diagonal of the posterior covariance as an image:
This pixel-wise uncertainty map reveals:
- Low-variance regions: Well-constrained by data (dense measurements, high SNR, good forward-operator coverage).
- High-variance regions: Poorly constrained (few measurements, null-space directions of , low SNR).
For the Gaussian model, , so the variance map depends on the measurement geometry through β a direct tool for optimal sensor placement (A-optimal experimental design, Eex-optimal-design).
Theorem: Posterior Contraction Rate
Under regularity conditions on the forward operator and a Gaussian prior with the true scene (the Cameron-Martin space), the posterior contracts around as noise level :
where is the Sobolev regularity of relative to the prior covariance. This rate matches the minimax-optimal rate for the corresponding deterministic inverse problem.
Spectral analysis in the SVD basis
Expand in the joint SVD of (singular values ) and eigenfunctions of (eigenvalues ). In this basis, the posterior variance in mode is
The bias in mode is . Summing over with (forward operator decay) and (prior regularity), the bias-variance trade-off gives the stated rate.
Laplace Approximation for Non-Gaussian Posteriors
For non-Gaussian priors (Laplace, horseshoe) the posterior has no closed-form covariance. The Laplace approximation fits a Gaussian to the posterior at its mode:
where is the Hessian of the negative log-posterior at the MAP estimate.
For Gaussian noise: .
Warning: The Laplace approximation systematically underestimates uncertainty in multimodal or heavy-tailed posteriors β it only sees the local curvature at the mode, missing the tails.
Definition: Metropolis-Hastings MCMC
Metropolis-Hastings MCMC
The Metropolis-Hastings (MH) algorithm generates a Markov chain with stationary distribution :
- Propose from a proposal distribution .
- Accept with probability
- Set if accepted, otherwise .
The acceptance ratio involves only the unnormalized posterior , since cancels β no partition function needed.
pCN: Preconditioned Crank-Nicolson Sampler
Complexity: per iteration where is the cost of evaluating . Total: for a dense .The pCN proposal preserves : if then . As a consequence, the acceptance rate depends only on the likelihood ratio β it is independent of the discretization dimension . In contrast, optimal random-walk MH requires step size , giving acceptance rate as .
MCMC Samplers for Imaging-Scale Posterior Inference
| Method | Gradient needed? | Dimension scaling | Best for |
|---|---|---|---|
| Random Walk MH | No | step size | Low-dim, simple posteriors |
| pCN | No | Dimension-independent | Gaussian priors, function space |
| Gibbs | No | Depends on conditionals | Conjugate hierarchical models (SBL) |
| HMC | Yes () | leapfrog steps | High-dim, smooth posteriors |
| NUTS (auto-HMC) | Yes | , auto-tuned | General-purpose; Stan/PyMC |
| Proximal MCMC (MYULA) | Yes (proximal) | per step | Non-smooth priors (TV, ) |
Scalable Uncertainty Quantification for Imaging
Computing the full posterior covariance is infeasible for imaging-scale problems (--). Scalable alternatives:
- Diagonal approximation: Compute only via Hutchinson's randomized trace estimator: with random .
- Low-rank approximation: using the leading eigenpairs of (computed via randomized SVD).
- MCMC-based: Posterior variance estimated from sample variance: .
- Bootstrap: Resample data, re-solve, use ensemble spread as uncertainty proxy.
Posterior Credible Intervals β Bayesian vs Bootstrap
This plot compares uncertainty quantification methods for a 1D imaging inverse problem .
Top panel: True signal (black), posterior mean reconstruction (blue), and credible bands (shaded). Wide bands indicate poor observability of those pixels; narrow bands indicate high confidence.
Bottom panel: Posterior standard deviation map , showing how observability depends on position through the measurement operator. Compare Bayesian credible bands with bootstrap confidence bands β Bayesian UQ correctly reflects the spatial structure of .
Parameters
MCMC Posterior Sampling for a 2D Inverse Problem
Visualize MCMC sampling on a 2D posterior arising from a simple imaging problem with two unknown pixels and three measurements.
Left panel: Posterior contours with MCMC sample trajectory overlaid. Random-walk MH shows diffusive, slow exploration; pCN shows more efficient traversal of the posterior.
Center panel: Trace plots of each coordinate showing mixing. Well-mixed chains explore the full support rapidly; slow chains exhibit long autocorrelations.
Right panel: Running posterior mean estimate with standard error bands, illustrating convergence speed for each algorithm.
Parameters
Calibration β Are Credible Intervals Trustworthy?
A posterior is well-calibrated if its credible intervals have the correct frequentist coverage:
where the probability is over repeated data realizations. Calibration can be assessed by:
- Simulation studies: Generate many pairs, compute credible intervals, and check empirical coverage vs nominal level.
- Calibration plots: Plot observed coverage vs nominal level. A well-calibrated posterior lies on the diagonal.
- CRPS (Continuous Ranked Probability Score): A proper scoring rule that jointly evaluates sharpness and calibration.
Miscalibration arises from: misspecified noise models, incorrect priors, approximate inference (Laplace approximation underestimates uncertainty in multimodal posteriors), or model mismatch (e.g., using a Gaussian prior for a clearly sparse scene).
Common Mistake: Credible Intervals Are Not Confidence Intervals
Mistake:
A Bayesian credible interval is interpreted as having a frequentist coverage probability β i.e., "in repeated experiments, the true value lies in this interval of the time."
Correction:
A credible interval means: given the observed data and the model, the posterior assigns probability to . This is a conditional probability, conditioned on . It coincides with frequentist coverage only when the prior is correct. A frequentist confidence interval , by contrast, is a random interval with the property that for all β a different statement. Both are valid uncertainty quantifiers, but they answer different questions.
Practical UQ in Deployed RF Imaging Systems
In commercially deployed radar and SAR systems, full posterior UQ is rarely implemented due to computational cost. The standard practice is:
- Matched filter + empirical noise floor: Report reconstructed reflectivity with a detection threshold based on empirical clutter statistics. No formal UQ β binary detect/non-detect.
- Sparse recovery (LASSO/OMP) + posterior linearization: Run sparse recovery, then compute the Laplace approximation covariance on the estimated support. Fast but underestimates uncertainty.
- Full Bayesian (SBL or MCMC): Deployed in high-value applications (medical imaging, subsurface sensing, ISAR tracking) where decision quality justifies the -- computational overhead vs matched filter.
The trend toward GPU-accelerated MCMC and differentiable probabilistic programming (PyMC, NumPyro) is lowering this barrier. For real-time radar ( kHz update rate), variational Bayes and approximate MCMC remain the only feasible options.
- β’
Real-time radar: ms per frame β precludes MCMC, requires LASSO or matched filter
- β’
SAR post-processing: seconds to minutes per image β SBL feasible for
- β’
ISAR target classification: minutes per target β full Bayesian with pCN viable
Key Takeaway
-
Credible intervals extract pixel-wise uncertainty from the posterior: for Gaussian posteriors, .
-
Posterior variance maps reveal which regions are well-constrained by data (near-zero variance) and which are dominated by the prior (high variance, null-space directions of ).
-
The posterior contracts at the minimax-optimal rate when the prior matches the regularity of the truth.
-
The Laplace approximation provides fast Gaussian UQ at the MAP solution but underestimates uncertainty for non-Gaussian posteriors.
-
pCN is the sampler of choice for Gaussian priors in high dimensions β dimension-independent acceptance rates via Cameron-Martin space proposals.
-
Calibration is essential: always validate that reported credible intervals achieve their nominal coverage before trusting the UQ in production.
Quick Check
What property of the pCN proposal , , makes it dimension-independent?
It uses gradient information to make directed proposals
It preserves the prior measure , so the acceptance probability depends only on the likelihood ratio
It adapts the step size automatically to the local posterior curvature
It uses a Kronecker product structure to reduce per-step cost from to
Correct. Since the pCN proposal preserves , the prior ratio cancels in the MH acceptance probability, leaving only the likelihood ratio. This ratio does not depend on the dimension of the parameter space (only on measurements), giving an acceptance rate independent of grid refinement.
Why This Matters: Uncertainty Maps for ISAC System Design
In integrated sensing and communications (ISAC) systems, the posterior variance map directly informs adaptive resource allocation: pixels with high uncertainty should receive more measurements (additional transmit beams, wider bandwidth), while confident pixels need no further sensing.
This posterior-variance-driven adaptive sensing is the Bayesian analogue of A-optimal experimental design (Eex-optimal-design) and connects to the capacity-distortion tradeoff in ISAC ([?ch34:s01]): reducing the posterior variance of the sensing channel corresponds to increasing the sensing mutual information term in the capacity-distortion region derived in Caire et al.
See full treatment in Chapter 34Ψ Section 1