Singular Value Decomposition for Operators

The Spectral Decomposition β€” Why It Determines Reconstruction

The singular value decomposition of the forward operator A\mathcal{A} is the central tool of classical imaging theory. It answers three fundamental questions simultaneously:

  1. What does the sensor see? The right singular functions {vn}\{v_n\} are the "scene modes" β€” orthogonal components of the scene that each contribute independently to the measurements.

  2. How much does each mode contribute? The singular value Οƒn\sigma_n is the gain for mode nn. Large Οƒn\sigma_n: well observed; small Οƒn\sigma_n: barely observable.

  3. Is the inverse stable? Inverting A\mathcal{A} requires dividing by σn\sigma_n. When σn→0\sigma_n \to 0, the inverse amplifies noise without bound. The Picard condition quantifies exactly when stable inversion is possible.

This section develops the SVD for compact operators β€” the infinite-dimensional generalisation of the matrix SVD from Telecom Ch~1 (TSVD Existence Theorem).

Theorem: Spectral Theorem for Compact Self-Adjoint Operators

Let B\mathcal{B} be a compact self-adjoint operator on a separable Hilbert space H\mathcal{H}. Then:

(i) All eigenvalues of B\mathcal{B} are real.

(ii) Eigenvectors corresponding to distinct eigenvalues are orthogonal.

(iii) The non-zero eigenvalues form a countable sequence ∣λ1∣β‰₯∣λ2∣β‰₯β‹―|\lambda_1| \geq |\lambda_2| \geq \cdots with Ξ»nβ†’0\lambda_n \to 0.

(iv) There exists an orthonormal system {en}\{e_n\} of eigenvectors such that

Bf=βˆ‘n=1∞λnβ€‰βŸ¨f,enβŸ©β€‰enβˆ€β€‰f∈H.\mathcal{B}f = \sum_{n=1}^\infty \lambda_n\, \langle f, e_n \rangle\, e_n \quad \forall\, f \in \mathcal{H}.

Self-adjointness and compactness together guarantee that the spectral theory is as "clean" as in the finite-dimensional case: real eigenvalues, orthogonal eigenvectors, and a completely explicit spectral expansion. The only infinite-dimensional feature is that eigenvalues can accumulate only at zero.

,

Definition:

Singular System of a Compact Operator

Let A:Xβ†’Y\mathcal{A}: \mathcal{X} \to \mathcal{Y} be a compact linear operator between Hilbert spaces. The singular system of A\mathcal{A} is a triple {(Οƒn,vn,un)}n=1∞\{(\sigma_n, v_n, u_n)\}_{n=1}^\infty where:

  • Οƒn>0\sigma_n > 0 are the singular values (the non-zero square roots of the eigenvalues of Aβˆ—A\mathcal{A}^*\mathcal{A}), ordered Οƒ1β‰₯Οƒ2β‰₯β‹―β†’0\sigma_1 \geq \sigma_2 \geq \cdots \to 0.
  • {vn}\{v_n\} are the right singular functions in X\mathcal{X} (eigenfunctions of Aβˆ—A\mathcal{A}^*\mathcal{A}: Aβˆ—A vn=Οƒn2 vn\mathcal{A}^*\mathcal{A}\,v_n = \sigma_n^2\,v_n).
  • {un}\{u_n\} are the left singular functions in Y\mathcal{Y} (eigenfunctions of AAβˆ—\mathcal{A}\mathcal{A}^*: AAβˆ—β€‰un=Οƒn2 un\mathcal{A}\mathcal{A}^*\,u_n = \sigma_n^2\,u_n).

They satisfy the cross-relations A vn=Οƒn un\mathcal{A}\,v_n = \sigma_n\,u_n and Aβˆ—un=Οƒn vn\mathcal{A}^*u_n = \sigma_n\,v_n, and the SVD expansion

A f=βˆ‘n=1βˆžΟƒnβ€‰βŸ¨f,vn⟩X unβˆ€β€‰f∈X.\mathcal{A}\,f = \sum_{n=1}^\infty \sigma_n\, \langle f, v_n \rangle_{\mathcal{X}}\,u_n \quad \forall\, f \in \mathcal{X}.

This is the exact infinite-dimensional generalisation of the matrix SVD A=UΣVH\mathbf{A} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H. The key difference: in finite dimensions there are finitely many singular values, while a compact operator has σn→0\sigma_n \to 0. This decay to zero is the mathematical signature of ill-posedness.

,

Example: SVD of Compact Operators β€” Connection to the Matrix SVD

Illustrate the SVD of a compact operator with the simple example of the finite-rank operator A:L2([0,1])β†’CM\mathcal{A}: L^2([0,1]) \to \mathbb{C}^M defined by (Af)m=∫01Ο•m(x) f(x) dx(\mathcal{A}f)_m = \int_0^1 \phi_m(x)\,f(x)\,dx with Ο•1,…,Ο•M\phi_1, \ldots, \phi_M an orthonormal set. Find the singular system.

Example: Eigenvalue Decay of a Gaussian Convolution Operator

For the convolution operator with kernel K(rβˆ’rβ€²)=eβˆ’βˆ£rβˆ’rβ€²βˆ£2/(2Οƒ2)K(r - r') = e^{-|r - r'|^2 / (2\sigma^2)} on R\mathbb{R}, determine the singular value decay and explain why deblurring is severely ill-posed.

Theorem: Picard Condition for Solvability

Let {(Οƒn,vn,un)}\{(\sigma_n, v_n, u_n)\} be the singular system of a compact operator A:Xβ†’Y\mathcal{A}: \mathcal{X} \to \mathcal{Y} with Οƒn>0\sigma_n > 0 for all nn. Given data y∈Y\mathbf{y} \in \mathcal{Y}, the equation Af=y\mathcal{A}f = \mathbf{y} has a solution f∈Xf \in \mathcal{X} if and only if:

  1. y\mathbf{y} is orthogonal to ker⁑(Aβˆ—)\ker(\mathcal{A}^*) (the consistency condition), and

  2. The Picard condition holds:

    βˆ‘n=1∞∣⟨y,un⟩∣2Οƒn2<∞.\sum_{n=1}^{\infty} \frac{|\langle \mathbf{y}, u_n \rangle|^2} {\sigma_n^2} < \infty.

When it holds, the minimum-norm solution is

f†=βˆ‘n=1∞⟨y,unβŸ©Οƒn vn.f^\dagger = \sum_{n=1}^\infty \frac{\langle \mathbf{y}, u_n \rangle}{\sigma_n}\,v_n.

The Picard condition says that the "Fourier coefficients" of the data in the unu_n basis must decay faster than the singular values. When the Οƒn\sigma_n decay rapidly (severe ill-posedness), this is a very strong requirement: even small noise components projected onto unu_n with small nn (low singular values) would violate the condition and preclude a finite-norm solution. This is the precise mathematical statement of why ill-posed problems are unstable.

,

Singular Value Decay of Imaging Operators

The singular values Οƒn\sigma_n of imaging operators decay to zero. The decay rate determines the degree of ill-posedness: polynomial decay means "mildly ill-posed" (deconvolution), super-exponential means "severely ill-posed" (analytic continuation, back-scattering).

Observe: (a) the Picard condition requires data coefficients ∣⟨y,un⟩∣|\langle \mathbf{y}, u_n \rangle| to decay faster than Οƒn\sigma_n; (b) noise β€” which has roughly constant spectral power β€” violates the Picard condition beyond index nβˆ—n^* where Οƒnβˆ—βˆΌβˆ₯wβˆ₯\sigma_{n^*} \sim \|\mathbf{w}\|.

Parameters
64

The Picard Plot β€” Diagnosing Ill-Posedness

The Picard plot shows three curves on a log scale: (1) singular values Οƒn\sigma_n (blue, decaying), (2) exact data coefficients ∣⟨Af,un⟩∣|\langle \mathcal{A}f, u_n \rangle| (green, should decay faster than Οƒn\sigma_n), (3) noisy data coefficients ∣⟨y,un⟩∣|\langle \mathbf{y}, u_n \rangle| (red, eventually dominated by noise level Οƒ\sigma).

The index nβˆ—n^* where the red curve crosses the noise floor determines the achievable reconstruction bandwidth. Regularisation effectively truncates the SVD at nβˆ—n^*.

Parameters
20
1.5

Example: Eigenvalue Decay β€” Mildly vs. Severely Ill-Posed

Compare the singular value decay for (a) the integration operator (mildly ill-posed), (b) the Laplace transform operator (moderately ill-posed), and (c) the analytic continuation operator (severely ill-posed). Classify each according to how many terms satisfy the Picard condition at noise level Ξ΄\delta.

Eigenvalue Decay: Mildly vs. Severely Ill-Posed

Compare singular value decay rates for different operator types. The classification (mildly / moderately / severely ill-posed) determines the achievable number of reliably recoverable modes at a given noise level and, equivalently, the spatial resolution attainable in the reconstructed image.

Parameters
30

Historical Note: The SVD in Inverse Problems β€” Schmidt, Picard, and Tikhonov

1907–1963

The singular value decomposition for integral operators was developed by Erhard Schmidt (1907) and independently by Γ‰mile Picard. Schmidt's work on bilinear forms of square-integrable kernels produced what we now recognise as the infinite-dimensional SVD. Picard applied it to derive the solvability condition for Fredholm integral equations of the first kind β€” now called the Picard condition (1910).

The connection to stability and regularisation was made explicit only in the 1950s–60s. Andrei Tikhonov's 1963 paper on regularisation of ill-posed problems provided the key idea: rather than solving Af=y\mathcal{A}f = \mathbf{y} exactly (unstable), minimise βˆ₯Afβˆ’yβˆ₯2+Ξ±βˆ₯fβˆ₯2\|\mathcal{A}f - \mathbf{y}\|^2 + \alpha\|f\|^2 (stable). In the SVD basis this amounts to replacing 1/Οƒn1/\sigma_n with Οƒn/(Οƒn2+Ξ±)\sigma_n/(\sigma_n^2 + \alpha) β€” a smooth dampening of small singular values rather than the abrupt truncation of Picard.

For RF imaging the SVD of the forward operator was first computed numerically by David Slepian (1961) for the 1-D diffraction problem using prolate spheroidal wave functions β€” the singular functions that optimally concentrate spatial bandwidth. This computation revealed, for the first time, the precise number of degrees of freedom in a diffraction-limited image.

⚠️Engineering Note

Practical SVD Computation for Imaging Operators

The theoretical SVD of a compact operator is defined on an infinite-dimensional space, but in practice we work with the discrete sensing matrix A∈CMΓ—N\mathbf{A} \in \mathbb{C}^{M \times N}.

For realistic problems (e.g., 3-D SAR at 1 mm resolution: N∼106N \sim 10^6 voxels, M∼105M \sim 10^5 measurements), storing A\mathbf{A} explicitly is infeasible (101110^{11} entries). Instead, use:

  • Randomised SVD (Halko et al., 2011): computes the top-kk singular vectors in O(MNk)O(MN k) using random projections. Accuracy within Οƒk+1\sigma_{k+1} of exact. Suitable for k≀1000k \leq 1000.
  • Lanczos bidiagonalisation (Paige–Saunders LSQR): matrix-free, requires only Av\mathcal{A}\mathbf{v} and AHu\mathcal{A}^H\mathbf{u} multiplications. Standard in large-scale LASSO solvers.
  • Fourier-domain methods: when A\mathcal{A} is a convolution, its SVD is the DFT β€” no explicit computation needed.
Practical Constraints
  • β€’

    Storing the full MΓ—NM \times N matrix requires O(MN)O(MN) memory; for M=N=104M = N = 10^4 this is 1.6 GB in float32 β€” feasible. For N=106N = 10^6 this is 80 TB β€” infeasible without matrix-free methods.

  • β€’

    Randomised SVD accuracy degrades for slowly decaying singular values (mild ill-posedness); works well for exponentially decaying cases (severe ill-posedness) since few singular values matter.

  • β€’

    CUDA-accelerated NUFFT reduces the A\mathcal{A} multiplication from O(MN)O(MN) to O(M+Nlog⁑N)O(M + N\log N) when the acquisition geometry is near-uniform.

Common Mistake: SVD β‰ \neq Eigendecomposition for Non-Self-Adjoint Operators

Mistake:

Confusing the SVD of A\mathcal{A} with the eigendecomposition of A\mathcal{A}, or applying the spectral theorem (for self-adjoint operators) to a general A\mathcal{A}.

Correction:

The spectral theorem applies to compact self-adjoint operators. For a general compact operator A\mathcal{A}:

  • The SVD of A\mathcal{A} uses TWO sets of orthonormal functions ({vn}\{v_n\} in the input space, {un}\{u_n\} in the output space).
  • The eigendecomposition uses ONE set. It exists only if A\mathcal{A} is self-adjoint (or at least normal: AAβˆ—=Aβˆ—A\mathcal{A}\mathcal{A}^* = \mathcal{A}^*\mathcal{A}).
  • The forward operator A:L2(Ξ©)β†’CM\mathcal{A}: L^2(\Omega) \to \mathbb{C}^M maps between different spaces β€” it cannot even have eigenvectors in the usual sense.

The correct object is the SVD of A\mathcal{A}, or equivalently the spectral decomposition of Aβˆ—A\mathcal{A}^*\mathcal{A} (a self-adjoint operator on the input space).

Why This Matters: SVD, Degrees of Freedom, and the Resolution Limit

The number of degrees of freedom (DoF) of an imaging system is the effective rank of the sensing operator β€” the number of singular values that are non-negligible compared to the noise level Οƒ\sigma:

DoF=#{n:Οƒn>Οƒ}.\mathrm{DoF} = \#\bigl\{n : \sigma_n > \sigma\bigr\}.

For a monostatic radar with aperture DD and wavelength Ξ»\lambda illuminating a scene at range RR, the number of DoF in the cross-range direction is approximately D2/(Ξ»R)D^2 / (\lambda R) β€” the ratio of the aperture area to the diffraction-limited resolution cell area. This is the fundamental Nyquist rate for imaging, analogous to the sampling theorem for 1-D signals.

The singular functions {vn}\{v_n\} are the prolate spheroidal wave functions (PSWFs) β€” discovered by Slepian (1961) precisely by computing the SVD of the diffraction operator. The PSWFs achieve the maximum spatial concentration within the aperture footprint, making them the natural basis for band-limited imaging.

See full treatment in Chapter 7

Quick Check

The Picard condition for the equation Af=y\mathcal{A}f = \mathbf{y} to have a solution requires ___.

βˆ‘n∣⟨y,un⟩∣2/Οƒn2<∞\sum_n |\langle \mathbf{y}, u_n \rangle|^2 / \sigma_n^2 < \infty

βˆ‘n∣⟨y,un⟩∣2β‹…Οƒn2<∞\sum_n |\langle \mathbf{y}, u_n \rangle|^2 \cdot \sigma_n^2 < \infty

All singular values Οƒn\sigma_n are bounded below by a positive constant.

The operator A\mathcal{A} is self-adjoint.

Quick Check

A compact operator with exponentially decaying singular values is classified as ___.

well-posed

mildly ill-posed

moderately ill-posed

severely ill-posed

Singular system

The triple {(Οƒn,vn,un)}\{(\sigma_n, v_n, u_n)\} of singular values and singular functions of a compact operator A\mathcal{A}. Provides the complete spectral characterisation of A\mathcal{A} and the pseudoinverse A†\mathcal{A}^\dagger.

Related: Singular System of a Compact Operator, Picard Condition for Solvability

Picard condition

The condition βˆ‘n∣⟨y,un⟩∣2/Οƒn2<∞\sum_n |\langle \mathbf{y}, u_n \rangle|^2 / \sigma_n^2 < \infty necessary and sufficient for the equation Af=y\mathcal{A}f = \mathbf{y} to have a solution. Noise always violates the Picard condition eventually, motivating regularisation.

Related: Picard Condition for Solvability, Singular System of a Compact Operator

Key Takeaway

Four core messages of this section:

(1) The singular system {(Οƒn,vn,un)}\{(\sigma_n, v_n, u_n)\} provides the complete spectral picture of a compact operator: which scene modes are well-observed (Οƒn\sigma_n large) and which are invisible (Οƒnβ‰ˆ0\sigma_n \approx 0).

(2) Singular values decay to zero for compact operators. The decay rate classifies ill-posedness: polynomial = mild, sub-exponential = moderate, exponential = severe.

(3) The Picard condition says: the data coefficients must decay faster than the singular values for the inverse problem to be solvable. Noise violates this condition at high frequencies β€” the fundamental reason all imaging problems require regularisation.

(4) The number of degrees of freedom DoF∼#{n:Οƒn>Οƒ}\mathrm{DoF} \sim \#\{n : \sigma_n > \sigma\} is the effective rank at a given SNR and determines the achievable spatial resolution. This connects SVD theory directly to the resolution limit of any RF imaging system.