DOA Estimation via Sparsity

DOA Estimation Is a Sparse-Inverse Problem

A uniform linear array with NaN_a antennas observes signals arriving from KK directions θ1,,θK\theta_1, \ldots, \theta_K. The classical subspace methods — MUSIC, ESPRIT — rely on the eigenstructure of the sample covariance and need K<NaK < N_a. They also need many snapshots to estimate the covariance reliably, and they break down at low SNR, with coherent sources, or when sources move between snapshots. The point is that the directions θk\theta_k are sparse on the angular domain: we have KK active angles in a potentially dense grid. Posing DOA as a CS problem lets us work with single snapshots, handle coherent sources, and exploit structure across frequencies.

Definition:

Sparse DOA Model on an Angular Grid

Let a(θ)CNa\mathbf{a}(\theta) \in \mathbb{C}^{N_a} denote the ULA steering vector at angle θ\theta, with [a(θ)]n=ejπnsinθ[\mathbf{a}(\theta)]_n = e^{j\pi n \sin\theta} (half-wavelength spacing). Fix a grid {θg}g=1G\{\theta_g\}_{g=1}^G on [90,90][-90^\circ, 90^\circ] and stack the steering vectors into A=[a(θ1)a(θG)]CNa×G\mathbf{A} = [\mathbf{a}(\theta_1) \cdots \mathbf{a}(\theta_G)] \in \mathbb{C}^{N_a \times G}. A TT-snapshot observation from KK sources obeys Y=AX+W,YCNa×T,\mathbf{Y} = \mathbf{A}\,\mathbf{X} + \mathbf{W},\qquad \mathbf{Y} \in \mathbb{C}^{N_a \times T}, where the rows of XCG×T\mathbf{X} \in \mathbb{C}^{G \times T} are row-sparse with exactly KK nonzero rows — one per source. The sensing matrix here is A\mathbf{A} (the steering dictionary), and GKG \gg K.

With T=1T = 1 the problem reduces to standard single-measurement CS. With T>1T > 1 we exploit the fact that all snapshots see the same sources — a joint-sparsity (MMV) problem.

Theorem: 2,1\ell_{2,1} Recovery for Joint-Sparse DOA

Consider the MMV model Y=AX+W\mathbf{Y} = \mathbf{A}\mathbf{X} + \mathbf{W} with KK nonzero rows. The 2,1\ell_{2,1} regularization X^=argminX 12YAXF2+λg=1Gxg2\hat{\mathbf{X}} = \arg\min_{\mathbf{X}}\ \tfrac{1}{2}\|\mathbf{Y} - \mathbf{A}\mathbf{X}\|_F^2 + \lambda \sum_{g=1}^G \|\mathbf{x}_g\|_2 enjoys the same RIP-based recovery guarantees as single-measurement LASSO, with the number of required snapshots replaced by effective measurements NaTN_a\cdot T in the concentration analysis. In particular, if A\mathbf{A} has RIP of order 2K2K and δ2K<21\delta_{2K} < \sqrt{2}-1, then X^XFC0WF/Na.\|\hat{\mathbf{X}} - \mathbf{X}\|_F \leq C_0\, \|\mathbf{W}\|_F / \sqrt{N_a}.

The key observation is that averaging across snapshots reduces noise by T\sqrt{T}, so joint sparsity trades snapshots for angular resolution. A single snapshot of MUSIC fails; a single snapshot of 2,1\ell_{2,1} still recovers the sources.

Definition:

Atomic Norm for Gridless DOA

Define the set of atoms A={a(θ)ejφ:θ[90,90], φ[0,2π)}.\mathcal{A} = \{\mathbf{a}(\theta)\,e^{j\varphi} : \theta \in [-90^\circ, 90^\circ],\ \varphi \in [0, 2\pi)\}. The atomic norm of a vector uCNa\mathbf{u} \in \mathbb{C}^{N_a} is uA=inf{kck:u=kcka(θk)ejφk}.\|\mathbf{u}\|_\mathcal{A} = \inf\Big\{ \sum_k |c_k| : \mathbf{u} = \sum_k c_k \mathbf{a}(\theta_k) e^{j\varphi_k}\Big\}. A gridless DOA estimator solves minu 12yu22+λuA.\min_{\mathbf{u}}\ \tfrac{1}{2}\|\mathbf{y} - \mathbf{u}\|_2^2 + \lambda\, \|\mathbf{u}\|_\mathcal{A}.

Unlike on-grid CS, atomic-norm minimization places sources at arbitrary continuous angles, eliminating basis mismatch and achieving "super-resolution" beyond the Rayleigh limit when sources are well separated.

Theorem: SDP Characterization of Atomic Norm

For uCNa\mathbf{u} \in \mathbb{C}^{N_a} and Vandermonde atoms, the atomic norm equals the optimum of uA=infT,t 12 ⁣(t+1Natr(T))s.t.(TuuHt)0,T Hermitian Toeplitz.\|\mathbf{u}\|_\mathcal{A} = \inf_{\mathbf{T},\,t}\ \tfrac{1}{2}\!\left(t + \tfrac{1}{N_a}\mathrm{tr}(\mathbf{T})\right) \quad \text{s.t.} \quad \begin{pmatrix}\mathbf{T} & \mathbf{u} \\ \mathbf{u}^H & t\end{pmatrix} \succeq 0,\quad \mathbf{T}\text{ Hermitian Toeplitz}.

A seemingly infinite-dimensional optimization over θ\theta becomes a finite semidefinite program. The Vandermonde decomposition lemma (Carathéodory) recovers the angles from the optimal T\mathbf{T}.

DOA Spectrum: Sparse Matched Filter vs MUSIC

Sparse matched-filter recovery and MUSIC both exhibit sharp peaks at true angles. Vary SNR and snapshots: MUSIC degrades when TT is small or sources coherent; sparse recovery is more robust.

Parameters
16
50
15
-25
10
35

Example: Single-Snapshot DOA Recovery

A ULA with Na=16N_a = 16 antennas observes two coherent sources at 20-20^\circ and +15+15^\circ in a single snapshot (T=1T=1). Explain why MUSIC fails but sparse recovery succeeds.

⚠️Engineering Note

Joint Communication and Sensing (JCAS)

Integrated sensing and communication (ISAC / JCAS) in 6G uses the same waveform to transmit data and estimate target angles/ranges. The sensing channel is typically sparse — few targets in a large angular-range cell. Atomic-norm DOA estimation is the algorithm of choice for off-grid targets because grid mismatch otherwise limits angular accuracy. In 3GPP RAN1 ISAC discussions, gridless estimators are proposed as a benchmark for target localisation performance.

📋 Ref: 3GPP TR 22.837 (Sensing Use Cases), TR 38.837

Common Mistake: Basis Mismatch Dominates at High SNR

Mistake:

Refining the angular grid indefinitely to resolve off-grid sources.

Correction:

As the grid GG \to \infty, the columns of A\mathbf{A} become increasingly correlated and CS recovery becomes ill-conditioned — RIP deteriorates, and 1\ell_1 fails to be selective. Grid-mismatch error Δθ2\sim \Delta\theta^2 (Malioutov et al.) eventually dominates the noise error. Use atomic-norm minimization or iterative grid refinement instead of blindly shrinking Δθ\Delta\theta.

Historical Note: From MUSIC to Atomic Norm

1986-2013

Schmidt introduced MUSIC in 1986; Roy and Kailath introduced ESPRIT in 1989. These eigenstructure methods dominated array processing for two decades. Malioutov, Cetin, and Willsky (2005) were the first to formulate DOA as a sparse recovery problem, opening the CS-for-DOA research line. Tang, Bhaskar, Shah, and Recht (2013) gave the atomic-norm framework that made gridless super-resolution rigorous, building on the continuous-time super-resolution results of Candès and Fernandez-Granda.

Atomic norm

The gauge function of the convex hull of an atomic set A\mathcal{A}. Generalises 1\ell_1 (atoms = signed standard basis) and nuclear norm (atoms = rank-1 matrices) to arbitrary parametric dictionaries — crucially, continuous ones like {a(θ)}\{\mathbf{a}(\theta)\}.

Related: Atomic Norm for Gridless DOA, SDP Characterization of Atomic Norm

Steering vector

The array response to a plane wave from angle θ\theta. For a ULA with half-wavelength spacing, [a(θ)]n=ejπnsinθ[\mathbf{a}(\theta)]_n = e^{j\pi n \sin\theta}, a Vandermonde vector.

Related: Sparse DOA Model on an Angular Grid

DOA Methods Compared

PropertyMUSIC2,1\ell_{2,1}-MMV CSAtomic Norm (Gridless)
Single-snapshotfailsworksworks
Coherent sourcesfails (subspace)worksworks
KNaK \geq N_a sourcesfailsworks if RIP holdsworks
Off-grid biasnone (continuous)O(Δθ2)O(\Delta\theta^2)none
ComplexityO(Na3)O(N_a^3) eigO(NaG)O(N_a G)/iterSDP: O(Na6.5)O(N_a^{6.5})
SNR thresholdhighmoderatelow (best)

Key Takeaway

DOA estimation is sparse recovery on an angular grid. 2,1\ell_{2,1} joint-sparsity handles multiple snapshots and coherent sources; atomic-norm minimization removes the grid altogether at the cost of solving an SDP. Both outperform classical subspace methods in the snapshot-limited, coherent-source, or low-SNR regimes that dominate ISAC and integrated sensing workloads.

Why This Matters: mmWave Beam Training

At mmWave frequencies, narrow beams must be steered toward a handful of scattering clusters. Beam training is exactly sparse DOA estimation: the UE sweeps a few directions and the BS recovers the cluster angles via on-grid or atomic-norm CS. This is why 5G NR beam-management procedures (SSB, CSI-RS) align naturally with CS dictionaries.

Quick Check

Why can 1\ell_1-based DOA estimation resolve KK coherent sources from a single snapshot while MUSIC cannot?

MUSIC needs orthogonal sources

MUSIC estimates the covariance matrix, which has rank at most equal to the number of snapshots; CS uses the snapshot directly.

CS uses more antennas

CS assumes known DOAs