LASSO and Basis Pursuit for Imaging
LASSO and Basis Pursuit for RF Imaging
This section connects the LASSO and Basis Pursuit recovery programs to practical RF image reconstruction. The algorithm derivations (ISTA/FISTA, ADMM) live in RFI Ch 04 and Telecom Ch 03; here we focus on the imaging-specific aspects: setting up the optimization for the RF sensing matrix , choosing the regularization parameter , handling complex-valued reflectivity, and debiasing the solution on the estimated support.
Definition: Basis Pursuit for Noiseless RF Imaging
Basis Pursuit for Noiseless RF Imaging
When the noise level is negligible (high ), solve the Basis Pursuit problem:
This is a linear program (for real signals) or a second-order cone program (SOCP, for complex signals) solvable by interior-point methods in .
When to use BP:
- Known exact forward model (no model mismatch).
- Very high ( dB).
- Moderate problem size ().
For large-scale imaging, BP is too expensive; the LASSO solved by FISTA or ADMM is preferred.
Definition: LASSO and BPDN for Noisy RF Imaging
LASSO and BPDN for Noisy RF Imaging
BPDN (constrained form):
where is the noise budget ().
LASSO (penalized form):
Equivalence: For every , there exists a such that the BPDN and LASSO solutions coincide (and vice versa). The mapping depends on the data.
For RF imaging: The LASSO is almost always preferred because:
- Efficient first-order solvers (FISTA, ADMM) are available.
- is easier to tune than .
- Same theoretical guarantees (§The MIMO Sensing Matrix).
Theorem: Complex Soft-Thresholding for RF Imaging
RF measurements are complex-valued. The proximal operator of for complex vectors is the complex soft-thresholding:
This shrinks the magnitude and preserves the phase.
Phase preservation is essential for coherent RF imaging where phase carries spatial information. Unlike real soft-thresholding (which simply shifts toward zero), complex soft-thresholding shrinks the complex magnitude while keeping the argument unchanged.
Wirtinger gradient of the data fidelity
— identical to the real case via Wirtinger calculus.
Proximal operator derivation
The proximal operator of for complex is . Writing , the minimizer of has , yielding the stated formula.
Phase preservation
Since for , the phase is preserved exactly.
Definition: Regularization Parameter Selection for RF Imaging
Regularization Parameter Selection for RF Imaging
The choice of critically affects reconstruction quality. Three principled selection strategies:
1. Discrepancy principle (known ): Choose such that . This matches the residual to the expected noise level.
2. Cross-validation (unknown ): Hold out a random subset of measurements , solve on , evaluate the prediction error on . Repeat for folds and select minimizing the average prediction error.
3. SURE (Stein's Unbiased Risk Estimate): For the LASSO with Gaussian noise, , where is the number of nonzero entries.
For radar imaging: The discrepancy principle is most reliable because the noise level is typically known from the receiver noise figure.
Definition: Debiasing on the Estimated Support
Debiasing on the Estimated Support
The LASSO solution is biased: the penalty shrinks all nonzero amplitudes toward zero.
Debiasing procedure:
- Estimate the support .
- Solve the least-squares problem restricted to :
- This is equivalent to where contains only the columns indexed by .
When to debias: Always for amplitude-critical applications (radar cross-section estimation, quantitative imaging). Not needed for pure detection (support recovery).
LASSO Reconstruction of Point Scatterers
Demonstrates LASSO reconstruction for an RF imaging scene with point scatterers. Vary to observe the bias-variance tradeoff: too large kills weak scatterers (over-regularization); too small retains noise (under-regularization).
Left: True scene and LASSO reconstruction. Right: Support recovery and amplitude error vs. .
Parameters
Example: LASSO Reconstruction for Compressed SAR
Setup: Stripmap SAR with 50% random frequency subsampling. pixels, measurements. Scene: 25 point scatterers + 5 extended targets (buildings).
Results:
| Method | NMSE (dB) | SSIM | Time (s) |
|---|---|---|---|
| Matched filter | 0.42 | 0.01 | |
| LASSO (FISTA, 100 iter) | 0.89 | 2.1 | |
| LASSO (ADMM, 40 iter) | 0.90 | 2.5 | |
| BP (SOCP) | 0.91 | 180 |
FISTA and ADMM achieve nearly the same quality as BP at 100 lower computational cost. The matched filter provides a quick preview but with significant sidelobe artifacts.
Key observation
FISTA and ADMM reach near-BP quality at a fraction of the cost because each iteration only requires two matrix-vector products ( and ), which exploit Kronecker structure for cost.
Debiasing effect
Applying debiasing to the FISTA solution improves NMSE from to dB by removing the shrinkage bias.
FISTA vs. ADMM vs. Coordinate Descent for Imaging
The imaging-specific sensing matrix has structure (Kronecker, partial DFT) that makes some solvers more natural:
| Solver | Cost/iter | Best when |
|---|---|---|
| FISTA | matvec | has fast matvec (Kronecker/FFT) |
| ADMM | Linear system + prox | Composite penalties ( + TV) |
| Coordinate descent | per coordinate | stored explicitly, moderate |
For RF imaging with Kronecker , FISTA is typically fastest for the pure LASSO. ADMM dominates for composite penalties because FISTA would require an inner proximal loop. Coordinate descent is rarely used because the matvec with does not decompose coordinate-wise.
Common Mistake: Selection Pitfalls in RF Imaging
Mistake:
Setting by visual inspection or by using the universal threshold without accounting for the sensing matrix structure. The universal threshold assumes an orthonormal , but the RF sensing matrix has non-uniform column norms and high coherence, so the effective threshold is different.
Correction:
Use the discrepancy principle when is known (standard in radar). If is unknown, use 5-fold cross-validation. Always validate the chosen by inspecting the residual image for systematic structure (which indicates under-regularization).
Choosing the Sparsifying Basis for RF Imaging
The LASSO assumes sparsity in some domain. For RF imaging:
| Basis | Sparse for | LASSO becomes |
|---|---|---|
| Identity | Point scatterers | |
| Wavelet | Natural scenes with edges | |
| DCT | Smooth scenes | Same form with DCT |
| Learned dictionary | Complex scenes | Requires dictionary learning |
Practical advice:
- Start with the identity (canonical sparsity) for scenes dominated by point scatterers.
- Use wavelets for scenes with extended targets.
- Use TV (analysis sparsity, §Total Variation Reconstruction) for piecewise-constant scenes.
- For mixed scenes, combine + TV via ADMM.
Quick Check
After solving the LASSO for an RF imaging problem, you obtain . Why might the estimated scatterer amplitudes be systematically too small?
The penalty shrinks all nonzero entries toward zero (shrinkage bias).
The sensing matrix has too few rows.
The noise level is too high.
The grid resolution is too coarse.
The penalty introduces a bias: every nonzero amplitude is reduced by approximately . Debiasing by least-squares on the detected support removes this effect.
Historical Note: From Statistics to Radar Imaging
1996--2006The LASSO was introduced by Robert Tibshirani in 1996 as a statistical regularization method for linear regression. Its adoption in signal processing and imaging was catalyzed by the compressed sensing revolution (Candes, Romberg, Tao, 2006; Donoho, 2006), which provided the theoretical guarantees showing that minimization could exactly recover sparse signals from incomplete measurements. Cetin and Karl (2001) were among the first to apply sparsity-promoting regularization to SAR imaging, demonstrating dramatic improvements over matched filter reconstruction.
LASSO
Least Absolute Shrinkage and Selection Operator: the penalized least-squares problem . Promotes sparsity through the penalty.
Related: Basis Pursuit, FISTA
Basis Pursuit
The minimization problem under equality constraints: s.t. . Equivalent to the LASSO with in the noiseless case.
Related: LASSO
FISTA
Fast Iterative Shrinkage-Thresholding Algorithm: Nesterov- accelerated proximal gradient for the LASSO, achieving convergence vs. for ISTA.
Related: LASSO
Debiasing
Post-processing step that removes the shrinkage bias by solving least-squares restricted to the estimated support.
Related: LASSO
Discrepancy Principle
Regularization parameter selection rule that chooses so the residual norm matches the expected noise level: .
Key Takeaway
Basis Pursuit is the gold standard for noiseless recovery but too expensive () for large-scale imaging. The LASSO solved by FISTA or ADMM achieves near-BP quality at orders-of-magnitude lower cost. Complex soft-thresholding preserves phase, essential for coherent RF imaging. The discrepancy principle is the recommended strategy for choosing in radar imaging where the noise level is known. Always debias on the detected support when amplitude accuracy matters.
Why This Matters: LASSO in Wireless Channel Estimation
The same LASSO formulation used for RF image reconstruction appears in mmWave and sub-THz channel estimation (Telecom Ch 35), where the channel is sparse in the angle-delay domain. The sensing matrix is a partial DFT (pilot tones), and the LASSO recovers the channel taps. The debiasing step is equally important there to obtain accurate channel gain estimates for beamforming.
See full treatment in Chapter 35