Exercises
ex-ch15-01
EasyA wideband channel has delay taps and significant paths. Estimate the pilot overhead required by (a) full least squares, (b) the CS bound . Compare.
For LS, .
For CS use as an illustrative constant.
LS overhead
pilots.
CS overhead
. With a tighter constant (e.g.\ 2) one gets .
Ratio
CS saves roughly of pilots relative to LS for this channel.
ex-ch15-02
EasyA massive-access system has devices and at most active per coherence block. Estimate the order-of-magnitude pilot length needed for CS-based activity detection.
.
Log factor
.
Pilot length
pilots (before SNR factor). With SNR penalty factor at low SNR, .
ex-ch15-03
EasyWrite down the -norm of the vector with groups .
Compute the Euclidean norm of each group, then sum.
Per-group norms
Group 1: . Group 2: . Group 3: . Group 4: .
Sum
.
ex-ch15-04
EasyWhy can sparse DOA recovery resolve coherent sources from a single snapshot, whereas MUSIC cannot? Give a two-sentence answer.
Consider the rank of .
Answer
MUSIC relies on the eigenstructure of the sample covariance; a single snapshot gives rank 1, so subspace separation fails. CS treats the snapshot as measurements of a -sparse angular vector and recovers the DOAs directly without covariance estimation.
ex-ch15-05
MediumShow that the group LASSO subdifferential at equals and derive the corresponding proximal operator (group soft-thresholding).
Differentiate w.r.t.\ .
Write the proximal problem and solve in closed form per group.
Subdifferential
for ; at the subdifferential is the unit ball .
Proximal operator
β group soft-thresholding: shrink the whole group toward zero if , otherwise scale by .
ex-ch15-06
MediumConsider a -block-sparse signal with block size and total blocks. Compare the information-theoretic sample-complexity lower bounds for (a) plain , (b) group LASSO, and (c) hierarchical sparsity with in-group sparsity .
Count the log of the number of possible supports.
Counting supports
Plain : supports of size among entries, so when . Group LASSO: groups out of : plus bits per coefficient. Hierarchical: groups plus per group: .
Comparison
Plain . Group . Hierarchical . Hierarchical is best when , saving a factor over group.
ex-ch15-07
MediumDerive the OMP update for sparse channel estimation: starting from residual and support , write the explicit form of and .
Use the normal equations restricted to .
Support update
; .
Restricted LS
.
Residual
. By construction for all β orthogonality is the "O" in OMP.
ex-ch15-08
MediumA ULA with antennas observes a single source at with SNR = 10 dB across snapshot. Show that the matched-filter DOA spectrum has mainlobe width approximately in and compute the corresponding angular resolution in degrees.
Compute as a Dirichlet kernel.
Dirichlet kernel
, where .
Mainlobe width
First null at , so mainlobe width in is .
Angular resolution
Near , . rad .
ex-ch15-09
MediumShow that for a Gaussian measurement matrix, block-RIP of order with block size holds with high probability when . Sketch the union-bound argument.
Count the number of block supports and cover each -dim subspace.
Concentration on one support
For each fixed block support, a Gaussian matrix acts as a near-isometry on the -dim subspace with probability provided .
Union over supports
There are block supports. Union bound requires for simultaneous isometry.
Saving
Compared to plain , block-RIP saves a factor β the benefit of grouped structure.
ex-ch15-10
MediumIn unsourced random access with active users and codebook size , explain why coded CS (message-splitting into chunks) reduces the per-slot dictionary from to and why this is a strict complexity win.
CS recovery complexity scales polynomially in dictionary size.
Per-slot dictionary
Each chunk of bits indexes a -sized sub-dictionary. Parallel CS recovery is run independently in each of the slots.
Complexity
LASSO / AMP recovery is per slot, total β exponentially faster than a single recovery when .
Stitching cost
The outer tree code requires extra parity and an -level belief-propagation stitcher. The tradeoff: polynomial overhead in , exponential saving in .
ex-ch15-11
HardProve that the atomic-norm estimator succeeds exactly (zero recovery error) when source angles are separated by at least on a noise-free ULA. Reference the dual-certificate construction.
Construct a trigonometric polynomial achieving sign(coefficients) on the support.
Bound the polynomial off-support using Dirichlet kernel decay.
Dual certificate
For each source angle we need a polynomial with everywhere and . Candès and Fernandez-Granda (2014) constructed such a polynomial as a bump-function around each with controlled derivative bounds.
Minimum-separation condition
For the construction to satisfy strictly off-support, neighbouring bumps must not overlap, which requires .
Conclude exact recovery
KKT conditions with this dual certificate imply the atomic-norm primal has the -sparse representation at as its unique optimum.
ex-ch15-12
HardFor the covariance-based activity detector, show that the non-Bayesian maximum-likelihood objective is convex in each coordinate and derive the closed-form coordinate-descent update.
Use the Sherman-Morrison update for .
Reduce the per-coordinate problem to a scalar quadratic.
Sherman-Morrison
If , then .
Univariate objective
Substituting into yields , with , .
Convex in $\gamma_k$
for ; the minimizer is . Iterate coordinate-by-coordinate until convergence. This is the Fengler-Haghighatshoar-Jung-Caire algorithm.
ex-ch15-13
HardFor FDD massive MIMO, show that the uplink and downlink channels share the angular support but have different phases. Use this to justify CS-based downlink training with pilot overhead bounded by the uplink-estimated support size.
Write the channel as and analyse frequency dependence.
Path-wise model
. The angles are frequency-agnostic (geometry); only the complex gains differ between UL and DL.
Support reuse
Estimate UL support via CS on uplink pilots. DL channel estimation then reduces to estimating complex gains β a problem of dimension , not .
Pilot budget
DL pilots needed: (gain estimation) rather than (support + gains). For massive MIMO with angular bins and paths, this reduces DL pilot overhead by .
ex-ch15-14
HardShow that the HiHTP projection onto the -hierarchical-sparse set is separable: one first picks the largest entries inside each group, then picks the groups with the largest in-group energy. Prove this is optimal.
Frame the projection as a constrained least-squares problem over hierarchical supports.
Problem
subject to supported on groups, each with nonzero entries.
Inner optimisation
Fix active-group set . Inside group , best -sparse approximation of is its top- coordinates. In-group error: .
Outer optimisation
Error as a function of : . Minimised by picking the groups maximising .
Separability
Thus the two-level procedure β inner top-, outer top- by in-group energy β is jointly optimal.
ex-ch15-15
HardDerive the phase-transition threshold for DOA recovery from a single snapshot using Gaussian widths. For sources and an -antenna ULA, show that recovery succeeds when up to constants.
Treat the ULA dictionary as an approximate RIP matrix and apply Chen-Chi-Fannjiang.
Gaussian width of $K$-sparse set
The Gaussian width of the -sparse unit ball in is .
ULA dictionary incoherence
When angles are separated by , columns of have mutual coherence , enabling RIP-like concentration.
Sample complexity
Candès-Plan's matrix-from-random-samples result gives for successful recovery.
ex-ch15-16
ChallengeProve that in unsourced random access, the Polyanskiy bound on per-user energy behaves like const as with fixed, and discuss why coded CS can approach this bound.
Use the union-bound analysis of a MAC with i.i.d.\ Gaussian codebooks and symmetric decoding.
Polyanskiy's bound
Polyanskiy (2017) derives the achievable finite-blocklength bound via a converse on typical-set decoding of an i.i.d.\ MAC with active users. As with fixed, the bound converges to a constant depending on the per-user error target.
Coded CS achievability
Coded CS uses i.i.d.\ Gaussian codebooks and AMP-based decoding (Amalladinne et al.), which in the large-system limit achieves the i.i.d.-Gaussian-input MAC capacity via state evolution. Thus coded CS matches Polyanskiy's bound to within the AMP-MMSE gap.
Remark on CommIT direction
Closing the small residual gap to Polyanskiy's bound is an open research direction; current best approaches combine HiHTP with outer LDPC codes.
ex-ch15-17
ChallengeConsider joint channel estimation and data detection in a grant-free uplink. Formulate the problem as a sparse-recovery problem with a structured sensing matrix and discuss when the joint problem is identifiable.
Write .
Identifiability requires pairing constraints (e.g., pilot symbol known).
Bilinear model
where each active user contributes a pilot-channel-data product. This is a bilinear sparse-recovery problem.
Identifiability
Without constraints, is determined only up to a complex scalar per user. Identifiability is restored by (a) known pilot symbols, (b) finite alphabets, or (c) multiple receive antennas (MMV structure).
Algorithms
Expectation-maximization, bilinear AMP (BiG-AMP), and Caire's joint AD-and-decoding framework combine CS recovery with data-symbol constellation constraints.
ex-ch15-18
ChallengeProve that for hierarchical sparsity with one outer-group active (), HiHTP reduces to ordinary HTP on the active group, and the sample complexity is β the single-group CS bound.
Track the HiHTP iteration when .
Reduction
With , outer-group selection is trivial; HiHTP picks the single group with the largest in-group top- energy. Inside that group, HiHTP reduces to HTP.
Sample complexity
HTP on a -dimensional ambient space with -sparse signal needs by Foucart's analysis.
Contrast with plain CS
Plain CS on the full ambient -space needs . HiHTP exploits the known outer-group support to remove the factor β a pure structural gain.
ex-ch15-19
ChallengeConsider a massive-MIMO BS with receive antennas and active users. Derive the asymptotic behaviour of the covariance-based detector as with fixed.
Show that in operator norm.
Law of large numbers
almost surely as (assuming ergodic LSFC).
ML in the limit
The ML objective becomes deterministic; at its minimum is attained at the true LSFC vector .
$K_a > M$ regime
Even for , the Fisher information of via the covariance entries remains non-singular in generic pilot books β this is why the covariance detector breaks the barrier. This is the core CommIT result.
ex-ch15-20
ChallengeShow that the atomic-norm SDP for DOA has dual s.t.\ and interpret the dual variable geometrically.
Lagrangianize the primal atomic norm and maximise over the Lagrange multiplier.
Primal-dual
The atomic norm is the support function of the polar , so . Plugging in and noting yields the dual.
Dual polynomial
is a trigonometric polynomial of degree . The constraint for all is the bounded-real lemma LMI.
Geometric interpretation
The optimal touches precisely at the true source angles; these are the active contact points that certify primal optimality. Reading off the touch points gives the DOA estimates.