Ferkans — Interactive Telecom Tutor

ex-ch15-01

Easy

A wideband channel has $L = 64$ delay taps and $s = 4$ significant paths. Estimate the pilot overhead required by (a) full least squares, (b) the CS bound $M \sim s \log(L/s)$ . Compare.

Show Hint

For LS, $M \geq L$ .

For CS use $M = 3 s \log_2(L/s)$ as an illustrative constant.

Solution

LS overhead

$M_{\text{LS}} \geq L = 64$ pilots.

CS overhead

$M_{\text{CS}} \approx 3 \times 4 \times \log_2(16) = 3 \times 4 \times 4 = 48$ . With a tighter constant (e.g.\ 2) one gets $M \approx 32$ .

Ratio

CS saves roughly $25\text{-}50\%$ of pilots relative to LS for this channel.
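The arithmetic above can be reproduced in a few lines. This is a minimal sketch: the helper name `cs_pilots` is hypothetical, and the constants 3 and 2 are the illustrative values from the hint and solution, not values implied by theory.

```python
import math

L_taps = 64   # number of delay taps
s = 4         # number of significant paths

def cs_pilots(num_taps, sparsity, c):
    """Illustrative CS pilot count M = c * s * log2(L / s)."""
    return c * sparsity * math.log2(num_taps / sparsity)

m_ls = L_taps                              # full least squares needs M >= L
m_cs_loose = cs_pilots(L_taps, s, c=3)     # constant from the hint
m_cs_tight = cs_pilots(L_taps, s, c=2)     # tighter illustrative constant

saving_loose = 1 - m_cs_loose / m_ls
saving_tight = 1 - m_cs_tight / m_ls
print(m_ls, m_cs_loose, m_cs_tight)
print(f"pilot saving: {saving_loose:.0%} to {saving_tight:.0%}")
```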

ex-ch15-02

Easy

A massive-access system has $K_{\text{total}} = 1000$ devices and at most $K_a = 20$ active per coherence block. Estimate the order-of-magnitude pilot length needed for CS-based activity detection.

Show Hint

$M \sim K_a \log(K_{\text{total}}/K_a)$ .

Solution

Log factor

$\ln(1000/20) = \ln(50) \approx 3.9$ (natural logarithm).

Pilot length

$M \sim 20 \times 3.9 \approx 78$ pilots (before any SNR factor). With an SNR penalty factor of $\approx 3$ at low SNR, $M \approx 3 \times 78 = 234$ , i.e. on the order of a few hundred pilots.
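A quick numerical check of this estimate, as a sketch only: the natural logarithm and the low-SNR factor of 3 are the values assumed in the solution, and the variable names are hypothetical.

```python
import math

K_total = 1000       # total devices
K_a = 20             # active devices per coherence block
snr_penalty = 3      # illustrative low-SNR factor from the solution

log_factor = math.log(K_total / K_a)     # natural log of 50
m_base = K_a * log_factor                # pilot length before the SNR factor
m_low_snr = snr_penalty * m_base         # pilot length with the SNR factor
print(f"log factor {log_factor:.2f}, M ~ {m_base:.0f}, low-SNR M ~ {m_low_snr:.0f}")
```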

ex-ch15-03

Easy

Write down the $\ell_{2,1}$ -norm of the vector $\mathbf{x} = (1, 0, 2, 2, 0, 0, 1, 1)^T$ with groups $\{1,2\}, \{3,4\}, \{5,6\}, \{7,8\}$ .

Show Hint

Compute the Euclidean norm of each group, then sum.

Solution

Per-group norms

Group 1: $\sqrt{1^2+0^2}=1$ . Group 2: $\sqrt{4+4}=2\sqrt{2}$ . Group 3: $0$ . Group 4: $\sqrt{1+1}=\sqrt{2}$ .

Sum

$\|\mathbf{x}\|_{2,1} = 1 + 2\sqrt{2} + 0 + \sqrt{2} = 1 + 3\sqrt{2} \approx 5.24$ .
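The same computation as a short sketch. The function name `l21_norm` is hypothetical, and the groups are written with zero-based indices.

```python
import math

x = [1, 0, 2, 2, 0, 0, 1, 1]
# Zero-based version of the groups {1,2}, {3,4}, {5,6}, {7,8}.
groups = [(0, 1), (2, 3), (4, 5), (6, 7)]

def l21_norm(vec, index_groups):
    """Sum over groups of the Euclidean norm of each group."""
    return sum(math.sqrt(sum(vec[i] ** 2 for i in g)) for g in index_groups)

norm = l21_norm(x, groups)
print(round(norm, 2))
```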

ex-ch15-04

Easy

Why can sparse DOA recovery resolve coherent sources from a single snapshot, whereas MUSIC cannot? Give a two-sentence answer.

Show Hint

Consider the rank of $\mathbf{y}\mathbf{y}^H$ .

Solution

Answer

MUSIC relies on the eigenstructure of the sample covariance; a single snapshot gives rank 1, so subspace separation fails. CS treats the snapshot as $N_a$ measurements of a $K$ -sparse angular vector and recovers the DOAs directly without covariance estimation.

ex-ch15-05

Medium

Show that the group LASSO subdifferential at $\mathbf{x}_g \neq \mathbf{0}$ equals $\lambda \mathbf{x}_g / \|\mathbf{x}_g\|_2$ and derive the corresponding proximal operator (group soft-thresholding).

Show Hint

Differentiate $\|\mathbf{x}_g\|_2$ w.r.t.\ $\mathbf{x}_g$ .

Write the proximal problem and solve in closed form per group.

Solution

Subdifferential

$\nabla_{\mathbf{x}_g} \|\mathbf{x}_g\|_2 = \mathbf{x}_g/\|\mathbf{x}_g\|_2$ for $\mathbf{x}_g \neq \mathbf{0}$ ; at $\mathbf{x}_g = \mathbf{0}$ the subdifferential is the unit ball $\{\mathbf{v} : \|\mathbf{v}\|_2 \leq 1\}$ .

Proximal operator

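$\text{prox}_{\lambda\|\cdot\|_2}(\mathbf{z}_g) = \arg\min_{\mathbf{x}_g} \tfrac{1}{2}\|\mathbf{x}_g - \mathbf{z}_g\|_2^2 + \lambda\|\mathbf{x}_g\|_2$ . For $\mathbf{x}_g \neq \mathbf{0}$ , stationarity gives $\mathbf{x}_g - \mathbf{z}_g + \lambda\mathbf{x}_g/\|\mathbf{x}_g\|_2 = \mathbf{0}$ , so $\mathbf{x}_g$ is a positive multiple of $\mathbf{z}_g$ with $\|\mathbf{x}_g\|_2 = \|\mathbf{z}_g\|_2 - \lambda$ , which requires $\|\mathbf{z}_g\|_2 > \lambda$ . Hence $\text{prox}_{\lambda\|\cdot\|_2}(\mathbf{z}_g) = (1 - \lambda/\|\mathbf{z}_g\|_2)_+ \, \mathbf{z}_g$ (group soft-thresholding): the whole group is set exactly to zero if $\|\mathbf{z}_g\|_2 \leq \lambda$ , otherwise it is scaled by $(1 - \lambda/\|\mathbf{z}_g\|_2)$ .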
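A minimal sketch of group soft-thresholding for one real-valued group. The function name and the test vectors are hypothetical and chosen so the norms are easy to check by hand.

```python
import math

def group_soft_threshold(z, lam):
    """Proximal operator of lam * ||.||_2 applied to one group z."""
    norm = math.sqrt(sum(v * v for v in z))
    if norm <= lam:
        # The whole group is zeroed, not merely shrunk.
        return [0.0] * len(z)
    scale = 1.0 - lam / norm
    return [scale * v for v in z]

small = group_soft_threshold([0.3, 0.4], lam=1.0)   # norm 0.5 <= lam
large = group_soft_threshold([3.0, 4.0], lam=1.0)   # norm 5, scale 0.8
print(small, large)
```

The direction of a surviving group is unchanged; only its length is reduced by `lam`.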

ex-ch15-06

Medium

Consider a $K$ -block-sparse signal with block size $B$ and $G$ total blocks. Compare the information-theoretic sample-complexity lower bounds for (a) plain $\ell_1$ , (b) group LASSO, and (c) hierarchical sparsity with in-group sparsity $s < B$ .

Show Hint

Count the log of the number of possible supports.

Solution

Counting supports

Plain $\ell_1$ : supports of size $KB$ among $GB$ entries, so $\log \binom{GB}{KB} \approx KB \log(G/K)$ when $K \ll G$ . Group LASSO: $K$ groups out of $G$ , so $\log\binom{G}{K} \approx K\log(G/K)$ , plus $KB$ degrees of freedom for the coefficient values. Hierarchical: $K$ groups plus $s$ entries per group: $K\log(G/K) + Ks\log(B/s) + Ks$ .

Comparison

Plain $\ell_1 \gtrsim KB\log(G/K)$ . Group $\gtrsim K\log(G/K) + KB$ . Hierarchical $\gtrsim K\log(G/K) + Ks\log(B/s) + Ks$ . Hierarchical is best when $s \ll B$ : the coefficient term drops from $KB$ to $Ks(\log(B/s) + 1)$ , a saving of $\approx B/s$ over group up to the logarithmic factor.
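To see the ordering of the three bounds numerically, here is a sketch with hypothetical parameters ($G = 64$, $B = 16$, $K = 4$, $s = 2$), natural logarithms, and all constants set to 1.

```python
import math

G, B, K, s = 64, 16, 4, 2        # hypothetical problem sizes

log_groups = math.log(G / K)     # log(G/K), natural logarithm

plain = K * B * log_groups                                 # plain l1
group = K * log_groups + K * B                             # group LASSO
hier = K * log_groups + K * s * math.log(B / s) + K * s    # hierarchical

print(f"plain {plain:.1f}, group {group:.1f}, hierarchical {hier:.1f}")
```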

ex-ch15-07

Medium

Derive the OMP update for sparse channel estimation: starting from residual $\mathbf{r}^{(k-1)}$ and support $\mathcal{S}^{(k-1)}$ , write the explicit form of $\hat{\mathbf{h}}_{\mathcal{S}^{(k)}}$ and $\mathbf{r}^{(k)}$ .

Show Hint

Use the normal equations restricted to $\mathcal{S}^{(k)}$ .

Solution

Support update

$i^\star = \arg\max_i |\boldsymbol{\phi}_i^H \mathbf{r}^{(k-1)}|$ ; $\mathcal{S}^{(k)} = \mathcal{S}^{(k-1)} \cup \{i^\star\}$ .

Restricted LS

$\hat{\mathbf{h}}_{\mathcal{S}^{(k)}} = (\boldsymbol{\Phi}_{\mathcal{S}^{(k)}}^H \boldsymbol{\Phi}_{\mathcal{S}^{(k)}})^{-1} \boldsymbol{\Phi}_{\mathcal{S}^{(k)}}^H \mathbf{y}$ .

Residual

$\mathbf{r}^{(k)} = \mathbf{y} - \boldsymbol{\Phi}_{\mathcal{S}^{(k)}} \hat{\mathbf{h}}_{\mathcal{S}^{(k)}}$ . By construction $\mathbf{r}^{(k)} \perp \boldsymbol{\phi}_i$ for all $i \in \mathcal{S}^{(k)}$ — orthogonality is the "O" in OMP.
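The three steps can be sketched in plain Python. Assumptions: the dictionary is real-valued (so $^H$ becomes a transpose), it is stored as a list of columns, and the helper names `dot`, `solve`, and `omp` as well as the test channel are hypothetical. The restricted LS step solves the normal equations with a small Gaussian elimination.

```python
import random

def dot(a, b):
    return sum(u * v for u, v in zip(a, b))

def solve(A, b):
    """Solve the square system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[p] = aug[p], aug[c]
        for r in range(c + 1, n):
            f = aug[r][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[r][k] -= f * aug[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x

def omp(cols, y, n_iter):
    """Real-valued OMP; cols[i] is the i-th dictionary column phi_i."""
    support, h, r = [], [], y[:]
    for _ in range(n_iter):
        # Support update: column most correlated with the residual.
        candidates = [i for i in range(len(cols)) if i not in support]
        i_star = max(candidates, key=lambda i: abs(dot(cols[i], r)))
        support.append(i_star)
        # Restricted LS: normal equations on the current support.
        gram = [[dot(cols[a], cols[b]) for b in support] for a in support]
        rhs = [dot(cols[a], y) for a in support]
        h = solve(gram, rhs)
        # Residual update: y minus its projection onto the selected columns.
        r = [y[m] - sum(h[j] * cols[i][m] for j, i in enumerate(support))
             for m in range(len(y))]
    return support, h, r

random.seed(0)
M_pilots, L_taps = 16, 32
cols = [[random.gauss(0, 1) for _ in range(M_pilots)] for _ in range(L_taps)]
y = [2.0 * cols[3][m] - 1.5 * cols[17][m] for m in range(M_pilots)]  # 2-sparse channel
support, h_hat, residual = omp(cols, y, n_iter=2)
print(support, h_hat)
```

Whatever columns are selected, the residual returned after each iteration is orthogonal to all of them, which is the property noted in the residual step.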

ex-ch15-08

Medium

A ULA with $N_a = 8$ antennas observes a single source at $\theta = 30^\circ$ with SNR = 10 dB across $T = 1$ snapshot. Show that the matched-filter DOA spectrum has mainlobe width approximately $2/N_a$ in $\sin\theta$ and compute the corresponding angular resolution in degrees.

Show Hint

Compute $|\mathbf{a}(\theta)^H \mathbf{a}(\theta_0)|^2$ as a Dirichlet kernel.

Solution

Dirichlet kernel

$|\mathbf{a}(\theta)^H \mathbf{a}(\theta_0)|^2 = |\sum_{n=0}^{N_a-1} e^{j\pi n (\sin\theta - \sin\theta_0)}|^2 = \frac{\sin^2(N_a \pi \Delta/2)}{\sin^2(\pi \Delta/2)}$ , where $\Delta = \sin\theta - \sin\theta_0$ .

Mainlobe width

First null at $\Delta = 2/N_a$ , so the mainlobe extends $2/N_a = 0.25$ in $\sin\theta$ from its peak to the first null (the null-to-null width is $4/N_a$ ).

Angular resolution

Near $\theta_0 = 30^\circ$ , $d\sin\theta/d\theta = \cos 30^\circ \approx 0.866$ . $\Delta\theta \approx 0.25/0.866 \approx 0.289$ rad $\approx 16.5^\circ$ .
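A short numerical check of the kernel and the resolution figure, as a sketch assuming half-wavelength antenna spacing (the $e^{j\pi n \Delta}$ phase used above). The function name `mf_spectrum` is hypothetical.

```python
import cmath
import math

N_a = 8
theta0 = math.radians(30)

def mf_spectrum(delta, n_ant):
    """|a(theta)^H a(theta0)|^2 as a function of delta = sin(theta) - sin(theta0)."""
    return abs(sum(cmath.exp(1j * math.pi * n * delta) for n in range(n_ant))) ** 2

peak = mf_spectrum(0.0, N_a)             # equals N_a^2 at the true angle
first_null = mf_spectrum(2 / N_a, N_a)   # vanishes up to rounding error

# Convert the 2/N_a extent in sin(theta) into an angle near theta0.
delta_theta_deg = math.degrees((2 / N_a) / math.cos(theta0))
print(peak, first_null, round(delta_theta_deg, 1))
```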

ex-ch15-09

Medium

Show that for a Gaussian measurement matrix, block-RIP of order $K$ with block size $B$ holds with high probability when $M \geq c\,(KB + K\log(G/K))$ . Sketch the union-bound argument.

Show Hint

Count the number of block supports and cover each $KB$ -dim subspace.

Solution

Concentration on one support

For each fixed block support, a Gaussian matrix acts as a near-isometry on the $KB$ -dim subspace with probability $1 - 2e^{-c\delta^2 M}$ provided $M \geq c_0 KB/\delta^2$ .

Union over supports

There are $\binom{G}{K} \leq (eG/K)^K$ block supports. Union bound requires $M \geq c_0 (KB + K\log(G/K))/\delta^2$ for simultaneous isometry.

Saving

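Plain sparsity with $s = KB$ nonzeros among $N = GB$ entries needs $s\log(N/s) = KB\log(G/K)$ . The block bound $KB + K\log(G/K)$ pays the logarithm once per block instead of once per coefficient, a saving of roughly a factor $\min(B, \log(G/K))$ : this is the benefit of grouped structure.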

ex-ch15-10

Medium

In unsourced random access with $K_a$ active users and codebook size $2^B$ , explain why coded CS (message-splitting into $L$ chunks) reduces the per-slot dictionary from $2^B$ to $2^{B/L}$ and why this is a strict complexity win.

Show Hint

CS recovery complexity scales polynomially in dictionary size.

Solution

Per-slot dictionary

Each chunk of $B/L$ bits indexes a $2^{B/L}$ -sized sub-dictionary. Parallel CS recovery is run independently in each of the $L$ slots.

Complexity

LASSO / AMP recovery is $O(M \cdot 2^{B/L})$ per slot, total $O(L M \cdot 2^{B/L})$ — exponentially faster than a single $O(M \cdot 2^B)$ recovery when $L \geq 2$ .
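To put numbers on the dictionary sizes, here is a sketch with hypothetical values $B = 96$ bits and $L = 8$ chunks. It compares column counts only and assumes the same $M$ in both cases.

```python
B_bits = 96      # hypothetical payload size in bits
L_slots = 8      # hypothetical number of chunks (slots)

single_dictionary = 2 ** B_bits                    # columns without splitting
per_slot_dictionary = 2 ** (B_bits // L_slots)     # columns per slot
total_columns = L_slots * per_slot_dictionary      # columns over all slots

reduction = single_dictionary // total_columns
print(per_slot_dictionary, total_columns)
print(f"column-count reduction: 2^{reduction.bit_length() - 1}")
```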

Stitching cost

The outer tree code requires extra parity and an $O(K_a^2)$ -level belief-propagation stitcher. The tradeoff: polynomial overhead in $K_a$ , exponential saving in $B$ .

ex-ch15-11

Hard

Prove that the atomic-norm estimator succeeds exactly (zero recovery error) when source angles are separated by at least $\Delta\theta_{\min} \gtrsim 1/N_a$ on a noise-free ULA. Reference the dual-certificate construction.

Show Hint

Construct a trigonometric polynomial achieving sign(coefficients) on the support.

Bound the polynomial off-support using Dirichlet kernel decay.

Solution

Dual certificate

For each source angle $\theta_k$ we need a polynomial $q(\theta)$ with $|q(\theta)| \leq 1$ everywhere and $q(\theta_k) = \text{sign}(c_k)$ . Candès and Fernandez-Granda (2014) constructed such a polynomial as a bump-function around each $\theta_k$ with controlled derivative bounds.

Minimum-separation condition

For the construction to satisfy $|q(\theta)| < 1$ strictly off-support, neighbouring bumps must not overlap, which requires $|\sin\theta_i - \sin\theta_j| \geq c/N_a$ for all $i \neq j$ .

Conclude exact recovery

KKT conditions with this dual certificate imply the atomic-norm primal has the $K$ -sparse representation at $\{\theta_k\}$ as its unique optimum. $\blacksquare$

ex-ch15-12

Hard

For the covariance-based activity detector, show that the non-Bayesian maximum-likelihood objective $\mathcal{L}(\boldsymbol{\gamma}) = \log\det \boldsymbol{\Sigma}_y(\boldsymbol{\gamma}) + \mathrm{tr}(\boldsymbol{\Sigma}_y(\boldsymbol{\gamma})^{-1} \widehat{\boldsymbol{\Sigma}}_y)$ is convex in each coordinate $\gamma_k$ and derive the closed-form coordinate-descent update.

Show Hint

Use the Sherman-Morrison update for $\boldsymbol{\Sigma}_y^{-1}$ .

Reduce the per-coordinate problem to a scalar quadratic.

Solution

Sherman-Morrison

If $\boldsymbol{\Sigma}_y = \boldsymbol{\Sigma}_{-k} + \gamma_k \boldsymbol{\phi}_k \boldsymbol{\phi}_k^H$ , then $\boldsymbol{\Sigma}_y^{-1} = \boldsymbol{\Sigma}_{-k}^{-1} - \frac{\gamma_k \boldsymbol{\Sigma}_{-k}^{-1}\boldsymbol{\phi}_k \boldsymbol{\phi}_k^H \boldsymbol{\Sigma}_{-k}^{-1}}{1 + \gamma_k \boldsymbol{\phi}_k^H \boldsymbol{\Sigma}_{-k}^{-1}\boldsymbol{\phi}_k}$ .

Univariate objective

Substituting into $\mathcal{L}$ yields $f(\gamma_k) = \log(1 + \gamma_k u_k) - \gamma_k v_k/(1+\gamma_k u_k) + \text{const}$ , with $u_k = \boldsymbol{\phi}_k^H\boldsymbol{\Sigma}_{-k}^{-1}\boldsymbol{\phi}_k$ , $v_k = \boldsymbol{\phi}_k^H\boldsymbol{\Sigma}_{-k}^{-1}\widehat{\boldsymbol{\Sigma}}_y\boldsymbol{\Sigma}_{-k}^{-1}\boldsymbol{\phi}_k$ .

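Unique minimizer in $\gamma_k$

$f'(\gamma_k) = \frac{u_k(1+\gamma_k u_k) - v_k}{(1+\gamma_k u_k)^2}$ , whose numerator is linear and increasing in $\gamma_k$ , so $f$ decreases up to the stationary point $(v_k - u_k)/u_k^2$ and increases afterwards. Since $f''(\gamma_k) = \frac{u_k\,(2v_k - u_k(1+\gamma_k u_k))}{(1+\gamma_k u_k)^3}$ , $f$ is convex only for $\gamma_k < (2v_k - u_k)/u_k^2$ (a range that contains the stationary point when $v_k > 0$ ) and is quasi-convex overall, which is enough for a unique minimizer on $\gamma_k \geq 0$ : $\gamma_k^\star = \max(0, (v_k - u_k)/u_k^2)$ . Iterate coordinate-by-coordinate until convergence. This is the Fengler-Haghighatshoar-Jung-Caire algorithm.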
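The closed-form update can be sanity-checked against a brute-force grid search on the scalar objective. This is a sketch with hypothetical values $u_k = 0.5$ , $v_k = 2$ ; the function names are illustrative and the constant term of $f$ is dropped.

```python
import math

def f(gamma, u, v):
    """Per-coordinate objective log(1 + gamma*u) - gamma*v / (1 + gamma*u)."""
    return math.log(1 + gamma * u) - gamma * v / (1 + gamma * u)

def gamma_star(u, v):
    """Closed-form coordinate update max(0, (v - u) / u^2)."""
    return max(0.0, (v - u) / u ** 2)

u, v = 0.5, 2.0                       # hypothetical values of u_k and v_k
g_opt = gamma_star(u, v)              # (2 - 0.5) / 0.25 = 6

# Brute-force check on a grid over [0, 20] with step 0.01.
grid = [k * 0.01 for k in range(2001)]
g_grid = min(grid, key=lambda g: f(g, u, v))
print(g_opt, g_grid)
```

When $v_k \leq u_k$ the unconstrained stationary point is non-positive and the update clips to zero, which marks device $k$ as inactive.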

ex-ch15-13

Hard

For FDD massive MIMO, show that the uplink and downlink channels share the angular support but have different phases. Use this to justify CS-based downlink training with pilot overhead bounded by the uplink-estimated support size.

Show Hint

Write the channel as $\mathbf{h} = \sum_\ell \alpha_\ell \mathbf{a}(\theta_\ell)$ and analyse frequency dependence.

Solution

Path-wise model

$\mathbf{h}(f) = \sum_\ell \alpha_\ell e^{-j2\pi f \tau_\ell} \mathbf{a}(\theta_\ell)$ . The angles $\theta_\ell$ are frequency-agnostic (geometry); only the complex gains differ between UL and DL.

Support reuse

Estimate UL support $\{\theta_\ell\}$ via CS on uplink pilots. DL channel estimation then reduces to estimating $L$ complex gains — a problem of dimension $L$ , not $G$ .

Pilot budget

DL pilots needed: $M_{\text{DL}} \sim L$ (gain estimation) rather than $L\log(G/L)$ (support + gains). For massive MIMO with $G = 256$ angular bins and $L \approx 5$ paths, this reduces DL pilot overhead by a factor $\log(G/L) = \log(51.2)$ , i.e. $\approx 5.7\times$ with base-2 logarithms or $\approx 3.9\times$ with natural logarithms.
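The pilot-budget ratio depends on the base of the logarithm, which a few lines make explicit. This is a sketch with all constants set to 1 and hypothetical variable names.

```python
import math

G_bins = 256     # angular bins
L_paths = 5      # propagation paths

m_dl_gains_only = L_paths                                  # support known from UL
m_dl_full_log2 = L_paths * math.log2(G_bins / L_paths)     # support + gains, base 2
m_dl_full_ln = L_paths * math.log(G_bins / L_paths)        # support + gains, natural log

ratio_log2 = m_dl_full_log2 / m_dl_gains_only
ratio_ln = m_dl_full_ln / m_dl_gains_only
print(f"reduction: {ratio_log2:.1f}x (log2), {ratio_ln:.1f}x (ln)")
```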

ex-ch15-14

Hard

Show that the HiHTP projection onto the $(K, s)$ -hierarchical-sparse set is separable: one first picks the $s$ largest entries inside each group, then picks the $K$ groups with the largest in-group energy. Prove this is optimal.

Show Hint

Frame the projection as a constrained least-squares problem over hierarchical supports.

Solution

Problem

$\hat{\mathbf{x}} = \arg\min \|\mathbf{x} - \mathbf{z}\|_2^2$ subject to $\mathbf{x}$ supported on $\leq K$ groups, each with $\leq s$ nonzero entries.

Inner optimisation

Fix active-group set $\mathcal{G}$ . Inside group $g$ , best $s$ -sparse approximation of $\mathbf{z}_g$ is its top- $s$ coordinates. In-group error: $\sum_{i \notin \text{top-}s(g)} |z_{g,i}|^2$ .

Outer optimisation

Error as a function of $\mathcal{G}$ : $\sum_{g \notin \mathcal{G}} \|\mathbf{z}_g\|_2^2 + \sum_{g \in \mathcal{G}} \sum_{i\notin\text{top-}s(g)} |z_{g,i}|^2$ . Minimised by picking the $K$ groups maximising $\|\mathbf{z}_g\|_2^2 - \sum_{i\notin\text{top-}s(g)} |z_{g,i}|^2 = \text{top-}s\text{-energy of } g$ .

Separability

Thus the two-level procedure — inner top- $s$ , outer top- $K$ by in-group energy — is jointly optimal. $\blacksquare$
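The two-level projection is easy to check against an exhaustive search on a small example. This is a sketch for real-valued data; the function names and the test vector `z` are hypothetical.

```python
from itertools import combinations, product

def top_s(group, s):
    """Indices of the s largest-magnitude entries of one group."""
    return sorted(range(len(group)), key=lambda i: -abs(group[i]))[:s]

def hi_project(z_groups, K, s):
    """Keep the top-s entries per group, then the K groups with most retained energy."""
    kept = [top_s(g, s) for g in z_groups]
    energy = [sum(g[i] ** 2 for i in idx) for g, idx in zip(z_groups, kept)]
    active = sorted(range(len(z_groups)), key=lambda g: -energy[g])[:K]
    out = [[0.0] * len(g) for g in z_groups]
    for g in active:
        for i in kept[g]:
            out[g][i] = z_groups[g][i]
    return out

def error(z_groups, x_groups):
    """Squared Euclidean distance between two grouped vectors."""
    return sum((a - b) ** 2
               for zg, xg in zip(z_groups, x_groups) for a, b in zip(zg, xg))

def brute_force_error(z_groups, K, s):
    """Smallest error over every choice of K groups and s entries per group."""
    best = float("inf")
    for active in combinations(range(len(z_groups)), K):
        choices = [list(combinations(range(len(z_groups[g])), s)) for g in active]
        for pick in product(*choices):
            x = [[0.0] * len(g) for g in z_groups]
            for g, idx in zip(active, pick):
                for i in idx:
                    x[g][i] = z_groups[g][i]
            best = min(best, error(z_groups, x))
    return best

z = [[3.0, 0.1, -2.0], [0.5, 0.4, 0.3], [-1.0, 4.0, 0.2]]
x_hat = hi_project(z, K=2, s=2)
print(x_hat, error(z, x_hat), brute_force_error(z, K=2, s=2))
```

The exhaustive search grows combinatorially, so it serves only as a check on tiny inputs; the two-level procedure itself needs only sorting.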

ex-ch15-15

Hard

Derive the phase-transition threshold for DOA recovery from a single snapshot using Gaussian widths. For $K$ sources and an $N_a$ -antenna ULA, show that recovery succeeds when $N_a \geq c K \log(G/K)$ up to constants.

Show Hint

Treat the ULA dictionary as an approximate RIP matrix and apply Chen-Chi-Fannjiang.

Solution

Gaussian width of $K$-sparse set

The Gaussian width of the $K$ -sparse unit ball in $\mathbb{C}^G$ is $\omega_K \asymp \sqrt{K \log(G/K)}$ .

ULA dictionary incoherence

When angles are separated by $\gtrsim 1/N_a$ , columns of $\mathbf{A}$ have mutual coherence $\mu \lesssim 1/\sqrt{N_a}$ , enabling RIP-like concentration.

Sample complexity

Candès-Plan's matrix-from-random-samples result gives $N_a \geq c \omega_K^2 = c K \log(G/K)$ for successful $\ell_1$ recovery. $\blacksquare$

ex-ch15-16

Challenge

Prove that in unsourced random access, the Polyanskiy bound on per-user energy behaves like $E_b/N_0 \to$ const as $K_a, n \to \infty$ with $K_a/n$ fixed, and discuss why coded CS can approach this bound.

Show Hint

Use the union-bound analysis of a MAC with i.i.d.\ Gaussian codebooks and symmetric decoding.

Solution

Polyanskiy's bound

Polyanskiy (2017) derives a finite-blocklength achievability bound on $E_b/N_0$ by random coding: i.i.d.\ Gaussian codebooks, a symmetric decoder, and a union bound over the error events of a MAC with $K_a$ active users. As $n\to\infty$ with $K_a/n$ fixed, the bound converges to a constant depending on the per-user error target.

Coded CS achievability

Coded CS uses i.i.d.\ Gaussian codebooks and AMP-based decoding (Amalladinne et al.), which in the large-system limit achieves the i.i.d.-Gaussian-input MAC capacity via state evolution. Thus coded CS matches Polyanskiy's bound to within the AMP-MMSE gap.

Remark on CommIT direction

Closing the small residual gap to Polyanskiy's bound is an open research direction; current best approaches combine HiHTP with outer LDPC codes.

ex-ch15-17

Challenge

Consider joint channel estimation and data detection in a grant-free uplink. Formulate the problem as a sparse-recovery problem with a structured sensing matrix and discuss when the joint problem is identifiable.

Show Hint

Write $\mathbf{y} = \sum_k \boldsymbol{\phi}_k h_k x_k + \mathbf{w}$ .

Identifiability requires pairing constraints (e.g., pilot symbol known).

Solution

Bilinear model

$\mathbf{y} = \sum_k \boldsymbol{\phi}_k h_k x_k + \mathbf{w}$ where each active user contributes a pilot-channel-data product. This is a bilinear sparse-recovery problem.

Identifiability

Without constraints, $(h_k, x_k)$ is determined only up to a complex scalar per user. Identifiability is restored by (a) known pilot symbols, (b) finite alphabets, or (c) multiple receive antennas (MMV structure).

Algorithms

Expectation-maximization, bilinear AMP (BiG-AMP), and Caire's joint AD-and-decoding framework combine CS recovery with data-symbol constellation constraints.

ex-ch15-18

Challenge

Prove that for hierarchical sparsity with one outer-group active ( $K=1$ ), HiHTP reduces to ordinary HTP on the active group, and the sample complexity is $M \gtrsim s\log(B/s)$ — the single-group CS bound.

Show Hint

Track the HiHTP iteration when $K=1$ .

Solution

Reduction

With $K=1$ , outer-group selection is trivial; HiHTP picks the single group $g$ with the largest in-group top- $s$ energy. Inside that group, HiHTP reduces to HTP.

Sample complexity

HTP on a $B$ -dimensional ambient space with $s$ -sparse signal needs $M \gtrsim s\log(B/s)$ by Foucart's analysis.

Contrast with plain CS

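Plain CS on the full ambient $GB$ -space needs $M \gtrsim s\log(GB/s) = s\log G + s\log(B/s)$ . With the active group known, HiHTP removes the $s\log G$ term entirely; if the group must also be identified, the hierarchical support count adds only a single $\log G$ for group selection instead of $s\log G$ . Either way the gain is purely structural. $\blacksquare$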

ex-ch15-19

Challenge

Consider a massive-MIMO BS with $N_r$ receive antennas and $K_a$ active users. Derive the asymptotic behaviour of the covariance-based detector as $N_r \to \infty$ with $M, K_a, K_{\text{total}}$ fixed.

Show Hint

Show that $\widehat{\boldsymbol{\Sigma}}_y \to \boldsymbol{\Sigma}_y$ in operator norm.

Solution

Law of large numbers

$\widehat{\boldsymbol{\Sigma}}_y = \frac{1}{N_r}\mathbf{Y}\mathbf{Y}^H \to \mathbb{E}[\widehat{\boldsymbol{\Sigma}}_y] = \boldsymbol{\Sigma}_y$ almost surely as $N_r \to \infty$ (assuming ergodic LSFC).

ML in the limit

The ML objective $\log\det + \mathrm{tr}(\cdot)$ becomes deterministic; at $\widehat{\boldsymbol{\Sigma}}_y = \boldsymbol{\Sigma}_y$ its minimum is attained at the true LSFC vector $\boldsymbol{\gamma}^\star$ .

$K_a > M$ regime

Even for $K_a > M$ , the Fisher information of $\boldsymbol{\gamma}$ via the $M^2$ covariance entries remains non-singular in generic pilot books — this is why the covariance detector breaks the $M$ barrier. This is the core CommIT result.

ex-ch15-20

Challenge

Show that the atomic-norm SDP for DOA has dual $\max_{\mathbf{q}}\ \Re\langle\mathbf{y}, \mathbf{q}\rangle$ s.t.\ $\|\mathbf{q}^H\mathbf{a}(\theta)\|_\infty \leq 1$ and interpret the dual variable $\mathbf{q}$ geometrically.

Show Hint

Lagrangianize the primal atomic norm and maximise over the Lagrange multiplier.

Solution

Primal-dual

The atomic norm is the support function of the polar $\mathcal{A}^\circ$ , so $\|\mathbf{u}\|_\mathcal{A} = \sup_{\mathbf{q} \in \mathcal{A}^\circ} \Re\langle\mathbf{q},\mathbf{u}\rangle$ . Plugging in and noting $\mathcal{A}^\circ = \{\mathbf{q} : |\mathbf{q}^H\mathbf{a}(\theta)| \leq 1\ \forall\theta\}$ yields the dual.

Dual polynomial

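$q(\theta) = \mathbf{q}^H \mathbf{a}(\theta)$ is a trigonometric polynomial of degree $N_a - 1$ . The constraint $|q(\theta)| \leq 1$ for all $\theta$ can be written as a linear matrix inequality (LMI) via the bounded-real lemma, which turns the dual into a finite-dimensional SDP.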

Geometric interpretation

The optimal dual polynomial reaches unit modulus, $|q(\theta)| = 1$ , precisely at the true source angles; these are the active contact points that certify primal optimality. Reading off the angles where $|q(\theta)| = 1$ gives the DOA estimates.
