Ferkans — Interactive Telecom Tutor

ex-ch03-01

Easy

A massive MIMO system operates with coherence interval $\tau_c = 300$ samples, serves $K = 15$ users per cell, and devotes fraction $f_p = \tau_p/\tau_c$ to pilot training.

(a) What is the minimum pilot fraction $f_p^{\min}$ to allow orthogonal pilots?

(b) If $\tau_u = \tau_d$ (equal uplink/downlink), what fraction of resources are available for data transmission?

(c) If $K$ doubles to 30, what is the new minimum pilot fraction?

Show Hint

Orthogonal pilots in $\mathbb{C}^{\tau_p}$ require $\tau_p \geq K$ .

Total resources: $\tau_p + \tau_u + \tau_d = \tau_c$ . Data fraction = $(\tau_c - \tau_p)/\tau_c$ .

Solution

(a) Minimum pilot fraction

Minimum pilot length: $\tau_p^{\min} = K = 15$ .

$f_p^{\min} = \tau_p^{\min}/\tau_c = 15/300 = 5\%$

(b) Data fraction

Data samples: $\tau_c - \tau_p = 300 - 15 = 285$ .

Data fraction: $285/300 = 95\%$ .

(c) With doubled users

$\tau_p^{\min} = 30$ , so $f_p^{\min} = 30/300 = 10\%$ .

Doubling users doubles the pilot overhead and reduces data to 90%.
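The three parts share the same arithmetic, which the sketch below reproduces. The helper name `pilot_overhead` is illustrative and not part of the exercise.

```python
def pilot_overhead(tau_c: int, num_users: int) -> tuple[float, float]:
    """Return (minimum pilot fraction, remaining data fraction)."""
    tau_p_min = num_users            # orthogonal pilots need tau_p >= K
    f_p_min = tau_p_min / tau_c
    return f_p_min, 1.0 - f_p_min


for k in (15, 30):
    f_p, f_data = pilot_overhead(300, k)
    print(f"K={k}: pilot fraction {f_p:.0%}, data fraction {f_data:.0%}")
```

The data fraction covers uplink and downlink together; with $\tau_u = \tau_d$ each direction gets half of it.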

ex-ch03-02

Easy

A base station with $N_t = 64$ antennas estimates user channels via uplink pilots. The user's spatial covariance matrix is $\mathbf{R}_k = \beta_k \mathbf{I}_{N_t}$ (i.i.d. Rayleigh channel), with $\beta_k = 1$ and pilot SNR $p_u\tau_p/\sigma^2 = \rho_p = 5$ dB.

(a) Compute the LS estimation MSE.

(b) Compute the MMSE estimation MSE. Why does MMSE provide only a modest gain over LS here?

(c) What happens to the MSE comparison if $\mathbf{R}_k$ has effective rank $r_k = 8$ ?

Show Hint

For $\mathbf{R}_k = \beta_k\mathbf{I}$ , all eigenvalues equal $\beta_k$ .

Convert $\rho_p = 5$ dB to linear: $\rho_p = 10^{5/10} \approx 3.16$ .

Solution

(a) LS MSE

$\rho_p = 10^{0.5} \approx 3.16$ .

$\text{MSE}^{\text{LS}} = \frac{N_t}{\rho_p} = \frac{64}{3.16} \approx 20.3$

(b) MMSE MSE for i.i.d. channel

All eigenvalues $\lambda_i = \beta_k = 1$ :

$\text{MSE}^{\text{MMSE}} = N_t \cdot \frac{\beta_k}{\rho_p\beta_k + 1} = 64 \cdot \frac{1}{3.16 + 1} \approx 15.4$

MMSE does better than LS by the factor $(1 + 1/\rho_p) \approx 1.32$ (about 1.2 dB), a modest gain. When all eigenvalues are equal, MMSE reduces to a scalar shrinkage $\hat{h}_i = \rho_p/(1+\rho_p) \cdot \hat{h}_i^{\text{LS}}$ , providing no subspace gain.

(c) Low-rank channel

With $r_k = 8$ equal eigenvalues $\lambda_i = N_t\beta_k/r_k = 64/8 = 8$ :

$\text{MSE}^{\text{MMSE}} = 8 \cdot \frac{8}{3.16 \times 8 + 1} \approx \frac{64}{26.3} \approx 2.43$

MMSE gain over LS: $20.3/2.43 \approx 8.3$ , or about 9.2 dB. The gain scales with $N_t/r_k = 8$ .
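As a numerical cross-check, the sketch below evaluates the LS and MMSE expressions used in this solution from a list of covariance eigenvalues. The function names are illustrative. Small differences from the hand calculation come from rounding $\rho_p$ to 3.16.

```python
import math


def ls_mse(n_t: int, rho_p: float) -> float:
    """LS error summed over all N_t antennas: N_t / rho_p."""
    return n_t / rho_p


def mmse_mse(eigenvalues: list[float], rho_p: float) -> float:
    """MMSE error: sum over eigenvalues of lambda / (rho_p * lambda + 1)."""
    return sum(lam / (rho_p * lam + 1.0) for lam in eigenvalues)


rho_p = 10 ** (5 / 10)            # 5 dB pilot SNR in linear scale
n_t = 64
iid = [1.0] * n_t                 # R_k = I: all eigenvalues equal beta_k = 1
low_rank = [n_t / 8] * 8          # rank 8 with the same trace as the i.i.d. case

ls = ls_mse(n_t, rho_p)
print("LS:", round(ls, 2))
print("MMSE, i.i.d.:", round(mmse_mse(iid, rho_p), 2))
print("MMSE, rank 8:", round(mmse_mse(low_rank, rho_p), 2))
print("gain [dB]:", round(10 * math.log10(ls / mmse_mse(low_rank, rho_p)), 1))
```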

ex-ch03-03

Medium

Derive the MMSE channel estimator for the case where the pilot sequences are non-orthogonal: $\boldsymbol{\phi}_k^H\boldsymbol{\phi}_j = c_{kj}\tau_p$ where $|c_{kj}| < 1$ for $k \neq j$ .

(a) Write the observation model after correlating with user $k$ 's pilot.

(b) What "effective noise" does user $j$ 's non-orthogonal pilot create?

(c) Derive the MMSE estimator and its error covariance for this case.

Show Hint

After correlating, $\mathbf{y}_k = \mathbf{Y}_p\boldsymbol{\phi}_k^*/\sqrt{\tau_p}$ , each term with $j \neq k$ contributes $\sqrt{p_u}\mathbf{h}_j\boldsymbol{\phi}_j^T\boldsymbol{\phi}_k^*/\sqrt{\tau_p} = \sqrt{p_u\tau_p}c_{kj}^*\mathbf{h}_j$ , which is correlated interference.

The observation model is $\mathbf{y}_k = \sqrt{p_u\tau_p}\mathbf{h}_k + \sqrt{p_u\tau_p}\sum_{j\neq k}c_{kj}^*\mathbf{h}_j + \mathbf{n}_k$ .

Apply LMMSE: cross-covariance $\boldsymbol{\Sigma}_{hy} = \sqrt{p_u\tau_p}\mathbf{R}_k$ , observation covariance $\boldsymbol{\Sigma}_{yy} = p_u\tau_p\mathbf{R}_k + p_u\tau_p\sum_{j\neq k}|c_{kj}|^2\mathbf{R}_j + \sigma^2\mathbf{I}$ .

Solution

Observation model

$\mathbf{y}_k = \sqrt{p_u\tau_p}\mathbf{h}_k + \sqrt{p_u\tau_p}\sum_{j\neq k}c_{kj}^*\mathbf{h}_j + \mathbf{n}_k$

where $\mathbf{n}_k = \mathbf{N}_p\boldsymbol{\phi}_k^*/\sqrt{\tau_p} \sim \mathcal{CN}(\mathbf{0},\sigma^2\mathbf{I})$ . Non-orthogonality introduces an effective interference term that scales with $|c_{kj}|$ .

Compute covariances

Cross-covariance: $\boldsymbol{\Sigma}_{\mathbf{h}_k\mathbf{y}_k} = \sqrt{p_u\tau_p}\mathbf{R}_k$

Observation covariance: $\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k} = p_u\tau_p\mathbf{R}_k + p_u\tau_p\sum_{j\neq k}|c_{kj}|^2\mathbf{R}_j + \sigma^2\mathbf{I}$

LMMSE formula

$\hat{\mathbf{h}}_k = \sqrt{p_u\tau_p}\mathbf{R}_k\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k}^{-1}\mathbf{y}_k$

The error covariance is $\mathbf{C}_k = \mathbf{R}_k - p_u\tau_p\mathbf{R}_k\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k}^{-1}\mathbf{R}_k$ . Non-orthogonal pilots increase $\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k}$ , enlarging the error covariance. Orthogonal pilots ( $c_{kj} = 0$ , $j\neq k$ ) recover the standard MMSE formula. $\blacksquare$
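To see the effect numerically, the sketch below specializes the error covariance to the scalar case where every $\mathbf{R}_j = \beta_j\mathbf{I}$ , so that $\mathbf{C}_k = c_k\mathbf{I}$ . This i.i.d. assumption and the function name are illustrative choices, not part of the exercise.

```python
def error_variance(beta_k, p_tau, sigma2, interferers):
    """Per-antenna MMSE error variance when every R_j = beta_j * I.

    p_tau is the product p_u * tau_p.
    interferers is a list of (|c_kj|, beta_j) pairs for j != k.
    """
    leakage = p_tau * sum(c * c * b for c, b in interferers)
    sigma_yy = p_tau * beta_k + leakage + sigma2      # scalar Sigma_yy
    return beta_k - p_tau * beta_k ** 2 / sigma_yy


orthogonal = error_variance(1.0, 10.0, 1.0, [])
correlated = error_variance(1.0, 10.0, 1.0, [(0.3, 1.0)])
print(orthogonal, correlated)
```

With no interferers the expression collapses to $\beta_k\sigma^2/(p_u\tau_p\beta_k + \sigma^2)$ , and any nonzero $|c_{kj}|$ increases it.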

ex-ch03-04

Medium

In a two-cell system ( $L = 2$ ), each cell has one user ( $K = 1$ ) and both share the same pilot. User 1 has covariance $\mathbf{R}_{1,1}$ and user 2 (in cell 2) has covariance $\mathbf{R}_{2,1}$ .

(a) Write the MMSE estimate of $\mathbf{h}_{1,1}$ from the contaminated observation.

(b) Show that the estimation error covariance does NOT vanish as $N_t \to \infty$ when $\mathbf{R}_{1,1}$ and $\mathbf{R}_{2,1}$ are proportional: $\mathbf{R}_{2,1} = \alpha\mathbf{R}_{1,1}$ .

(c) Show that when $\mathbf{R}_{1,1}\mathbf{R}_{2,1} = \mathbf{0}$ (orthogonal subspaces), the subspace-projected MMSE estimator achieves $\mathbf{C}_{1,1} \to \mathbf{0}$ as $N_t\to\infty$ .

Show Hint

Contaminated observation: $\mathbf{y}_{1,1} = \sqrt{p_u\tau_p}(\mathbf{h}_{1,1} + \mathbf{h}_{2,1}) + \mathbf{n}$ .

For (b): When $\mathbf{R}_{2,1} = \alpha\mathbf{R}_{1,1}$ , compute $\mathbf{C}_{1,1}$ and show it equals a fixed matrix independent of $N_t$ .

For (c): Use the subspace projection theorem (Theorem thm-subspace-projection-decontamination).

Solution

(a) Contaminated MMSE estimate

$\hat{\mathbf{h}}_{1,1} = \sqrt{p_u\tau_p}\mathbf{R}_{1,1}(p_u\tau_p(\mathbf{R}_{1,1}+\mathbf{R}_{2,1}) + \sigma^2\mathbf{I})^{-1}\mathbf{y}_{1,1}$

(b) No vanishing error when covariances are proportional

With $\mathbf{R}_{2,1} = \alpha\mathbf{R}_{1,1}$ :

$\mathbf{C}_{1,1} = \mathbf{R}_{1,1} - p_u\tau_p\mathbf{R}_{1,1}(p_u\tau_p(1+\alpha)\mathbf{R}_{1,1} + \sigma^2\mathbf{I})^{-1}\mathbf{R}_{1,1}$

As $\sigma^2\to 0$ (high SNR), this approaches:

$\mathbf{C}_{1,1} \to \mathbf{R}_{1,1} - \frac{1}{1+\alpha}\mathbf{R}_{1,1} = \frac{\alpha}{1+\alpha}\mathbf{R}_{1,1} \neq \mathbf{0}$

The error remains of order $\mathbf{R}_{1,1}$ — the contamination floor is proportional to $\alpha/(1+\alpha)$ .

(c) Vanishing error with orthogonal subspaces

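When $\mathbf{R}_{1,1}\mathbf{R}_{2,1} = \mathbf{0}$ , let $\mathbf{P}_{1,1}$ be the orthogonal projector onto the column space of $\mathbf{R}_{1,1}$ . Then $\mathbf{P}_{1,1}\mathbf{h}_{1,1} = \mathbf{h}_{1,1}$ and $\mathbf{P}_{1,1}\mathbf{h}_{2,1} = \mathbf{0}$ almost surely, so the projected observation is:

$\mathbf{P}_{1,1}\mathbf{y}_{1,1} = \sqrt{p_u\tau_p}\mathbf{h}_{1,1} + \mathbf{P}_{1,1}\mathbf{n}$

The MMSE estimator from this clean observation has error covariance (with the inverse taken on the range of $\mathbf{P}_{1,1}$ ):

$\mathbf{C}_{1,1}^{\text{proj}} = \mathbf{R}_{1,1} - p_u\tau_p\mathbf{R}_{1,1}(p_u\tau_p\mathbf{R}_{1,1} + \sigma^2\mathbf{P}_{1,1})^{-1}\mathbf{R}_{1,1}$

As $p_u\tau_p/\sigma^2 \to \infty$ , $\mathbf{C}_{1,1}^{\text{proj}} \to \mathbf{0}$ . $\blacksquare$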
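The contamination floor of part (b) can be checked numerically in the simplest setting. The sketch assumes $\mathbf{R}_{1,1} = \beta\mathbf{I}$ and $\mathbf{R}_{2,1} = \alpha\beta\mathbf{I}$ , so the error covariance is a scalar times the identity; the function name and the parameter `snr` (standing for $p_u\tau_p/\sigma^2$ ) are illustrative.

```python
def contaminated_error(beta: float, alpha: float, snr: float) -> float:
    """Per-antenna error variance for R_11 = beta*I and R_21 = alpha*beta*I."""
    return beta - snr * beta ** 2 / (snr * (1 + alpha) * beta + 1.0)


for snr_db in (0, 20, 40, 60):
    snr = 10 ** (snr_db / 10)
    print(snr_db, "dB:", round(contaminated_error(1.0, 0.5, snr), 4))
```

For $\alpha = 0.5$ the values approach $\alpha/(1+\alpha) = 1/3$ as the SNR grows, while for $\alpha = 0$ (no contamination) they approach zero.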

ex-ch03-05

Medium

Prove that the MMSE estimation error $\tilde{\mathbf{h}}_k = \mathbf{h}_k - \hat{\mathbf{h}}_k^{\text{MMSE}}$ is uncorrelated with both the estimate $\hat{\mathbf{h}}_k^{\text{MMSE}}$ and the received pilot observation $\mathbf{y}_k$ (the orthogonality principle).

Show Hint

The MMSE estimator minimizes MSE, so the gradient with respect to any linear function of $\hat{\mathbf{h}}_k$ must be zero at the optimum.

Show $\mathbb{E}[\tilde{\mathbf{h}}_k \hat{\mathbf{h}}_k^H] = \mathbf{0}$ by substituting $\hat{\mathbf{h}}_k = \mathbf{A}\mathbf{y}_k$ and computing cross-covariance.

Solution

MMSE estimator is linear

The MMSE estimator takes the form $\hat{\mathbf{h}}_k = \mathbf{A}\mathbf{y}_k$ where $\mathbf{A} = \sqrt{p_u\tau_p}\mathbf{R}_k(p_u\tau_p\mathbf{R}_k + \sigma^2\mathbf{I})^{-1}$ .

Orthogonality to observation

$\mathbb{E}[\tilde{\mathbf{h}}_k\mathbf{y}_k^H] = \mathbb{E}[(\mathbf{h}_k - \mathbf{A}\mathbf{y}_k)\mathbf{y}_k^H] = \mathbb{E}[\mathbf{h}_k\mathbf{y}_k^H] - \mathbf{A}\mathbb{E}[\mathbf{y}_k\mathbf{y}_k^H] = \boldsymbol{\Sigma}_{\mathbf{h}_k\mathbf{y}_k} - \mathbf{A}\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k}$

By definition of the LMMSE estimator, $\mathbf{A} = \boldsymbol{\Sigma}_{\mathbf{h}_k\mathbf{y}_k}\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k}^{-1}$ , so the expression equals $\mathbf{0}$ .

Orthogonality to estimate

Since $\hat{\mathbf{h}}_k = \mathbf{A}\mathbf{y}_k$ is a linear function of $\mathbf{y}_k$ :

$\mathbb{E}[\tilde{\mathbf{h}}_k\hat{\mathbf{h}}_k^H] = \mathbb{E}[\tilde{\mathbf{h}}_k\mathbf{y}_k^H]\mathbf{A}^H = \mathbf{0} \cdot \mathbf{A}^H = \mathbf{0}$

$\blacksquare$

This property is critical: it means estimation error and estimate are uncorrelated, which is what makes the "use-and-then-forget" bound (Ch. 4) tight.
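A quick Monte Carlo check of the orthogonality principle, sketched for a single antenna ( $N_t = 1$ ) with $\mathbf{R}_k = \beta$ . The parameter values, the seed, and the helper names are assumptions made for illustration.

```python
import random


def complex_gaussian(rng: random.Random, variance: float) -> complex:
    """Draw one CN(0, variance) sample."""
    s = (variance / 2) ** 0.5
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))


def error_observation_correlation(beta=1.0, p_tau=4.0, sigma2=1.0,
                                  trials=50_000, seed=0) -> complex:
    """Sample average of (h - A*y) * conj(y) with the scalar LMMSE gain A."""
    rng = random.Random(seed)
    gain = (p_tau ** 0.5) * beta / (p_tau * beta + sigma2)
    acc = 0j
    for _ in range(trials):
        h = complex_gaussian(rng, beta)
        y = (p_tau ** 0.5) * h + complex_gaussian(rng, sigma2)
        acc += (h - gain * y) * y.conjugate()
    return acc / trials


print(abs(error_observation_correlation()))   # small, shrinking with more trials
```

Replacing `gain` by any other constant makes the average settle at a nonzero value, which is the scalar version of $\boldsymbol{\Sigma}_{\mathbf{h}_k\mathbf{y}_k} - \mathbf{A}\boldsymbol{\Sigma}_{\mathbf{y}_k\mathbf{y}_k} \neq \mathbf{0}$ .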

ex-ch03-06

Medium

A DFT-based pilot matrix $\boldsymbol{\Phi} \in \mathbb{C}^{K\times K}$ has entries $[\boldsymbol{\Phi}]_{k,n} = e^{j2\pi(k-1)(n-1)/K}/\sqrt{K}$ , forming an orthogonal set of $K$ pilot sequences each of length $K$ .

(a) Show that $\boldsymbol{\Phi}\boldsymbol{\Phi}^H = \mathbf{I}_K$ (mutual pilot orthogonality).

(b) What is the PAPR (peak-to-average power ratio) of each pilot sequence?

(c) Why are Zadoff-Chu sequences preferred over DFT rows in practice for 5G NR?

Show Hint

$(\boldsymbol{\Phi}\boldsymbol{\Phi}^H)_{kj} = \sum_{n=0}^{K-1} e^{j2\pi(k-j)n/K}/K$ . Geometric series.

Each DFT row has constant magnitude $1/\sqrt{K}$ , so PAPR = 1.

Look up Zadoff-Chu: constant amplitude, zero autocorrelation sidelobes.

Solution

(a) Orthogonality

$[\boldsymbol{\Phi}\boldsymbol{\Phi}^H]_{kj} = \frac{1}{K}\sum_{n=1}^{K}e^{j2\pi(k-1)(n-1)/K}e^{-j2\pi(j-1)(n-1)/K} = \frac{1}{K}\sum_{n=0}^{K-1}e^{j2\pi(k-j)n/K} = \delta_{kj}$

by the orthogonality of complex exponentials. So $\boldsymbol{\Phi}\boldsymbol{\Phi}^H = \mathbf{I}_K$ .

(b) PAPR

Each entry has $|[\boldsymbol{\Phi}]_{k,n}| = 1/\sqrt{K}$ — constant across $n$ . Maximum instantaneous power = average power, so $\text{PAPR} = 1$ (0 dB).

DFT rows are constant-amplitude sequences with PAPR = 1.

(c) Zadoff-Chu advantages

Zadoff-Chu sequences have:

Constant amplitude (PAPR = 1, same as DFT)
Ideal cyclic autocorrelation: $|r_{zc}(\tau)| = 0$ for $\tau \neq 0$ — no inter-symbol interference in OFDM
Multiple sequences: different root indices give mutually low-correlation sequences
Robustness to frequency offset: flat spectrum enables timing/frequency estimation

In 5G NR, ZC sequences are used for PRACH preambles and SRS pilots because their cyclic properties enable efficient FFT-based correlation at the receiver.
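The properties in parts (a) to (c) can be verified numerically. The sketch below builds the DFT pilots as defined in the exercise and an odd-length Zadoff-Chu sequence in the common form $x[n] = e^{-j\pi u n(n+1)/N}$ ; the length 13 and root 1 are arbitrary illustrative choices, not values from 5G NR.

```python
import cmath


def dft_pilots(k_users):
    """Rows are the K DFT pilot sequences, each scaled by 1/sqrt(K)."""
    return [[cmath.exp(2j * cmath.pi * k * n / k_users) / k_users ** 0.5
             for n in range(k_users)] for k in range(k_users)]


def gram(rows):
    """Gram matrix Phi Phi^H of the pilot rows."""
    return [[sum(a * b.conjugate() for a, b in zip(r1, r2)) for r2 in rows]
            for r1 in rows]


def papr(seq):
    """Peak power divided by average power."""
    powers = [abs(x) ** 2 for x in seq]
    return max(powers) / (sum(powers) / len(powers))


def zadoff_chu(root, length):
    """Odd-length Zadoff-Chu sequence; root must be coprime with length."""
    return [cmath.exp(-1j * cmath.pi * root * n * (n + 1) / length)
            for n in range(length)]


def cyclic_autocorr(seq, lag):
    """Cyclic autocorrelation sum_n x[n + lag] * conj(x[n])."""
    size = len(seq)
    return sum(seq[(n + lag) % size] * seq[n].conjugate() for n in range(size))


pilots = dft_pilots(8)
zc = zadoff_chu(1, 13)
print(abs(gram(pilots)[0][0]), abs(gram(pilots)[0][3]))
print(papr(pilots[2]), papr(zc))
print([round(abs(cyclic_autocorr(zc, lag)), 6) for lag in range(4)])
```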

ex-ch03-07

Hard

Consider a one-ring covariance model for a ULA with $N_t$ antennas, half-wavelength spacing, mean angle $\theta_0$ , and angular spread $\Delta\theta$ . The covariance entries are approximately:

$[\mathbf{R}]_{mn} = e^{j\pi(m-n)\sin\theta_0} \text{sinc}((m-n)\Delta\theta_\text{rad})$

where $\Delta\theta_\text{rad}$ is the angular spread in radians.

(a) Show that the effective rank $r_k \approx N_t\Delta\theta_\text{rad}/\pi$ .

(b) Two users $k$ (in cell 1) and $k'$ (in cell 2) share a pilot. User $k$ has $\theta_0 = 10°$ , $\Delta\theta = 5°$ . User $k'$ has $\theta_0 = 40°$ , $\Delta\theta = 5°$ . Show that their covariance subspaces are approximately orthogonal for large $N_t$ .

(c) What is the minimum angular separation for exact subspace orthogonality in this model?

Show Hint

Eigenvalues of a Toeplitz matrix with entries $\text{sinc}((m-n)B)$ are approximately the DFT of the generating function — a rectangular window of width $2B$ .

The number of eigenvalues above a threshold $\epsilon$ is approximately $N_t$ times the fraction of the angular spectrum's period that is occupied.

Orthogonality condition: the angular intervals $[\theta_0 - \Delta\theta, \theta_0 + \Delta\theta]$ must not overlap.

Solution

(a) Effective rank from bandwidth

The covariance matrix $\mathbf{R}$ is a Hermitian Toeplitz matrix with generating sequence $r[m] = e^{j\pi m\sin\theta_0}\text{sinc}(m\Delta\theta_\text{rad})$ . Its spectrum is the angular-domain spectrum concentrated in the interval $[\sin\theta_0 - \Delta\theta_\text{rad}/\pi, \sin\theta_0 + \Delta\theta_\text{rad}/\pi]$ .

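The number of significant eigenvalues (Szegő's theorem) is approximately $N_t$ times the occupied width $2\Delta\theta_\text{rad}/\pi$ divided by the full range of $\sin\theta$ , which is 2: $r_k \approx N_t \cdot \frac{2\Delta\theta_\text{rad}/\pi}{2} = \frac{N_t\Delta\theta_\text{rad}}{\pi}$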

(b) Subspace orthogonality check

User $k$ : angular interval $[5°, 15°]$ in degrees, $[0.087, 0.262]$ rad. User $k'$ : angular interval $[35°, 45°]$ in degrees, $[0.611, 0.785]$ rad.

These intervals are disjoint: min separation $= 35° - 15° = 20° \gg 0$ . The covariance spectra are supported on non-overlapping sets, implying approximately orthogonal eigenbases for large $N_t$ . Formally: $\text{tr}(\mathbf{R}_{1,k}\mathbf{R}_{2,k'})/(\|\mathbf{R}_{1,k}\|_F\|\mathbf{R}_{2,k'}\|_F) \to 0$ .

(c) Minimum angular separation

Exact orthogonality requires the angular windows to be disjoint:

$|\theta_1 - \theta_2| > \Delta\theta_1 + \Delta\theta_2$

With $\Delta\theta_1 = \Delta\theta_2 = 5°$ : separation $> 10°$ .

For the specific numbers above: $|10° - 40°| = 30° \gg 10°$ , so the subspaces are well separated. Two users at $10°$ and $18°$ with $5°$ spread each are only $8°$ apart, fail the $> 10°$ condition, and have partial subspace overlap.
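The rank estimate from part (a) of the solution and the disjointness test from part (c) are simple enough to script. The function names below are illustrative.

```python
import math


def effective_rank(n_t: int, spread_deg: float) -> float:
    """Rank estimate N_t * spread_rad / pi used in the solution to part (a)."""
    return n_t * math.radians(spread_deg) / math.pi


def windows_disjoint(theta1_deg, spread1_deg, theta2_deg, spread2_deg) -> bool:
    """True when the intervals [theta - spread, theta + spread] do not overlap."""
    return abs(theta1_deg - theta2_deg) > spread1_deg + spread2_deg


print(effective_rank(256, 5.0))               # rank estimate for a 256-antenna ULA
print(windows_disjoint(10, 5, 40, 5))         # users k and k' from part (b)
print(windows_disjoint(10, 5, 18, 5))         # only 8 degrees apart
```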

ex-ch03-08

Hard

Derive the optimal pilot sequence length $\tau_p^*$ that maximizes the effective spectral efficiency (ESE):

$\text{ESE}(\tau_p) = \frac{\tau_c - \tau_p}{\tau_c} \cdot \log_2\left(1 + N_t \cdot \frac{p_u\tau_p/\sigma^2}{1 + p_u\tau_p/\sigma^2}\right)$

where the logarithm represents an approximation to the per-user rate using the estimated channel (assuming Gaussian channel and MMSE estimation).

(a) Show that the rate term is concave in $\tau_p$ .

(b) Find the first-order necessary condition for $\tau_p^*$ .

(c) For $N_t = 64$ , $\tau_c = 200$ , $K = 10$ users, and high SNR $p_u/\sigma^2 \gg 1$ , what is the asymptotic $\tau_p^*$ ?

Show Hint

Let $\rho = p_u\tau_p/\sigma^2$ . The rate is $\log_2(1 + N_t\rho/(1+\rho))$ , which is concave in $\rho$ (and hence in $\tau_p$ ).

The product $f(\tau_p) = (\tau_c-\tau_p)g(\tau_p)$ is maximized when $-g(\tau_p) + (\tau_c-\tau_p)g'(\tau_p) = 0$ .

At high SNR: $\rho/(1+\rho) \to 1$ , so the rate saturates at $\log_2(1+N_t)$ and the unconstrained optimal pilot fraction $\tau_p^*/\tau_c \to 0$ ; the orthogonality constraint $\tau_p \geq K$ then becomes active.

Solution

(a) Concavity

Let $\rho_p = p_u/\sigma^2$ (fixed). Define $f(\tau_p) = \tau_p\rho_p/(1+\tau_p\rho_p)$ . This is increasing and concave in $\tau_p$ (the first derivative is positive and the second derivative is negative). Since $\log_2(1+N_t x)$ is increasing and concave in $x$ , the rate $g(\tau_p) = \log_2(1+N_t f(\tau_p))$ is also increasing and concave.

The product $(\tau_c-\tau_p)g(\tau_p)$ has second derivative $-2g'(\tau_p) + (\tau_c-\tau_p)g''(\tau_p) < 0$ on $[0, \tau_c]$ , so the ESE is concave there and has a unique maximum.

(b) First-order condition

Setting $d(\text{ESE})/d\tau_p = 0$ :

$-\log_2(1+N_t f(\tau_p)) + (\tau_c-\tau_p) \cdot \frac{N_t f'(\tau_p)}{(1+N_t f(\tau_p))\ln 2} = 0$

where $f'(\tau_p) = \rho_p/(1+\tau_p\rho_p)^2$ . This transcendental equation must be solved numerically for general parameters.

(c) High-SNR asymptotic

At high SNR, $f(\tau_p) \to 1$ quickly and rate saturates at $\log_2(1+N_t)$ . Further pilot investment provides diminishing rate returns while the pre-log factor $(1-\tau_p/\tau_c)$ still decreases linearly.

The optimal $\tau_p^*$ converges to the minimum feasible: $\tau_p^* = K$ (just enough for orthogonal pilots). With $K = 10$ , the pre-log factor is $(1-K/\tau_c) = 190/200 = 95\%$ . The high-SNR ESE is approximately $0.95\log_2(65) \approx 5.72$ bits/s/Hz.
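Since the first-order condition has no closed form, a grid search over integer pilot lengths is a practical alternative. The sketch below assumes $K = 10$ (the value implied by the $190/200$ pre-log factor) and restricts the search to $\tau_p \geq K$ ; the SNR values are illustrative.

```python
import math


def ese(tau_p: int, tau_c: int, n_t: int, snr: float) -> float:
    """Effective spectral efficiency; snr = p_u / sigma^2 per pilot sample."""
    rho = snr * tau_p
    return (1 - tau_p / tau_c) * math.log2(1 + n_t * rho / (1 + rho))


def best_pilot_length(tau_c: int, n_t: int, snr: float, k_users: int) -> int:
    """Integer tau_p in [K, tau_c - 1] that maximizes the ESE."""
    return max(range(k_users, tau_c), key=lambda t: ese(t, tau_c, n_t, snr))


for snr_db in (-30, -10, 20):
    snr = 10 ** (snr_db / 10)
    tau_star = best_pilot_length(200, 64, snr, 10)
    print(snr_db, "dB:", tau_star, round(ese(tau_star, 200, 64, snr), 3))
```

At high SNR the search returns the lower bound $\tau_p = K$ , while at low SNR longer pilots pay off because the estimation quality term is far from saturation.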

ex-ch03-09

Hard

Consider the greedy pilot assignment algorithm. Prove that the greedy algorithm achieves at most a $\tau_p$ -factor approximation of the optimal assignment (in terms of total contamination cost $\sum_{k}\sum_{j\neq k:\phi(j)=\phi(k)}\rho_{k,j}$ ).

Assume the contamination metric $\rho_{k,j}$ is symmetric.

Show Hint

Upper bound the greedy cost per user by the average cost over all pilots.

Compare greedy cost with optimal cost via a relaxation argument.

Solution

Set up notation

Let $C^* = \sum_{k,j:\phi^*(j)=\phi^*(k),j<k}\rho_{k,j}$ be the optimal total cost and $C^G$ be the greedy cost. User $k$ assigned pilot $p$ by greedy incurs cost $C_k^G = \sum_{j:\phi^G(j)=p,j<k}\rho_{k,j}$ .

Bound greedy per-user cost

When assigning user $k$ , greedy picks the pilot minimizing $C_k^G$ . The average cost over all $\tau_p$ pilots is at most $\bar{C}_k = \frac{1}{\tau_p}\sum_p\sum_{j:\phi^G(j)=p}\rho_{k,j}$ . Greedy achieves $C_k^G \leq \bar{C}_k$ .

Bound total greedy cost

Summing over all $k$ :

$C^G = \sum_k C_k^G \leq \sum_k \frac{1}{\tau_p}\sum_{j<k}\rho_{k,j} = \frac{1}{\tau_p}\sum_{k<j}\rho_{k,j}$

Since the optimal assignment can only do better: $C^* \leq C^G \leq \frac{1}{\tau_p}C_{\text{all}}$ ,

where $C_{\text{all}} = \sum_{k<j}\rho_{k,j}$ is the cost when all users share one pilot.

The approximation ratio is at most $\tau_p$ . $\blacksquare$
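The intermediate inequality $C^G \leq C_{\text{all}}/\tau_p$ can be checked empirically. The sketch below implements the greedy rule described in this solution (users visited in index order, each placed on the pilot with the least added cost) on a random symmetric metric; the matrix size, seed, and function names are illustrative assumptions.

```python
import random


def greedy_assign(rho, num_pilots):
    """Assign users in index order to the pilot with the least added cost."""
    pilots = []                                   # pilots[j] = pilot of user j
    total = 0.0
    for k in range(len(rho)):
        cost = [0.0] * num_pilots
        for j, p in enumerate(pilots):            # only earlier users j < k
            cost[p] += rho[k][j]
        best = min(range(num_pilots), key=cost.__getitem__)
        pilots.append(best)
        total += cost[best]
    return pilots, total


def random_symmetric(n, seed=0):
    """Random symmetric contamination metric with zero diagonal."""
    rng = random.Random(seed)
    rho = [[0.0] * n for _ in range(n)]
    for k in range(n):
        for j in range(k):
            rho[k][j] = rho[j][k] = rng.random()
    return rho


rho = random_symmetric(12)
tau_p = 3
_, c_greedy = greedy_assign(rho, tau_p)
c_all = sum(rho[k][j] for k in range(12) for j in range(k))
print(c_greedy, "<=", c_all / tau_p)
```

The inequality holds for every input because the minimum over pilots never exceeds the average over pilots at each step.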

ex-ch03-10

Challenge

Research Extension: The angular-domain representation of a ULA channel is $\mathbf{h} = \mathbf{F}\tilde{\mathbf{h}}$ where $\mathbf{F}$ is the $N_t\times N_t$ DFT matrix and $\tilde{\mathbf{h}}$ is the virtual angular-domain channel.

(a) Show that for the one-ring model, $\tilde{\mathbf{h}}$ is sparse: only $r_k \approx N_t\Delta\theta_\text{rad}/\pi$ entries are nonzero (approximately).

(b) Propose a compressed sensing approach to estimate the sparse $\tilde{\mathbf{h}}$ using fewer pilots than $N_t$ — and state the conditions on the pilot matrix $\boldsymbol{\Phi}$ for recovery to succeed.

(c) What is the minimum pilot length for reliable CS recovery, and how does this compare to the MMSE pilot length of $\tau_p = K$ ?

Show Hint

Sparsity in the angular domain follows from the bandwidth argument in Exercise 7(a).

CS recovery requires the effective sensing matrix $\boldsymbol{\Phi}\mathbf{F}$ to satisfy the restricted isometry property (RIP) with sparsity $r_k$ .

RIP is satisfied with high probability when $\boldsymbol{\Phi}$ is a random Gaussian matrix with $\tau_p = \mathcal{O}(r_k\log(N_t/r_k))$ rows.

Solution

(a) Angular sparsity

From Exercise 7(a): the one-ring covariance has rank $r_k \approx N_t\Delta\theta_\text{rad}/\pi$ . In the DFT domain, $\mathbf{R} = \mathbf{F}\tilde{\mathbf{R}}\mathbf{F}^H$ where $\tilde{\mathbf{R}}$ is approximately diagonal with about $r_k$ significant diagonal entries (the angular window).

Therefore $\tilde{\mathbf{h}} = \mathbf{F}^H\mathbf{h}$ is supported on $r_k$ angular bins — approximately $r_k$ -sparse.

(b) CS estimation

Pilot observation, with the pilot matrix $\boldsymbol{\Phi} \in \mathbb{C}^{\tau_p\times N_t}$ applied to the channel: $\mathbf{y}_k = \sqrt{p_u}\boldsymbol{\Phi}\mathbf{h}_k + \mathbf{n} = \sqrt{p_u}\boldsymbol{\Psi}\tilde{\mathbf{h}}_k + \mathbf{n}$

where $\boldsymbol{\Psi} = \boldsymbol{\Phi}\mathbf{F}$ is the effective sensing matrix. CS recovers $\tilde{\mathbf{h}}_k$ if $\boldsymbol{\Psi}$ satisfies RIP with sparsity $r_k$ .

Random Gaussian $\boldsymbol{\Phi}$ gives a random $\boldsymbol{\Psi}$ that satisfies RIP with high probability when $\tau_p = \mathcal{O}(r_k\log(N_t/r_k))$ .

(c) Pilot length comparison

CS requirement: $\tau_p = \mathcal{O}(r_k\log(N_t/r_k))$
Standard MMSE: $\tau_p = K$ (orthogonal pilots, one per user)

For $N_t = 128$ , $r_k = 8$ , $K = 10$ : CS needs $\tau_p \approx 8\log_2(16) = 32$ (taking the hidden constant as 1 and the logarithm to base 2). Standard MMSE needs $\tau_p = 10$ .

CS is actually worse than standard MMSE for the intra-cell estimation problem! The CS advantage appears in compressed feedback scenarios (FDD massive MIMO, Ch. 8) where the channel dimension $N_t$ is large and the sparsity enables compression below the Nyquist dimension.
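The comparison in part (c) can be tabulated for other array sizes. The sketch treats the $\mathcal{O}(\cdot)$ scaling as an equality with a hypothetical constant of 1 and a base-2 logarithm, so the numbers are order-of-magnitude guides only.

```python
import math


def cs_pilot_length(n_t: int, rank: int, constant: float = 1.0) -> float:
    """Order-of-magnitude CS pilot length: constant * r * log2(N_t / r)."""
    return constant * rank * math.log2(n_t / rank)


k_users = 10                       # MMSE with orthogonal pilots needs tau_p = K
for n_t in (64, 128, 256):
    print(n_t, cs_pilot_length(n_t, 8), "vs", k_users)
```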

ex-ch03-11

Medium

Consider the pilot reuse factor $f \in \{1,3,7\}$ in a hexagonal cell layout.

(a) With reuse factor $f$ , what fraction of cells use the same pilot pool?

(b) Write the effective per-user rate as a function of $f$ , assuming the SINR contamination floor scales as $1/(L/f-1)$ , where $L/f - 1$ is the number of co-pilot cells, and the pilot overhead scales as $fK/\tau_c$ .

(c) Find the $f$ that maximizes the effective rate for a 7-cell system.

Show Hint

Reuse factor $f$ : only $L/f$ cells share the same pilot pool in a 7-cell system.

SINR floor: higher $f$ reduces number of co-pilot cells, raising the floor.

The pre-log factor decreases with $f$ because a larger reuse factor needs a larger pilot pool: $\tau_p = fK$ samples of each coherence interval go to pilots.

Solution

(a) Fraction sharing pilots

With reuse factor $f$ , the pilot pool is split into $f$ groups. Each cell uses one group. In a 7-cell cluster, each group covers $7/f$ cells. The fraction of cells sharing the same pilot pool is $1/f$ .

$f=1$ : all 7 cells share one pool (universal reuse, maximum contamination)
$f=7$ : each cell has a unique pool (no reuse, maximum pilot overhead)

(b) Effective rate model

Pilot overhead per cell increases with $f$ (need more pilots): $\tau_p = fK$ , so pre-log factor: $(1 - fK/\tau_c)$ .

Co-pilot interferers: $(L/f - 1)$ cells, so SINR floor $\propto 1/(L/f - 1)$ .

Rate model (simplified): $R(f) \approx (1 - fK/\tau_c)\log_2(1 + c/(L/f - 1))$ for some constant $c$ depending on $N_t$ , SNR, and covariance structure.

(c) Optimal reuse factor

For 7-cell: $L = 7$ , $K = 10$ , $\tau_c = 200$ :

$f=1$ : $\tau_p = 10$ , pre-log $= 0.95$ , 6 co-pilot cells, low SINR floor
$f=3$ : $\tau_p = 30$ , pre-log $= 0.85$ , 1.33 co-pilot cells, medium floor
$f=7$ : $\tau_p = 70$ , pre-log $= 0.65$ , 0 co-pilot cells (no contamination!)

At high SNR with spatial correlation, $f=7$ often wins due to zero contamination. At low SNR, $f=1$ wins since pilot efficiency (high pre-log) dominates.
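The trade-off can be explored with the simplified rate model from part (b). Because that model has an infinite SINR at $f = 7$ , the sketch adds a noise-limited cap `snr_noise` so that all three cases stay finite; this cap and the constant `c = 10` are hypothetical choices for illustration, not values from the exercise.

```python
import math


def reuse_rate(f, num_cells=7, k_users=10, tau_c=200, c=10.0, snr_noise=100.0):
    """Simplified per-user rate for pilot reuse factor f."""
    prelog = 1 - f * k_users / tau_c              # pilot overhead tau_p = f*K
    co_pilot_cells = num_cells / f - 1            # interfering co-pilot cells
    # Contamination and noise combine like parallel limits on the SINR.
    sinr = 1.0 / (co_pilot_cells / c + 1.0 / snr_noise)
    return prelog * math.log2(1 + sinr)


for snr_noise in (0.1, 1000.0):
    rates = {f: round(reuse_rate(f, snr_noise=snr_noise), 3) for f in (1, 3, 7)}
    print(snr_noise, rates, "best f =", max(rates, key=rates.get))
```

Under these assumptions the low-SNR case favors $f = 1$ and the high-SNR case favors $f = 7$ , matching the qualitative conclusion above.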

ex-ch03-12

Medium

The pilot contamination precoding (PCP) scheme of Ashikhmin and Marzetta (2012) works as follows: each base station uses the contaminated estimate $\hat{\mathbf{H}}_k^{\text{cont}}$ (which includes contributions from co-pilot users in other cells) to design precoding vectors that deliberately pre-cancel interference at the co-pilot users.

(a) Explain intuitively why a contaminated estimate can be useful for inter-cell interference cancellation in the downlink.

(b) If base station 1 transmits $\mathbf{x}_1 = \hat{\mathbf{h}}_{1,k}^{\text{cont}}s_1$ and base station 2 transmits $\mathbf{x}_2 = \hat{\mathbf{h}}_{2,k}^{\text{cont}}s_2$ , what does user $k$ in cell 1 receive from both base stations?

(c) Under what condition on $s_1, s_2$ does this eliminate inter-cell interference?

Show Hint

The contaminated estimate $\hat{\mathbf{h}}_{1,k}^\text{cont}$ contains information about $\mathbf{h}_{2,k}$ (the co-pilot user's channel).

User $k$ in cell 1 receives $y_k = \mathbf{h}_{1,k}^H\mathbf{x}_1 + \mathbf{h}_{2,k}^H\mathbf{x}_2 + n$ .

Solution

(a) Intuition

The contaminated estimate $\hat{\mathbf{h}}_{1,k}^{\text{cont}} \approx \mathbf{h}_{1,k} + \mathbf{h}_{2,k}$ (simplified 2-cell case). BS 1 "knows" something about $\mathbf{h}_{2,k}$ through contamination — it can exploit this information to coordinate with BS 2.

(b) Received signal

$y_k = \mathbf{h}_{1,k}^H\hat{\mathbf{h}}_{1,k}^{\text{cont}}s_1 + \mathbf{h}_{2,k}^H\hat{\mathbf{h}}_{2,k}^{\text{cont}}s_2 + n$

As $N_t \to \infty$ : $\frac{1}{N_t}\mathbf{h}_{\ell,k}^H\hat{\mathbf{h}}_{\ell,k}^{\text{cont}} \to$ a deterministic constant $c_\ell$ .

(c) Interference cancellation condition

Setting $s_2 = -c_1/c_2 \cdot s_1$ would cancel interference, but this reduces user 2's signal. The PCP idea instead chooses $s_1, s_2$ to jointly maximize sum rate across both cells — a multi-cell DPC-like approach. The key insight is that the "side information" about the other cell's channel, obtained via contamination, can be exploited for coordinated precoding.

ex-ch03-13

Easy

Suppose a cell has $K = 5$ users and the contamination metric matrix (between users in this cell and users in a neighboring cell) is:

$\boldsymbol{\rho} = \begin{bmatrix} 0.8 & 0.1 & 0.05 & 0.3 & 0.6 \\ 0.1 & 0.9 & 0.2 & 0.1 & 0.05 \\ 0.05& 0.2 & 0.7 & 0.8 & 0.1 \\ 0.3 & 0.1 & 0.8 & 0.6 & 0.2 \\ 0.6 & 0.05& 0.1 & 0.2 & 0.85 \end{bmatrix}$

Row $k$ , column $j$ = contamination if users $k$ and $j$ share a pilot ( $\tau_p = 3$ ). Using greedy assignment, find the pilot assignments that minimize total contamination.

Show Hint

Sort users by their maximum contamination (most problematic first).

Greedily assign the least-contaminating available pilot to each user.

Solution

Identify worst contamination pairs

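Only off-diagonal entries contribute to the cost when two distinct users share a pilot. The worst pairs are $(3,4)$ with $\rho_{3,4}=0.8$ and $(1,5)$ with $\rho_{1,5}=0.6$ , so each of these pairs must be split across pilots. Users 2 and 5 have the largest diagonal entries ( $0.9$ and $0.85$ ) and are treated as the most problematic, so they are assigned early.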

Greedy assignment (3 pilots)

User 1 → Pilot A (first user, no cost)
User 2 → Pilot B (empty pilot, zero cost; sharing A with user 1 would cost $\rho_{2,1}=0.1$ )
User 5 → Pilot C (empty pilot, zero cost; avoids $\rho_{5,1}=0.6$ )
User 3 → Pilot A ( $\rho_{3,1}=0.05$ is the smallest cost; B would cost $\rho_{3,2}=0.2$ and C would cost $\rho_{3,5}=0.1$ )
User 4 → Pilot B ( $\rho_{4,2}=0.1$ ; A would cost $\rho_{4,1}+\rho_{4,3}=1.1$ and C would cost $\rho_{4,5}=0.2$ )

Assignment: $\{1,3\}\to A$ , $\{2,4\}\to B$ , $\{5\}\to C$ . Total contamination: $\rho_{1,3} + \rho_{2,4} = 0.05 + 0.1 = 0.15$ , and the worst pair $(3,4)$ with $0.8$ is avoided.
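A sketch of the greedy rule from the hint, applied to the matrix above. It visits users in the order 1, 2, 5, 3, 4, counts only off-diagonal entries, and breaks ties by taking the first pilot with the lowest cost; the visiting order and the tie-breaking rule are assumptions, and a different order can give a different assignment.

```python
RHO = [
    [0.80, 0.10, 0.05, 0.30, 0.60],
    [0.10, 0.90, 0.20, 0.10, 0.05],
    [0.05, 0.20, 0.70, 0.80, 0.10],
    [0.30, 0.10, 0.80, 0.60, 0.20],
    [0.60, 0.05, 0.10, 0.20, 0.85],
]


def greedy(rho, order, num_pilots):
    """Place each user on the pilot whose current members cost the least."""
    groups = [[] for _ in range(num_pilots)]
    total = 0.0
    for user in order:
        costs = [sum(rho[user][other] for other in g) for g in groups]
        best = costs.index(min(costs))        # first pilot with the lowest cost
        total += costs[best]
        groups[best].append(user)
    return groups, total


groups, total = greedy(RHO, order=[0, 1, 4, 2, 3], num_pilots=3)
print([[u + 1 for u in g] for g in groups], round(total, 2))
# prints [[1, 3], [2, 4], [5]] 0.15
```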

ex-ch03-14

Hard

MMSE rate bound with pilot contamination. Using the "use-and-then-forget" (UatF) approach (to be derived in Chapter 4), the per-user uplink rate with contaminated MMSE estimates and MRC combining is bounded below by:

$R_k^{\text{UatF}} = \log_2\left(1 + \frac{p_u|{\mathbb{E}[\hat{\mathbf{h}}_{1,k}^H\mathbf{h}_{1,k}]}|^2} {p_u\sum_{\ell,j}|\mathbb{E}[\hat{\mathbf{h}}_{1,k}^H\mathbf{h}_{\ell,j}]|^2 - p_u|{\mathbb{E}[\hat{\mathbf{h}}_{1,k}^H\mathbf{h}_{1,k}]}|^2 + \sigma^2\mathbb{E}[\|\hat{\mathbf{h}}_{1,k}\|^2]}\right)$

For a two-cell system ( $L=2$ ) with $K=1$ user per cell and i.i.d. channels $\mathbf{R}_{1,1} = \mathbf{R}_{2,1} = \beta\mathbf{I}$ :

(a) Compute the numerator: $|\mathbb{E}[\hat{\mathbf{h}}_{1,1}^H\mathbf{h}_{1,1}]|^2$ .

(b) Compute the contamination interference: $p_u|\mathbb{E}[\hat{\mathbf{h}}_{1,1}^H\mathbf{h}_{2,1}]|^2$ .

(c) Show that as $N_t\to\infty$ , the SINR converges to 1, that is 0 dB (an SINR floor independent of $N_t$ ).

Show Hint

At high SNR ( $\sigma^2 \to 0$ ), the contaminated MMSE estimate is $\hat{\mathbf{h}}_{1,1} = \mathbf{y}_{1,1}/(2\sqrt{p_u\tau_p})$ when $\mathbf{R}_{1,1}=\mathbf{R}_{2,1}=\beta\mathbf{I}$ .

Compute $\mathbb{E}[\hat{\mathbf{h}}_{1,1}^H\mathbf{h}_{1,1}]$ and $\mathbb{E}[\hat{\mathbf{h}}_{1,1}^H\mathbf{h}_{2,1}]$ using the linearity of the estimator.

Solution

Contaminated MMSE estimator

With $\mathbf{R}_{1,1}=\mathbf{R}_{2,1}=\beta\mathbf{I}$ :

$\hat{\mathbf{h}}_{1,1} = \frac{\sqrt{p_u\tau_p}\beta\mathbf{I}}{2p_u\tau_p\beta + \sigma^2}\mathbf{y}_{1,1} \triangleq \alpha\mathbf{y}_{1,1}$

(a) Signal term

$\mathbb{E}[\hat{\mathbf{h}}_{1,1}^H\mathbf{h}_{1,1}] = \alpha\mathbb{E}[\mathbf{y}_{1,1}^H\mathbf{h}_{1,1}] = \alpha\sqrt{p_u\tau_p}\beta N_t$

(squaring gives the numerator)

(b) Contamination term

$\mathbb{E}[\hat{\mathbf{h}}_{1,1}^H\mathbf{h}_{2,1}] = \alpha\mathbb{E}[\mathbf{y}_{1,1}^H\mathbf{h}_{2,1}] = \alpha\sqrt{p_u\tau_p}\beta N_t$

Same magnitude! Both signal and contamination grow as $N_t$ with the same coefficient.

(c) SINR floor

Both numerator and contamination scale as $N_t^2$ , so the SINR converges to:

$\text{SINR}^{\infty} = \frac{|\alpha\sqrt{p_u\tau_p}\beta N_t|^2}{|\alpha\sqrt{p_u\tau_p}\beta N_t|^2} = 1$

(plus noise term which becomes negligible as $N_t\to\infty$ ).

With only one contaminator and identical covariances, the floor is SINR = 1 (0 dB). For $L$ co-pilot cells: SINR $^\infty = 1/(L-1)$ . $\blacksquare$
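The approach to the floor can be traced by evaluating the bound as printed in the exercise with the signal and contamination terms derived above. The noise term uses $\mathbb{E}[\|\hat{\mathbf{h}}_{1,1}\|^2] = \alpha^2(2p_u\tau_p\beta + \sigma^2)N_t$ , which follows from $\hat{\mathbf{h}}_{1,1} = \alpha\mathbf{y}_{1,1}$ . Default parameter values and the function name are illustrative.

```python
def uatf_sinr(n_t, p_u=1.0, tau_p=1.0, beta=1.0, sigma2=1.0):
    """SINR of the two-cell, one-user-per-cell bound with R = beta * I."""
    p_tau = p_u * tau_p
    alpha = (p_tau ** 0.5) * beta / (2 * p_tau * beta + sigma2)
    signal = p_u * (alpha * (p_tau ** 0.5) * beta * n_t) ** 2
    contamination = signal                     # identical covariances
    est_power = alpha ** 2 * (2 * p_tau * beta + sigma2) * n_t   # E||h_hat||^2
    return signal / (contamination + sigma2 * est_power)


for n_t in (8, 64, 512, 4096):
    print(n_t, round(uatf_sinr(n_t), 4))
```

The noise term grows only linearly in $N_t$ while signal and contamination grow quadratically, so the values rise monotonically toward 1.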

ex-ch03-15

Challenge

Simulation design. Design a Monte Carlo simulation to verify the pilot contamination SINR floor prediction from Theorem thm-sinr-floor.

(a) Specify the simulation setup: $L$ , $K$ , range of $N_t$ , channel model, pilot assignment, combining.

(b) Write pseudocode for the Monte Carlo simulation.

(c) What convergence behavior do you expect for the SINR as $N_t$ increases, and how many Monte Carlo trials are needed for accurate estimation at $N_t = 512$ for $\pm 0.1$ dB accuracy?

Show Hint

Generate $\mathbf{h}_{\ell,k} \sim \mathcal{CN}(\mathbf{0}, \mathbf{R}_{\ell,k})$ for all cells and users.

Compute contaminated estimates using the MMSE formula.

MRC receive: $r_k = \hat{\mathbf{h}}_{1,k}^H\mathbf{y}_{\text{ul}}$ . Compute SINR per trial, average.

Solution

(a) Simulation setup

$L = 7$ hexagonal cells, $K = 10$ users per cell
Channel model: one-ring with $\beta_{\ell k} = 1$ , angular spread $\Delta\theta = 5°$
$N_t \in \{8, 16, 32, 64, 128, 256, 512\}$
Universal pilot reuse ( $\tau_p = K = 10$ )
MRC combining using contaminated MMSE estimate

(b) Pseudocode

sinr_vs_nt = []
for each N_t in range:
  sinr_trials = []
  for trial = 1:N_mc:
    Generate h[l,k] ~ CN(0, R[l,k]) for all l,k
    Compute contaminated observations y[k] = sqrt(p*tau_p)*sum_l(h[l,k]) + n
    Compute MMSE estimates h_hat[k] from y[k]
    For user k in cell 1: signal = p*|h_hat[k]^H h[1,k]|^2
    interference = p*sum over (l,j) != (1,k) of |h_hat[k]^H h[l,j]|^2
    SINR_k = signal / (interference + sigma^2*||h_hat[k]||^2)
    sinr_trials.append(SINR_k)
  sinr_vs_nt.append(mean(sinr_trials))
Plot sinr_vs_nt vs N_t (should plateau at SINR_floor)
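A runnable but heavily simplified version of this loop is sketched below. It assumes $L = 2$ cells, $K = 1$ user per cell, and i.i.d. Rayleigh channels with $\beta = 1$ instead of the seven-cell one-ring setup, and it reports the median rather than the mean SINR because the per-trial ratio is heavy-tailed for small $N_t$ . All parameter values and names are illustrative.

```python
import random
import statistics


def cn_vector(rng, n, variance=1.0):
    """Vector of n i.i.d. CN(0, variance) samples."""
    s = (variance / 2) ** 0.5
    return [complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(n)]


def inner(a, b):
    """Inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def median_sinr(n_t, trials=300, p_u=1.0, tau_p=10, sigma2=0.1, seed=0):
    """Median per-trial MRC SINR with a contaminated MMSE estimate."""
    rng = random.Random(seed)
    g = (p_u * tau_p) ** 0.5
    alpha = g / (2 * p_u * tau_p + sigma2)     # MMSE gain for R = I in both cells
    samples = []
    for _ in range(trials):
        h1, h2 = cn_vector(rng, n_t), cn_vector(rng, n_t)
        noise = cn_vector(rng, n_t, sigma2)
        h_hat = [alpha * (g * (a + b) + w) for a, b, w in zip(h1, h2, noise)]
        signal = p_u * abs(inner(h_hat, h1)) ** 2
        interference = p_u * abs(inner(h_hat, h2)) ** 2
        noise_term = sigma2 * inner(h_hat, h_hat).real
        samples.append(signal / (interference + noise_term))
    return statistics.median(samples)


for n_t in (8, 32, 128):
    print(n_t, round(median_sinr(n_t), 3))
```

In this two-cell setting the values should settle near 1 (0 dB), the floor for a single contaminator with identical covariances.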

(c) Convergence and accuracy

SINR converges to the floor as $N_t \to \infty$ . For large $N_t$ , self-averaging reduces variance — the law of large numbers over $N_t$ antenna elements.

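For $\pm 0.1$ dB ( $\approx 2.3\%$ relative SINR accuracy, since $10^{0.01} \approx 1.023$ ), by the central limit theorem the number of trials needed is $N_{\text{mc}} \approx (Z_{0.975}\sigma_{\text{SINR}}/0.023)^2$ , where $\sigma_{\text{SINR}}$ is the per-trial standard deviation relative to the mean. With $\sigma_{\text{SINR}} \approx 0.5$ (typical): $N_{\text{mc}} \approx (1.96 \times 0.5/0.023)^2 \approx 1800$ trials.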
