Exercises
ex-ch03-01
EasyA massive MIMO system operates with coherence interval samples, serves users per cell, and devotes fraction to pilot training.
(a) What is the minimum pilot fraction to allow orthogonal pilots?
(b) If (equal uplink/downlink), what fraction of resources are available for data transmission?
(c) If doubles to 30, what is the new minimum pilot fraction?
Orthogonal pilots in require .
Total resources: . Data fraction = .
(a) Minimum pilot fraction
Minimum pilot length: .
(b) Data fraction
Data samples: .
Data fraction: .
(c) With doubled users
, so .
Doubling users doubles the pilot overhead and reduces data to 90%.
ex-ch03-02
EasyA base station with antennas estimates user channels via uplink pilots. The user's spatial covariance matrix is (i.i.d. Rayleigh channel), with and pilot SNR dB.
(a) Compute the LS estimation MSE.
(b) Compute the MMSE estimation MSE. Why does MMSE provide no gain over LS here?
(c) What happens to the MSE comparison if has effective rank ?
For , all eigenvalues equal .
Convert dB to linear: .
(a) LS MSE
.
(b) MMSE MSE for i.i.d. channel
All eigenvalues :
MMSE does better than LS by factor β a modest gain. When all eigenvalues are equal, MMSE reduces to a scalar shrinkage , providing no subspace gain.
(c) Low-rank channel
With equal eigenvalues :
MMSE gain: dB. The gain scales with .
ex-ch03-03
MediumDerive the MMSE channel estimator for the case where the pilot sequences are non-orthogonal: where for .
(a) Write the observation model after correlating with user 's pilot.
(b) What "effective noise" does user 's non-orthogonal pilot create?
(c) Derive the MMSE estimator and its error covariance for this case.
After correlating , terms with give β correlated interference.
The observation model is .
Apply LMMSE: cross-covariance , observation covariance .
Observation model
\mathbf{n}_k = \mathbf{N}_p\boldsymbol{\phi}k^*/\sqrt{\tau_p} \sim \mathcal{CN}(\mathbf{0},\sigma^2\mathbf{I})|c{kj}|$.
Compute covariances
Cross-covariance:
Observation covariance:
LMMSE formula
\mathbf{C}_k = \mathbf{R}_k - p_u\tau_p\mathbf{R}k\boldsymbol{\Sigma}{\mathbf{y}_k\mathbf{y}_k}^{-1}\mathbf{R}k\boldsymbol{\Sigma}{\mathbf{y}_k\mathbf{y}k}c{kj} = 0j\neq k\blacksquare$
ex-ch03-04
MediumIn a two-cell system (), each cell has one user () and both share the same pilot. User 1 has covariance and user 2 (in cell 2) has covariance .
(a) Write the MMSE estimate of from the contaminated observation.
(b) Show that the estimation error covariance does NOT vanish as when and are proportional: .
(c) Show that when (orthogonal subspaces), the subspace-projected MMSE estimator achieves as .
Contaminated observation: .
For (b): When , compute and show it equals a fixed matrix independent of .
For (c): Use the subspace projection theorem (Theorem thm-subspace-projection-decontamination).
(a) Contaminated MMSE estimate
$
(b) No vanishing error when covariances are proportional
With :
As (high SNR), this approaches:
The error remains of order β the contamination floor is proportional to .
(c) Vanishing error with orthogonal subspaces
When , the projected observation is:
The MMSE estimator from this clean observation has error covariance:
As , .
ex-ch03-05
MediumProve that the MMSE estimation error is uncorrelated with both the estimate and the received pilot observation (the orthogonality principle).
The MMSE estimator minimizes MSE, so the gradient with respect to any linear function of must be zero at the optimum.
Show by substituting and computing cross-covariance.
MMSE estimator is linear
The MMSE estimator takes the form where .
Orthogonality to observation
\mathbf{A} = \boldsymbol{\Sigma}_{\mathbf{h}_k\mathbf{y}k}\boldsymbol{\Sigma}{\mathbf{y}_k\mathbf{y}_k}^{-1}\mathbf{0}$.
Orthogonality to estimate
Since is a linear function of :
This property is critical: it means estimation error and estimate are uncorrelated, which is what makes the "use-and-then-forget" bound (Ch. 4) tight.
ex-ch03-06
MediumA DFT-based pilot matrix has entries , forming an orthogonal set of pilot sequences each of length .
(a) Show that (mutual pilot orthogonality).
(b) What is the PAPR (peak-to-average power ratio) of each pilot sequence?
(c) Why are Zadoff-Chu sequences preferred over DFT rows in practice for 5G NR?
. Geometric series.
Each DFT row has constant magnitude , so PAPR = 1.
Look up Zadoff-Chu: constant amplitude, zero autocorrelation sidelobes.
(a) Orthogonality
\boldsymbol{\Phi}\boldsymbol{\Phi}^H = \mathbf{I}_K$.
(b) PAPR
Each entry has β constant across . Maximum instantaneous power = average power, so (0 dB).
DFT rows are constant-amplitude sequences with PAPR = 1.
(c) Zadoff-Chu advantages
Zadoff-Chu sequences have:
- Constant amplitude (PAPR = 1, same as DFT)
- Ideal cyclic autocorrelation: for β no inter-symbol interference in OFDM
- Multiple sequences: different root indices give mutually low-correlation sequences
- Robustness to frequency offset: flat spectrum enables timing/frequency estimation
In 5G NR, ZC sequences are used for PRACH preambles and SRS pilots because their cyclic properties enable efficient FFT-based correlation at the receiver.
ex-ch03-07
HardConsider a one-ring covariance model for a ULA with antennas, half-wavelength spacing, mean angle , and angular spread . The covariance entries are approximately:
where is the angular spread in radians.
(a) Show that the effective rank .
(b) Two users (in cell 1) and (in cell 2) share a pilot. User has , . User has , . Show that their covariance subspaces are approximately orthogonal for large .
(c) What is the minimum angular separation for exact subspace orthogonality in this model?
Eigenvalues of a Toeplitz matrix with entries are approximately the DFT of the generating function β a rectangular window of width .
The number of eigenvalues above a threshold equals the bandwidth of the angular spectrum.
Orthogonality condition: the angular intervals must not overlap.
(a) Effective rank from bandwidth
The covariance matrix is a Hermitian Toeplitz matrix with generating sequence . Its spectrum is the angular-domain spectrum concentrated in the interval .
The number of significant eigenvalues (Szego's theorem) is approximately:
(b) Subspace orthogonality check
User : angular interval in degrees, rad. User : angular interval in degrees, rad.
These intervals are disjoint: min separation . The covariance spectra are supported on non-overlapping sets, implying approximately orthogonal eigenbases for large . Formally: .
(c) Minimum angular separation
Exact orthogonality requires the angular windows to be disjoint:
With : separation .
For the specific numbers above: β well orthogonal. Two users at and with spread each would barely satisfy the condition and would have partial subspace overlap.
ex-ch03-08
HardDerive the optimal pilot sequence length that maximizes the effective spectral efficiency (ESE):
where the logarithm represents an approximation to the per-user rate using the estimated channel (assuming Gaussian channel and MMSE estimation).
(a) Show that the rate term is concave in .
(b) Find the first-order necessary condition for .
(c) For , , and high SNR , what is the asymptotic ?
Let . The rate is , which is concave in (and hence in ).
The product is maximized when .
At high SNR: , so the rate saturates at and the optimal pilot fraction .
(a) Concavity
Let (fixed). Define . This is increasing and concave in (positive second derivative is negative).
The product is the product of a decreasing linear function and a concave increasing function β jointly concave, so a unique maximum exists.
(b) First-order condition
Setting :
where . This transcendental equation must be solved numerically for general parameters.
(c) High-SNR asymptotic
At high SNR, quickly and rate saturates at . Further pilot investment provides diminishing rate returns while the pre-log factor still decreases linearly.
The optimal converges to the minimum feasible: (just enough for orthogonal pilots). Pre-log factor: . The high-SNR ESE is approximately bits/s/Hz.
ex-ch03-09
HardConsider the greedy pilot assignment algorithm. Prove that the greedy algorithm achieves at most a -factor approximation of the optimal assignment (in terms of total contamination cost ).
Assume the contamination metric is symmetric.
Upper bound the greedy cost per user by the average cost over all pilots.
Compare greedy cost with optimal cost via a relaxation argument.
Set up notation
Let be the optimal total cost and be the greedy cost. User assigned pilot by greedy incurs cost .
Bound greedy per-user cost
When assigning user , greedy picks the pilot minimizing . The average cost over all pilots is at most . Greedy achieves .
Bound total greedy cost
Summing over all :
Since the optimal assignment can only do better: ,
where is the cost when all users share one pilot.
The approximation ratio is at most .
ex-ch03-10
ChallengeResearch Extension: The angular-domain representation of a ULA channel is where is the DFT matrix and is the virtual angular-domain channel.
(a) Show that for the one-ring model, is sparse: only entries are nonzero (approximately).
(b) Propose a compressed sensing approach to estimate the sparse using fewer pilots than β and state the conditions on the pilot matrix for recovery to succeed.
(c) What is the minimum pilot length for reliable CS recovery, and how does this compare to the MMSE pilot length of ?
Sparsity in the angular domain follows from the bandwidth argument in Exercise 7(a).
CS recovery requires the pilot matrix to satisfy the restricted isometry property (RIP) with sparsity .
RIP is satisfied with high probability when is a random Gaussian matrix with rows.
(a) Angular sparsity
From Exercise 7(a): the one-ring covariance has rank . In the DFT domain, where is approximately block-diagonal with nonzero entries (the angular window).
Therefore is supported on angular bins β approximately -sparse.
(b) CS estimation
Pilot observation:
Wait β pilot matrix applied to : actually observation model for CS is:
where is the effective sensing matrix. CS recovers if satisfies RIP with sparsity .
Random Gaussian gives a random that satisfies RIP with high probability when .
(c) Pilot length comparison
- CS requirement:
- Standard MMSE: (orthogonal pilots, one per user)
For , , : CS needs . Standard MMSE needs .
CS is actually worse than standard MMSE for the intra-cell estimation problem! The CS advantage appears in compressed feedback scenarios (FDD massive MIMO, Ch. 8) where the channel dimension is large and the sparsity enables compression below the Nyquist dimension.
ex-ch03-11
MediumConsider the pilot reuse factor in a hexagonal cell layout.
(a) With reuse factor , what fraction of cells use the same pilot pool?
(b) Write the effective per-user rate as a function of , assuming the SINR contamination floor scales as and pilot overhead scales as .
(c) Find the that maximizes the effective rate for a 7-cell system.
Reuse factor : only cells share the same pilot pool in a 7-cell system.
SINR floor: higher reduces number of co-pilot cells, raising the floor.
Pre-log factor decreases with because fewer pilots are available per coherence interval... wait, reuse means larger pilot pool, which means more pilots per cell.
(a) Fraction sharing pilots
With reuse factor , the pilot pool is split into groups. Each cell uses one group. In a 7-cell cluster, each group covers cells. The fraction of cells sharing the same pilot pool is .
- : all 7 cells share one pool (universal reuse, maximum contamination)
- : each cell has a unique pool (no reuse, maximum pilot overhead)
(b) Effective rate model
Pilot overhead per cell increases with (need more pilots): , so pre-log factor: .
Co-pilot interferers: cells, so SINR floor .
Rate model (simplified): for some constant depending on , SNR, and covariance structure.
(c) Optimal reuse factor
For 7-cell: , , :
- : , pre-log , 6 co-pilot cells, low SINR floor
- : , pre-log , 1.33 co-pilot cells, medium floor
- : , pre-log , 0 co-pilot cells (no contamination!)
At high SNR with spatial correlation, often wins due to zero contamination. At low SNR, wins since pilot efficiency (high pre-log) dominates.
ex-ch03-12
MediumThe pilot contamination precoding (PCP) scheme of Ashikhmin and Marzetta (2012) works as follows: each base station uses the contaminated estimate (which includes contributions from co-pilot users in other cells) to design precoding vectors that deliberately pre-cancel interference at the co-pilot users.
(a) Explain intuitively why a contaminated estimate can be useful for inter-cell interference cancellation in the downlink.
(b) If base station 1 transmits and base station 2 transmits , what does user in cell 1 receive from both base stations?
(c) Under what condition on does this eliminate inter-cell interference?
The contaminated estimate contains information about (the co-pilot user's channel).
User in cell 1 receives .
(a) Intuition
The contaminated estimate (simplified 2-cell case). BS 1 "knows" something about through contamination β it can exploit this information to coordinate with BS 2.
(b) Received signal
N_t \to \infty\frac{1}{N_t}\mathbf{h}{\ell,k}^H\hat{\mathbf{h}}{\ell,k}^{\text{cont}} \to c_\ell$.
(c) Interference cancellation condition
Setting would cancel interference, but this reduces user 2's signal. The PCP idea instead chooses to jointly maximize sum rate across both cells β a multi-cell DPC-like approach. The key insight is that the "side information" about the other cell's channel, obtained via contamination, can be exploited for coordinated precoding.
ex-ch03-13
EasySuppose a cell has users and the contamination metric matrix (between users in this cell and users in a neighboring cell) is:
Row , column = contamination if users and share a pilot (). Using greedy assignment, find the pilot assignments that minimize total contamination.
Sort users by their maximum contamination (most problematic first).
Greedily assign the least-contaminating available pilot to each user.
Identify worst contamination pairs
High contamination pairs: , , , , . Focus on: with 0.9 and 0.85, and both high at 0.7/0.8.
Greedy assignment (3 pilots)
- User 1 β Pilot A
- User 2 β Pilot B (would conflict with user 1 if assigned A: β small, but assign B anyway as most problematic)
- User 5 (high cross-contamination with user 1, ) β Pilot C
- User 3 β Pilot B ( β smallest remaining cost)
- User 4 β Pilot A (, avoided)
Assignment: , , . Total contamination: (much better than random).
ex-ch03-14
HardMMSE rate bound with pilot contamination. Using the "use-and-then-forget" (UatF) approach (to be derived in Chapter 4), the per-user uplink rate with contaminated MMSE estimates and MRC combining is bounded below by:
For a two-cell system () with user per cell and i.i.d. channels :
(a) Compute the numerator: .
(b) Compute the contamination interference: .
(c) Show that as , the SINR converges to (an SINR floor independent of ).
The contaminated MMSE estimate is when .
Compute and using the linearity of the estimator.
Contaminated MMSE estimator
With :
(a) Signal term
(squaring gives the numerator)
(b) Contamination term
Same magnitude! Both signal and contamination grow as with the same coefficient.
(c) SINR floor
Both numerator and contamination scale as , so the SINR converges to:
(plus noise term which becomes negligible as ).
With only one contaminator and identical covariances, the floor is SINR = 1 (0 dB). For co-pilot cells: SINR.
ex-ch03-15
ChallengeSimulation design. Design a Monte Carlo simulation to verify the pilot contamination SINR floor prediction from Theorem thm-sinr-floor.
(a) Specify the simulation setup: , , range of , channel model, pilot assignment, combining.
(b) Write pseudocode for the Monte Carlo simulation.
(c) What convergence behavior do you expect for the SINR as increases, and how many Monte Carlo trials are needed for accurate estimation at for dB accuracy?
Generate for all cells and users.
Compute contaminated estimates using the MMSE formula.
MRC receive: . Compute SINR per trial, average.
(a) Simulation setup
- hexagonal cells, users per cell
- Channel model: one-ring with , angular spread
- Universal pilot reuse ()
- MRC combining using contaminated MMSE estimate
(b) Pseudocode
for each N_t in range:
sinr_trials = []
for trial = 1:N_mc:
Generate h[l,k] ~ CN(0, R[l,k]) for all l,k
Compute contaminated observations y[k] = sqrt(p*tau_p)*sum_l(h[l,k]) + n
Compute MMSE estimates h_hat[k]
Compute MRC output: r_k = h_hat[k]^H * (sqrt(p)*sum(h[1,:]*s) + noise)
Compute SINR_k = |E[h_hat^H h]|^2 / (interference + noise)
sinr_trials.append(SINR_k)
sinr_vs_nt.append(mean(sinr_trials))
Plot sinr_vs_nt vs N_t (should plateau at SINR_floor)
(c) Convergence and accuracy
SINR converges to the floor as . For large , self-averaging reduces variance β the law of large numbers over antenna elements.
For dB ( relative SINR accuracy), by central limit theorem, needed trials . With (typical): trials.