Large-System Analysis of MIMO Detection

Why We Need Asymptotic Analysis

In Chapter 13 we derived the LMMSE (Wiener) estimator for linear models y=Hx+w\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{w}. Its MSE and output SINR can be written explicitly in terms of H\mathbf{H}, SNR\text{SNR}, and the input covariance. But in a large MIMO system with, say, nt=64n_t = 64 transmit and nr=128n_r = 128 receive antennas, these expressions hide the scaling behavior we actually care about. They depend on the specific realization of H\mathbf{H} through eigenvalues that fluctuate from channel to channel.

The point is that as the dimensions grow, the fluctuations vanish. A suitably normalized SINR concentrates around a deterministic value that depends only on the aspect ratio Ξ²=nt/nr\beta = n_t/n_r and the SNR. This is the power of random matrix theory for wireless systems: we replace the intractable dependence on H\mathbf{H} by a scalar fixed-point equation. The result is not a rough estimate β€” it is sharp, in the sense that the gap between the random SINR and its deterministic limit is O(1/nr)O(1/n_r).

Definition:

Empirical Spectral Distribution

Let XN\mathbf{X}_N be an NΓ—NN \times N Hermitian matrix with real eigenvalues Ξ»1,…,Ξ»N\lambda_1, \ldots, \lambda_N. Its empirical spectral distribution (ESD) is the random probability measure FN(x)=1Nβˆ‘k=1N1{Ξ»k≀x}.F_N(x) = \frac{1}{N} \sum_{k=1}^N \mathbb{1}\{\lambda_k \leq x\}. When FN(x)F_N(x) converges (weakly, almost surely) to a deterministic measure F(x)F(x) as Nβ†’βˆžN \to \infty, we call FF the limiting spectral distribution (LSD).

The ESD summarizes all spectral information into a single measure. Traces of polynomials and inverses of XN\mathbf{X}_N become integrals against FNF_N: 1Ntr⁑(XNβˆ’1)=∫xβˆ’1 dFN(x)\frac{1}{N}\operatorname{tr}(\mathbf{X}_N^{-1}) = \int x^{-1}\,dF_N(x).

Definition:

Stieltjes Transform

The Stieltjes transform of a probability measure ΞΌ\mu on R\mathbb{R} is mΞΌ(z)=∫R1xβˆ’z dΞΌ(x),z∈C+.m_\mu(z) = \int_{\mathbb{R}} \frac{1}{x - z}\,d\mu(x), \qquad z \in \mathbb{C}^+. For a Hermitian matrix XN\mathbf{X}_N with ESD FNF_N, the associated (random) Stieltjes transform is mN(z)=1Ntr⁑(XNβˆ’zI)βˆ’1m_N(z) = \frac{1}{N}\operatorname{tr}(\mathbf{X}_N - z\mathbf{I})^{-1}.

The Stieltjes transform is an analytic function on C+\mathbb{C}^+ (upper half-plane) that encodes the measure ΞΌ\mu. The inversion formula (Plemelj) recovers the density: fΞΌ(x)=1Ο€lim⁑η→0+Im⁑ mΞΌ(x+jΞ·)f_\mu(x) = \frac{1}{\pi}\lim_{\eta \to 0^+} \operatorname{Im}\, m_\mu(x + j\eta). Convergence of the ESD is equivalent to pointwise convergence of mN(z)m_N(z) on C+\mathbb{C}^+.

Theorem: Marchenko-Pastur Law

Let H\mathbf{H} be an NΓ—MN \times M matrix whose entries are i.i.d. complex (or real) with mean zero, variance 1/N1/N, and finite fourth moment. Let Ξ²=M/N\beta = M/N and consider the Gram matrix W=HHH\mathbf{W} = \mathbf{H}\mathbf{H}^H. As N,Mβ†’βˆžN, M \to \infty with M/Nβ†’Ξ²βˆˆ(0,∞)M/N \to \beta \in (0, \infty), the ESD of W\mathbf{W} converges almost surely to the Marchenko-Pastur distribution with density fMP(x)=(1βˆ’1Ξ²)+Ξ΄(x)+(xβˆ’Ξ»βˆ’)+(Ξ»+βˆ’x)+2Ο€x,λ±=(1Β±Ξ²)2.f_{\text{MP}}(x) = \left(1 - \frac{1}{\beta}\right)^+ \delta(x) + \frac{\sqrt{(x - \lambda_-)^+ (\lambda_+ - x)^+}}{2\pi x}, \quad \lambda_\pm = (1 \pm \sqrt{\beta})^2.

The eigenvalues of a large i.i.d. Gram matrix do not cluster around the mean E[Ξ»]=Ξ²\mathbb{E}[\lambda] = \beta. They spread out over the interval [Ξ»βˆ’,Ξ»+][\lambda_-, \lambda_+], and the width of this interval is 4Ξ²4\sqrt{\beta}. When Ξ²<1\beta < 1, the matrix is tall and full rank; when Ξ²>1\beta > 1, there are (1βˆ’1/Ξ²)N(1 - 1/\beta) N zero eigenvalues because HHH\mathbf{H}\mathbf{H}^H is rank-deficient. The point is that a large random MIMO channel has an extreme spectrum β€” its smallest singular value approaches ∣1βˆ’Ξ²βˆ£|1 - \sqrt{\beta}|, not zero, provided Ξ²β‰ 1\beta \neq 1.

,

Marchenko-Pastur Density for Several Aspect Ratios

Marchenko-Pastur Density for Several Aspect Ratios
Density fMP(x)f_{\text{MP}}(x) for β∈{0.25,0.5,1.0,2.0}\beta \in \{0.25, 0.5, 1.0, 2.0\}. At β=1\beta = 1 the density diverges at x=0x = 0; this is the hard edge that makes square Gaussian matrices poorly conditioned.

Marchenko-Pastur: Empirical vs. Asymptotic Spectrum

Compare the eigenvalue histogram of a finite random Gram matrix W=HHH\mathbf{W} = \mathbf{H}\mathbf{H}^H (with H\mathbf{H} i.i.d. CN\mathcal{CN}) to the Marchenko-Pastur density. Adjust the dimensions and watch the histogram lock onto the limit.

Parameters
200
0.5
1

Definition:

LMMSE Detector SINR (Finite nrn_r)

Consider the MIMO channel y=Hx+w\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{w} with x∼CN(0,I)\mathbf{x} \sim \mathcal{CN}(\mathbf{0}, \mathbf{I}) and w∼CN(0,Οƒ2I)\mathbf{w} \sim \mathcal{CN}(\mathbf{0}, \sigma^2\mathbf{I}). The LMMSE detector for stream kk is x^k=gkHy\hat{x}_k = \mathbf{g}_k^H \mathbf{y} with G=HH(HHH+Οƒ2I)βˆ’1\mathbf{G} = \mathbf{H}^{H}(\mathbf{H}\mathbf{H}^{H} + \sigma^2\mathbf{I})^{-1}. Equivalently, G=(HHH+Οƒ2I)βˆ’1HH\mathbf{G} = (\mathbf{H}^{H}\mathbf{H} + \sigma^2\mathbf{I})^{-1}\mathbf{H}^{H}. Its output SINR for stream kk is SINR⁑k=hkH(H(k)H(k)H+Οƒ2I)βˆ’1hk,\operatorname{SINR}_k = \mathbf{h}_k^H (\mathbf{H}_{(k)}\mathbf{H}_{(k)}^{H} + \sigma^2\mathbf{I})^{-1} \mathbf{h}_k, where H(k)\mathbf{H}_{(k)} is H\mathbf{H} with column kk removed.

Theorem: Deterministic Equivalent of the LMMSE SINR

Assume the entries of H\mathbf{H} are i.i.d. CN(0,1/nr)\mathcal{CN}(0, 1/n_r). In the large-system limit nt,nrβ†’βˆžn_t, n_r \to \infty with nt/nrβ†’Ξ²n_t/n_r \to \beta, the output SINR of the LMMSE detector on any stream converges almost surely to SINR⁑∞(Ξ²,SNR)=1SNRβˆ’1β‹…m(βˆ’SNRβˆ’1)+correctionβˆ’1,\operatorname{SINR}^\infty(\beta, \text{SNR}) = \frac{1}{\text{SNR}^{-1} \cdot m(-\text{SNR}^{-1}) + \text{correction}} - 1, where m(z)m(z) satisfies the fixed-point equation m(z)=(βˆ’z+Ξ²1+m(z))βˆ’1.m(z) = \left(-z + \frac{\beta}{1 + m(z)}\right)^{-1}. Equivalently, the limiting SINR is the unique positive solution of Ξ³=SNR1+Ξ²β‹…Ξ³1+Ξ³.\gamma = \frac{\text{SNR}}{1 + \beta \cdot \frac{\gamma}{1 + \gamma}}.

The deterministic equivalent replaces a random quantity (the SINR, which depends on the channel realization) by a scalar depending only on Ξ²\beta and SNR\text{SNR}. The fixed-point Ξ³=SNR/(1+Ξ²Ξ³/(1+Ξ³))\gamma = \text{SNR}/(1 + \beta\gamma/(1+\gamma)) admits an elegant interpretation: in the large-system limit each stream "sees" an effective interference-plus-noise with variance equal to a self-consistent function of its own SINR. The circular definition resolves into a quadratic. This is the simplest instance of the replica symmetric prediction in random matrix theory.

, ,

Example: Solving the LMMSE Fixed-Point Equation

For a MIMO system with Ξ²=0.5\beta = 0.5 (two receive antennas per transmit stream) and SNR=10\text{SNR} = 10 dB, compute the asymptotic LMMSE SINR Ξ³βˆ—\gamma^*. Compare to the single-user SISO rate log⁑2(1+SNR)\log_2(1+\text{SNR}).

Why the Fixed-Point is Benign

The fixed-point equation Ξ³=SNR/(1+Ξ²Ξ³/(1+Ξ³))\gamma = \text{SNR}/(1 + \beta\gamma/(1+\gamma)) is not a convex optimization per se, but the map whose fixed point we seek is a contraction on (0,∞)(0, \infty) in the metric d(Ξ³1,Ξ³2)=∣log⁑(Ξ³1/Ξ³2)∣d(\gamma_1, \gamma_2) = |\log(\gamma_1/\gamma_2)|. Fixed-point iteration converges globally and monotonically from any starting point β€” a reliable property that matters when we need to evaluate the limit over thousands of (Ξ²,SNR)(\beta, \text{SNR}) pairs for system design.

Deterministic Equivalent vs. Monte Carlo

Compare the asymptotic SINR predicted by the fixed-point equation to the empirical SINR distribution over random H\mathbf{H} realizations. As nrn_r grows, the empirical distribution concentrates at the fixed point.

Parameters
0.5
10
32
⚠️Engineering Note

Large-System Approximations in System Design

System designers routinely use large-system deterministic equivalents for early-stage link budget analysis. For nrβ‰₯16n_r \geq 16, the prediction error of the asymptotic SINR formula is typically under 0.3 dB. This allows sweeping thousands of configurations analytically rather than running Monte Carlo.

Practical Constraints
  • β€’

    Formulas assume i.i.d. Rayleigh fading β€” correlated channels require the generalized Silverstein equation.

  • β€’

    Per-realization outage behavior is lost β€” only ergodic metrics are captured.

  • β€’

    Deterministic equivalents are asymptotic: below nrβ‰ˆ8n_r \approx 8 the variance of the SINR matters.

Historical Note: From Nuclear Physics to Wireless

1967-1999

The Marchenko-Pastur law (1967) was published in a Soviet mathematics journal with no applied motivation β€” it generalized Wigner's semicircle law from nuclear physics to rectangular matrices. Three decades later, David Tse and Steve Hanly (1999) recognized that the same distribution governs the SINR of CDMA receivers when the number of users and the spreading gain both grow to infinity. Sergio VerdΓΊ and Shlomo Shamai independently obtained closely related results. This opened a highly productive line of research that is now the standard analytical tool for massive MIMO and cell-free networks.

Why This Matters: Cell-Free Massive MIMO

In cell-free massive MIMO, each user is served by a large pool of distributed access points. The combined channel matrix is thin and tall, with Ξ²β‰ͺ1\beta \ll 1. The Marchenko-Pastur analysis here predicts the achievable per-user SINR after centralized LMMSE processing, and informs the centralized-vs-distributed tradeoff explored in Book MIMO Chapter 18.

Common Mistake: Forgetting the Aspect Ratio Regime

Mistake:

Applying the Marchenko-Pastur formulas in the square regime Ξ²β‰ˆ1\beta \approx 1, where the density diverges at x=0x = 0 and small eigenvalues dominate the MSE.

Correction:

At Ξ²=1\beta = 1, the smallest eigenvalue of HHH\mathbf{H}\mathbf{H}^{H} scales as 1/nr1/n_r, so the LMMSE matrix inversion is ill-conditioned. The deterministic equivalent is still valid, but the constants in the O(1/nr)O(1/n_r) error bound blow up as Ξ²β†’1\beta \to 1. In practice, stay away from Ξ²=1\beta = 1 β€” add regularization or use more receive antennas.

Quick Check

For Ξ²=0.25\beta = 0.25 (four times more rows than columns), what is the lower edge Ξ»βˆ’\lambda_- of the Marchenko-Pastur density?

Ξ»βˆ’=0.25\lambda_- = 0.25

Ξ»βˆ’=0\lambda_- = 0

Ξ»βˆ’=0.5\lambda_- = 0.5

Ξ»βˆ’=1βˆ’2Ξ²\lambda_- = 1 - 2\sqrt{\beta}

The Marchenko-Pastur Law Emerging

Animated convergence of the eigenvalue histogram of HHH\mathbf{H}\mathbf{H}^{H} to the Marchenko-Pastur density as NN grows. The random spectrum locks onto the deterministic limit before your eyes.

Key Takeaway

In the large-system limit, the random output SINR of an LMMSE detector on an i.i.d. Rayleigh MIMO channel concentrates around a deterministic value characterized by a simple scalar fixed-point equation in Ξ²\beta and SNR\text{SNR}. This replaces per-realization simulation with an analytic formula β€” an indispensable tool for massive MIMO system design.

πŸŽ“CommIT Contribution(2011)

Deterministic Equivalents for Correlated Channels

R. Couillet, M. Debbah, G. Caire β€” IEEE Trans. Inform. Theory

The CommIT group contributed to extending deterministic-equivalent techniques from the classical i.i.d. Marchenko-Pastur setting to isometric (Haar) precoders and correlated channels. This is the mathematical foundation behind many of the system-level predictions in Book MIMO.

random matrixdeterministic equivalentlarge systemView Paper β†’