MIMO Capacity Review — From Book ITA to Space-Time Coding

From ITA Ch. 13.5 to Space-Time Coding

Book ITA Ch. 13.5 established the capacity of the i.i.d. Rayleigh MIMO channel — arguably the single most influential result in modern wireless information theory. Two independent papers, Telatar (1995/1999, AT&T Bell Labs Technical Memorandum and later the European Transactions on Telecommunications / IEEE Trans. IT paper) and Foschini–Gans (1998, Wireless Personal Communications), proved that adding antennas at both ends of a wireless link scales capacity linearly in min(nt,nr)\min(n_t, n_r), not logarithmically as the SISO Shannon formula log2(1+SNR)\log_2(1 + \text{SNR}) suggests. This was the trigger for twenty-five years of MIMO research and for every modern wireless standard from 3G HSPA onwards.

From an information-theoretic standpoint, the capacity is the end of the story. From a coding-theoretic standpoint, it is the beginning. In Part III we ask: given the capacity, how do we design codes that (a) approach it when the fading is ergodic, and (b) operate reliably on the much harsher block-fading channel where the capacity is itself random? This chapter sets up the two pieces of the answer: the capacity formula we are trying to approach (§1 — a review), and the code design criteria (rank and determinant) that tell us how to build codes that achieve the right diversity and coding gain on block-fading channels (§3, §4, §5).

We treat the MIMO capacity review briefly — the full proof is in ITA Ch. 13.5 — and concentrate on the structural features that are operationally relevant to space-time coding: the multiplexing prelog min(nt,nr)\min(n_t, n_r), the parallel-subchannel decomposition via the SVD of H\mathbf{H}, and the difference between the ergodic and outage notions of capacity (fleshed out in §2).

,

Definition:

MIMO System Model

A point-to-point MIMO channel with ntn_t transmit antennas and nrn_r receive antennas is the discrete-time complex-baseband model y  =  Hx+w,\mathbf{y} \;=\; \mathbf{H}\mathbf{x} + \mathbf{w}, where:

  • xCnt\mathbf{x} \in \mathbb{C}^{n_t} is the transmitted symbol vector, satisfying the average-power constraint E[x2]P\mathbb{E}[\|\mathbf{x}\|^2] \le P;
  • HCnr×nt\mathbf{H} \in \mathbb{C}^{n_r \times n_t} is the channel matrix;
  • wCN(0,σ2Inr)\mathbf{w} \sim \mathcal{CN}(\mathbf{0}, \sigma^2\mathbf{I}_{n_r}) is circularly-symmetric complex AWGN;
  • yCnr\mathbf{y} \in \mathbb{C}^{n_r} is the received vector.

The transmit SNR is SNRP/σ2\text{SNR} \triangleq P/\sigma^2. Under the i.i.d. Rayleigh assumption, the entries [H]ij[\mathbf{H}]_{ij} are i.i.d. CN(0,1)\mathcal{CN}(0, 1). We write HCN(0,Inrnt)\mathbf{H} \sim \mathcal{CN}(0, \mathbf{I}_{n_r n_t}) in vec(H)\mathrm{vec}(\mathbf{H})-form.

The normalisation E[[H]ij2]=1\mathbb{E}[|[\mathbf{H}]_{ij}|^2] = 1 is a modelling convention: path-loss and shadowing are absorbed into the linear-scale SNR\text{SNR} rather than the channel variance. The transmit power PP is the sum power across antennas; it does NOT scale with ntn_t, so the per-antenna power budget decreases as we add antennas. This is why the capacity formula below has SNR/nt\text{SNR}/n_t, not SNR\text{SNR} — adding antennas at the transmitter distributes a fixed power budget.

,

Theorem: Ergodic MIMO Capacity (Telatar 1999)

For the i.i.d. Rayleigh MIMO channel with ntn_t transmit antennas, nrn_r receive antennas, transmit SNR SNR\text{SNR}, and perfect CSI at the receiver (CSIR), the ergodic capacity under equal-power (isotropic) transmission is C(SNR;nt,nr)  =  EH ⁣[log2det ⁣(Inr+SNRntHHH)]bits/channel use.C(\text{SNR}; n_t, n_r) \;=\; \mathbb{E}_{\mathbf{H}}\!\left[ \log_2 \det\!\left( \mathbf{I}_{n_r} + \tfrac{\text{SNR}}{n_t} \mathbf{H}\mathbf{H}^{H} \right) \right] \quad \text{bits/channel use.} At high SNR, this behaves as C(SNR;nt,nr)  =  min(nt,nr)log2SNR  +  O(1),C(\text{SNR}; n_t, n_r) \;=\; \min(n_t, n_r)\,\log_2\text{SNR} \;+\; O(1), i.e., the multiplexing prelog is min(nt,nr)\min(n_t, n_r).

Diagonalise H\mathbf{H} via the SVD: H=UΣVH\mathbf{H} = \mathbf{U}\boldsymbol{\Sigma} \mathbf{V}^H with Σ\boldsymbol{\Sigma} holding the singular values σ1σmin(nt,nr)0\sigma_1 \ge \cdots \ge \sigma_{\min(n_t,n_r)} \ge 0. Pre-rotating the input by V\mathbf{V} and post-rotating the output by UH\mathbf{U}^H turns the MIMO channel into min(nt,nr)\min(n_t, n_r) parallel scalar sub-channels with gains σi2\sigma_i^2. Water-filling over the random gains gives a capacity that, for i.i.d. Rayleigh, reduces to the isotropic formula above (water-filling and equal allocation coincide in expectation for i.i.d. channels). At high SNR, each active sub-channel contributes log2(σi2SNR/nt)\log_2(\sigma_i^2 \text{SNR}/n_t), and the log\log-det splits into a sum of min(nt,nr)\min(n_t, n_r) logarithms — hence the prelog.

, ,

Ergodic MIMO Capacity C(SNR;nt,nr)C(\text{SNR}; n_t, n_r) vs. SNR

Monte-Carlo ergodic capacity of the nt×nrn_t \times n_r i.i.d. Rayleigh MIMO channel under isotropic Gaussian input, swept over SNR in dB. The dashed reference is the SISO capacity log2(1+SNR)\log_2(1 + \text{SNR}). At high SNR the slope of each curve is min(nt,nr)\min(n_t, n_r) bits/channel use per 3 dB.

Parameters
2
2

Example: MIMO Capacity at 10 dB for 2×22\times 2 and 4×44\times 4 Rayleigh

Estimate the ergodic capacity of an i.i.d. Rayleigh MIMO channel at SNR=10\text{SNR} = 10 dB for (a) nt=nr=2n_t = n_r = 2, (b) nt=nr=4n_t = n_r = 4, and (c) nt=nr=8n_t = n_r = 8. Compare against the SISO capacity log2(1+SNR)=log2(11)3.46\log_2(1 + \text{SNR}) = \log_2(11) \approx 3.46 bits/channel use, and verify the rough-cut high-SNR approximation Cmin(nt,nr)log2SNRC \approx \min(n_t, n_r) \log_2 \text{SNR}.

,

Key Takeaway

The single most important number in MIMO theory is the multiplexing prelog min(nt,nr)\min(n_t, n_r). It determines how fast capacity grows in log2SNR\log_2\text{SNR} and, in Part III, it will re-emerge as the maximum multiplexing gain of the diversity-multiplexing tradeoff (Ch. 12). The purpose of a space-time code is to translate this potential into actual bits on the wire: either full min(nt,nr)\min(n_t, n_r) streams (spatial multiplexing, low diversity — Ch. 11 V-BLAST) or full ntnrn_t n_r diversity (Alamouti / OSTBCs, rate 1 — Ch. 11), or an informed compromise (lattice STCs — Chs. 13, 17).

Capacity Says 'How Much', Coding Says 'How'

The capacity formula of Theorem TErgodic MIMO Capacity (Telatar 1999) is a fundamental limit: it does not tell the designer which codewords to transmit, only that a rate R<CR < C is achievable with arbitrarily low error for sufficiently long blocks. On a fast-fading channel (fading i.i.d. across symbols), the AWGN capacity-achieving coding recipes of Part II (BICM + LDPC) port over almost verbatim — interleavers make the channel "effectively" ergodic.

On a block-fading channel (quasi-static fading over the whole codeword), this reasoning collapses: the channel is not ergodic in the codeword time-scale, the capacity C(H)C(\mathbf{H}) is a random variable, and diversity — not multiplexing — becomes the central code-design criterion. Sections §2–§5 develop this story.

Historical Note: Telatar 1995 and Foschini–Gans 1998 — The MIMO Capacity Result

1995–1999

Emre Telatar (at AT&T Bell Labs) circulated the technical memorandum "Capacity of multi-antenna Gaussian channels" in 1995. The result — that the capacity of an nt×nrn_t \times n_r MIMO channel scales as min(nt,nr)logSNR\min(n_t, n_r)\log\text{SNR} — was initially met with disbelief: the prevailing intuition held that capacity must grow logarithmically in total power, not linearly in the min of antenna counts. The memorandum circulated informally for four years before appearing in the European Transactions on Telecommunications (1999) and becoming the most-cited wireless-capacity paper of all time.

Independently, Gerard Foschini and Michael Gans at Bell Labs published "On limits of wireless communications in a fading environment when using multiple antennas" in Wireless Personal Communications (1998). They reached the same conclusion and additionally introduced the V-BLAST architecture — a practical receiver that approaches the MIMO capacity by successive interference cancellation (see Ch. 11). The combination of these two papers triggered the modern MIMO era: by 2001 Lucent, Bell Labs (Foschini's home), and others had MIMO prototypes in the lab; by 2008 3GPP HSPA was commercial; by 2019 5G NR was shipping 64×464 \times 4 massive-MIMO base stations.

The history is neatly told by Biglieri, Caire, Taricco, and others in survey articles (e.g., [?biglieri-caire-taricco-2000]) and in Goldsmith et al.'s 2003 survey [?goldsmith-jafar-jindal-vishwanath-2003].

, ,