MIMO Capacity Review — From Book ITA to Space-Time Coding
From ITA Ch. 13.5 to Space-Time Coding
Book ITA Ch. 13.5 established the capacity of the i.i.d. Rayleigh MIMO channel — arguably the single most influential result in modern wireless information theory. Two independent papers, Telatar (1995/1999, AT&T Bell Labs Technical Memorandum and later the European Transactions on Telecommunications / IEEE Trans. IT paper) and Foschini–Gans (1998, Wireless Personal Communications), proved that adding antennas at both ends of a wireless link scales capacity linearly in , not logarithmically as the SISO Shannon formula suggests. This was the trigger for twenty-five years of MIMO research and for every modern wireless standard from 3G HSPA onwards.
From an information-theoretic standpoint, the capacity is the end of the story. From a coding-theoretic standpoint, it is the beginning. In Part III we ask: given the capacity, how do we design codes that (a) approach it when the fading is ergodic, and (b) operate reliably on the much harsher block-fading channel where the capacity is itself random? This chapter sets up the two pieces of the answer: the capacity formula we are trying to approach (§1 — a review), and the code design criteria (rank and determinant) that tell us how to build codes that achieve the right diversity and coding gain on block-fading channels (§3, §4, §5).
We treat the MIMO capacity review briefly — the full proof is in ITA Ch. 13.5 — and concentrate on the structural features that are operationally relevant to space-time coding: the multiplexing prelog , the parallel-subchannel decomposition via the SVD of , and the difference between the ergodic and outage notions of capacity (fleshed out in §2).
Definition: MIMO System Model
MIMO System Model
A point-to-point MIMO channel with transmit antennas and receive antennas is the discrete-time complex-baseband model where:
- is the transmitted symbol vector, satisfying the average-power constraint ;
- is the channel matrix;
- is circularly-symmetric complex AWGN;
- is the received vector.
The transmit SNR is . Under the i.i.d. Rayleigh assumption, the entries are i.i.d. . We write in -form.
The normalisation is a modelling convention: path-loss and shadowing are absorbed into the linear-scale rather than the channel variance. The transmit power is the sum power across antennas; it does NOT scale with , so the per-antenna power budget decreases as we add antennas. This is why the capacity formula below has , not — adding antennas at the transmitter distributes a fixed power budget.
Theorem: Ergodic MIMO Capacity (Telatar 1999)
For the i.i.d. Rayleigh MIMO channel with transmit antennas, receive antennas, transmit SNR , and perfect CSI at the receiver (CSIR), the ergodic capacity under equal-power (isotropic) transmission is At high SNR, this behaves as i.e., the multiplexing prelog is .
Diagonalise via the SVD: with holding the singular values . Pre-rotating the input by and post-rotating the output by turns the MIMO channel into parallel scalar sub-channels with gains . Water-filling over the random gains gives a capacity that, for i.i.d. Rayleigh, reduces to the isotropic formula above (water-filling and equal allocation coincide in expectation for i.i.d. channels). At high SNR, each active sub-channel contributes , and the -det splits into a sum of logarithms — hence the prelog.
Start from the MIMO mutual information with Gaussian input and isotropic covariance .
Use the determinant identity to move between the -dim and -dim forms.
For the high-SNR asymptotic, factor out of the dominant terms and verify the constant is the offset encoded by .
Gaussian input achieves capacity
The capacity of the vector Gaussian channel with known and input covariance constraint is a concave optimisation over : For i.i.d. Rayleigh (the distribution is invariant under unitary transformations ), the capacity-achieving averaged over is : any unitary rotation of the input leaves the ensemble distribution unchanged, so isotropic input is an optimiser.
Ergodic average
Substituting and yields the conditional capacity . The ergodic capacity is the expectation over the fading distribution, Operationally, this is the capacity when the codeword spans many independent fades (ergodic assumption).
SVD and parallel sub-channels
Write , with . Then . This is a sum of independent SISO capacities — the MIMO channel decomposes into parallel eigen-sub-channels.
High-SNR prelog
At large , each active term contributes , and the sum is The first term is the multiplexing prelog; the second is a finite offset that depends on the fading distribution. For i.i.d. Rayleigh it is negative (penalty relative to a deterministic parallel channel with identical eigenvalues).
Ergodic MIMO Capacity vs. SNR
Monte-Carlo ergodic capacity of the i.i.d. Rayleigh MIMO channel under isotropic Gaussian input, swept over SNR in dB. The dashed reference is the SISO capacity . At high SNR the slope of each curve is bits/channel use per 3 dB.
Parameters
Example: MIMO Capacity at 10 dB for and Rayleigh
Estimate the ergodic capacity of an i.i.d. Rayleigh MIMO channel at dB for (a) , (b) , and (c) . Compare against the SISO capacity bits/channel use, and verify the rough-cut high-SNR approximation .
SISO reference
At linear, bits/channel use. This is our baseline.
$2 \times 2$ ergodic capacity
Averaging over i.i.d. entries (the factor is ) gives bits/channel use — about the SISO rate. The high-SNR estimate over-predicts (the offset is negative: ).
$4 \times 4$ ergodic capacity
With , bits/channel use — about the SISO rate. High-SNR estimate ; offset . The capacity-per-antenna ratio is below the SISO at the same total : the per-antenna penalty grows with because the power is split across more streams.
$8 \times 8$ ergodic capacity
At with , the Monte-Carlo average is bits/channel use — about SISO. The multiplexing gain is unmistakable: eight antennas multiplied the spectral efficiency by more than five at 10 dB, without any code yet being specified.
Interpretation
The ergodic capacity grows roughly linearly in , validating the high-SNR prelog. The offset from the naive prediction widens with because (i) the per-antenna SNR drops, and (ii) the Marchenko–Pastur eigenvalue spread of deviates from unity.
Key Takeaway
The single most important number in MIMO theory is the multiplexing prelog . It determines how fast capacity grows in and, in Part III, it will re-emerge as the maximum multiplexing gain of the diversity-multiplexing tradeoff (Ch. 12). The purpose of a space-time code is to translate this potential into actual bits on the wire: either full streams (spatial multiplexing, low diversity — Ch. 11 V-BLAST) or full diversity (Alamouti / OSTBCs, rate 1 — Ch. 11), or an informed compromise (lattice STCs — Chs. 13, 17).
Capacity Says 'How Much', Coding Says 'How'
The capacity formula of Theorem TErgodic MIMO Capacity (Telatar 1999) is a fundamental limit: it does not tell the designer which codewords to transmit, only that a rate is achievable with arbitrarily low error for sufficiently long blocks. On a fast-fading channel (fading i.i.d. across symbols), the AWGN capacity-achieving coding recipes of Part II (BICM + LDPC) port over almost verbatim — interleavers make the channel "effectively" ergodic.
On a block-fading channel (quasi-static fading over the whole codeword), this reasoning collapses: the channel is not ergodic in the codeword time-scale, the capacity is a random variable, and diversity — not multiplexing — becomes the central code-design criterion. Sections §2–§5 develop this story.
Historical Note: Telatar 1995 and Foschini–Gans 1998 — The MIMO Capacity Result
1995–1999Emre Telatar (at AT&T Bell Labs) circulated the technical memorandum "Capacity of multi-antenna Gaussian channels" in 1995. The result — that the capacity of an MIMO channel scales as — was initially met with disbelief: the prevailing intuition held that capacity must grow logarithmically in total power, not linearly in the min of antenna counts. The memorandum circulated informally for four years before appearing in the European Transactions on Telecommunications (1999) and becoming the most-cited wireless-capacity paper of all time.
Independently, Gerard Foschini and Michael Gans at Bell Labs published "On limits of wireless communications in a fading environment when using multiple antennas" in Wireless Personal Communications (1998). They reached the same conclusion and additionally introduced the V-BLAST architecture — a practical receiver that approaches the MIMO capacity by successive interference cancellation (see Ch. 11). The combination of these two papers triggered the modern MIMO era: by 2001 Lucent, Bell Labs (Foschini's home), and others had MIMO prototypes in the lab; by 2008 3GPP HSPA was commercial; by 2019 5G NR was shipping massive-MIMO base stations.
The history is neatly told by Biglieri, Caire, Taricco, and others in survey articles (e.g., [?biglieri-caire-taricco-2000]) and in Goldsmith et al.'s 2003 survey [?goldsmith-jafar-jindal-vishwanath-2003].