The Zheng-Tse Theorem
The Curve that Binds Diversity and Multiplexing
We are ready to state the central theorem of Part III: the Zheng-Tse diversity-multiplexing tradeoff. For an i.i.d. Rayleigh channel with block length , the optimal tradeoff curve is the piecewise-linear interpolation of the points
The theorem has two independent parts:
-
Converse — an outage-probability lower bound: no code family operating at multiplexing gain can achieve diversity gain exceeding . This is a purely information-theoretic statement, proved by computing the outage exponent from the Wishart eigenvalue distribution of .
-
Achievability — a constructive upper bound: a Gaussian random codebook with i.i.d. entries and rate achieves diversity exactly in the high-SNR limit. This is the error-exponent counterpart of Shannon's random-coding theorem, lifted to the exponential- equality regime.
The matching of the two bounds gives a tight characterisation: the DMT curve is the exact, best-possible tradeoff. The proof pattern — outage converse + Gaussian random-coding achievability — is the same pattern as ergodic-capacity proofs (Chapter 10), adapted for the exponential-equality regime. We walk through both parts in full.
Definition: Diversity-Multiplexing Tradeoff Curve
Diversity-Multiplexing Tradeoff Curve
The diversity-multiplexing tradeoff (DMT) curve of an MIMO channel is the function By definition is the pointwise maximum diversity attainable at each multiplexing gain . The curve is non-increasing in : higher costs more diversity.
Endpoints. At (fixed rate), is the maximum classical diversity order — for an i.i.d. Rayleigh channel this is . At (maximum multiplexing slope, approaching capacity), : there is no reliability margin left when the rate tracks capacity.
Operational meaning. A code family's operating point must lie below the curve: . A code is DMT-optimal if its operating point lies on the curve for the entire range — i.e., it achieves the best diversity for every multiplexing slope. Alamouti achieves the curve only at ; the Golden code (§4) achieves it for all on .
Theorem: Zheng-Tse Diversity-Multiplexing Tradeoff
Consider an i.i.d. Rayleigh MIMO channel with coherent detection and block length . The DMT curve is the piecewise-linear interpolation of the integer corner points Explicitly, for with , The maximum multiplexing gain is and the maximum diversity gain is .
The optimal tradeoff is achieved by a Gaussian random codebook with i.i.d. entries.
Think of as "how many independent streams you are sending", and as "how much protection each stream gets". The channel has independent scalar fading coefficients in . If you send streams, they "consume" rows of on the transmit side and columns on the receive side; what remains for diversity protection is an sub-channel with independent fading coefficients. That sub-channel supplies the diversity exponent. The tradeoff is exactly the geometric product of the "spare" transmit and receive dimensions.
Converse — compute the outage exponent from the joint Wishart density of eigenvalues.
Achievability — use a Gaussian random codebook at rate ; bound the PEP by the outage probability plus an exponentially small correction.
Matching — both the converse and achievability computations reduce to the same large-deviations principle on the eigenvalue vector, with the rate function under .
Setup — rewrite outage as eigenvalue event
Let be with i.i.d. entries. The instantaneous capacity is where and are the ordered nonzero eigenvalues of . Reparametrise each eigenvalue as : the exponent vector with encodes the "fading depth" of each eigenvalue.
The outage event at rate is For large , , so the outage event reduces to
Eigenvalue density — Wishart asymptotics
The joint density of the ordered eigenvalues of (complex Wishart, case) is where is a normalising constant. Substituting , , the density in is The factor is for and contributes exponent if , or suppresses the density super-polynomially if . So the relevant regime is (all eigenvalues decay in SNR), and after collecting the Vandermonde and marginal factors (assuming ; note the Vandermonde contributes from the pairs and from the pairs, summing to ; but with ordering, only pairs contribute and the exponent becomes , which combined with the factor gives the coefficient; see Zheng-Tse eq. (15)).
Outage exponent — large-deviations optimisation
The outage probability at multiplexing gain is with where . This is a linear program in over a convex region.
For with integer , the optimum is attained at for (these eigenvalues fail to fade) and for (these eigenvalues fade at rate ), with tuned to exactly make : . Plugging in, the sum at the optimum evaluates to the piecewise-linear interpolation of at . After the algebra (Zheng-Tse Lemma 5), one obtains exactly with linear interpolation between integers.
Converse — diversity of any code is $\le d^*_{\rm out}(r)$
For any coding scheme at rate , the block error probability is lower-bounded by the outage: Therefore the diversity gain satisfies . No code can do better than the outage exponent in diversity.
Achievability — Gaussian random codebook
Consider a random codebook of size , each codeword with i.i.d. entries. The average block error probability under ML decoding is bounded by The first term is the outage — its exponent is . The second term (undetected error given not in outage) can be made exponentially smaller than the outage by a Gallager random-coding argument provided : the ensemble average over Gaussian codebooks, combined with a union bound and the Chernoff-bound-to-PEP analysis of Chapter 11, gives i.e., faster-than-polynomial decay. The block-length assumption is needed so that the error-matrix product is full-rank with probability — §5 discusses what breaks when is shorter.
Combining, . The matching converse and achievability give at integer , with linear interpolation.
Building the DMT Curve: Corner Points
The DMT Curve for Configurable
Explore the DMT curve (piecewise-linear between integer corner points) as a function of . The corner points for are marked. Use the "Compare with" dropdown to overlay a second configuration and see how asymmetric (e.g., vs ) gives identical DMT curves — the pair is symmetric in .
Parameters
Example: DMT Corner Points for , , ,
For each of the MIMO configurations , list the corner points of the DMT curve. Sketch the four curves on a single graph and identify the maximum diversity and maximum multiplexing of each.
$4 \times 4$
. Corners at : Max diversity ; max multiplexing . The curve is the piecewise-linear graph through these five points.
$2 \times 4$ and $4 \times 2$
in both cases. Corners at : Max diversity (= ); max multiplexing (= ).
The DMT curve is identical for and : the formula is symmetric in . Even though the two physical channels are different (uplink vs downlink asymmetry), their fundamental-limit tradeoff is the same. The asymmetry re-emerges at finite SNR through coding gain and through the complexity of transmitter/receiver processing.
$2 \times 2$
. Corners at : This is the canonical DMT curve — the one that governs Alamouti, V-BLAST, and the Golden code. The steep initial slope vs shallow final slope is the tell-tale piecewise structure.
Takeaway
Larger MIMO dimensions give both more diversity (at any fixed ) and more multiplexing (higher ). There is no tradeoff between adding antennas and any of the two resources — the tradeoff is within a fixed antenna pair, between and . This is why 5G massive MIMO aims for : even a modest rank retains a large diversity buffer.
Pattern-Aware: Zheng-Tse = Shannon, One Level Up
The structure of the Zheng-Tse proof should feel familiar:
- Converse: the outage lower bound — error probability outage probability at the target rate. This is the DMT-regime analogue of "error probability is lower bounded by the probability that rate exceeds capacity", the Fano-inequality backbone of classical converses.
- Achievability: a Gaussian random codebook. Same distribution as Telatar's ergodic-capacity proof — and in fact, the same codebook achieves both the ergodic capacity at and the full DMT curve at every other . Gaussian is doubly universal.
- Matching: both computations reduce to the same large-deviations rate function on the eigenvalue exponent vector , with the Vandermonde factor of the Wishart distribution driving the coefficients . The LP optimum is the piecewise-linear DMT curve.
So Zheng-Tse is Shannon's random-coding-argument template, lifted from the rate-reliability plane to the multiplexing-diversity plane . The same proof machinery, a different scaling regime. This is why Gaussian i.i.d. codebooks "just work" in MIMO theory — they are the default answer to "what codebook achieves the exponent", at every level of the hierarchy.
Quick Check
On a i.i.d. Rayleigh MIMO channel with block length , what is (linearly interpolated)?
(because exceeds the spatial degrees of freedom )
Corner points: , , , . At , interpolate linearly between and : . Equivalently, is not the correct answer at non-integer — the curve is piecewise-linear, not the continuous quadratic.
Common Mistake: at Non-Integer
Mistake:
Using the continuous formula at non-integer — e.g., writing .
Correction:
The continuous formula is only correct at integer . Between integer corner points the DMT is the piecewise-linear interpolation of those corners. The piecewise-linear curve is above the continuous quadratic: e.g., on at , the piecewise-linear value is but the quadratic would give .
The quadratic is the lower envelope (the achievable exponent using a fixed-rank scheme) while the piecewise linear is the full tradeoff (achievable by time-sharing / rank-adaptive schemes). The diversity gap is the benefit of rank adaptation / time-sharing, which connects fractional to adjacent integer operating points.
DMT Asymptotics vs Finite-SNR Practice
The DMT curve is an asymptotic statement: is exact only as . In practice, "high SNR" in cellular and Wi-Fi systems means – dB, which is where the DMT is approximate. At these operating points:
- The empirical diversity exponent lags the asymptotic by –%. The interactive plot above shows this convergence gap directly: the empirical slope at dB is typically to , reaching only around dB.
- Coding gain — the constant in front of — matters as much as itself at these SNRs. A 3 dB coding-gain gap corresponds to a 2x SNR improvement; two codes with the same but different coding gains can differ substantially in practice.
- Chapter 13 introduces CDA / Golden-code families that achieve both the optimal and high coding gain, closing the finite-SNR gap.
- •
DMT asymptotic SNR regime: typically dB for , dB for .
- •
Finite-SNR gap between empirical and asymptotic exponent: –% at cellular operating SNRs.
- •
Coding gain and DMT are independent design targets — optimize both.
Historical Note: Zheng and Tse 2003: The Unifying Tradeoff
2003Lizhong Zheng and David Tse's May 2003 IEEE Trans. Information Theory paper, "Diversity and multiplexing: A fundamental tradeoff in multiple- antenna channels" (vol. 49, no. 5, pp. 1073–1096), is one of the most cited papers in wireless information theory. Zheng, then a PhD student at Berkeley, and Tse, his advisor, resolved the five-year-old Alamouti- vs-V-BLAST tension by showing that both schemes are corner points of a single piecewise-linear curve — the DMT.
Three contributions made the paper a classic:
- The right question. The prior literature debated "is Alamouti good?" and "is V-BLAST good?" as if they competed; Zheng-Tse reframed the question as "which pair are you targeting, and is your code optimal at that point?"
- The closed-form answer. at integer corners is a formula that even wireless practitioners (not just information theorists) could compute and use as a design target.
- The proof technique. The Wishart large-deviations + Gaussian random-coding matching proof became the template for dozens of follow-up papers on DMT in other channel models: correlated fading (Telatar-Tse variants), ARQ (El Gamal-Caire-Damen), multicasting, MIMO broadcast, etc. The proof template is as influential as the theorem itself.
Subsequent CommIT-group work built directly on Zheng-Tse: Elia-Kumar- Pawar-Kumar-Lu 2006 constructed explicit DMT-optimal codes (CDA / Golden), and El Gamal-Caire-Damen 2004 proved lattice codes achieve the DMT. These are the topics of Chapters 13 and 17 respectively.
DMT Curve
The function giving the maximum diversity gain achievable at multiplexing gain on an i.i.d. Rayleigh channel. For block length , it is the piecewise-linear interpolation of for (Zheng-Tse 2003). The fundamental performance limit of any MIMO space-time code.
Related: Diversity Gain , Multiplexing Gain , Wishart, Outage Probability and -Outage Capacity
Wishart Distribution
The distribution of when is an matrix with i.i.d. entries. The joint density of the ordered eigenvalues carries a factor (Vandermonde times marginal), which drives the DMT large-deviations exponent computation.
Related: Wishart Eigenvalues, DMT Slope and Segment Structure, Outage Probability and -Outage Capacity, Mimo Channel