Block Length and Correlation: DMT Refinements
Refining the DMT: When the Theorem's Hypotheses Break
The Zheng-Tse theorem assumes two things we have glossed over: (i) block length (so that the error-matrix product is full-rank under Gaussian random coding), and (ii) i.i.d. Rayleigh fading (so that the eigenvalues of follow the unbiased Wishart distribution). In real systems both assumptions often fail:
- Block length. 5G NR slot duration is OFDM symbols at sub-6 GHz, at mmWave. On a fast-fading channel the coherence time may be as short as a handful of OFDM symbols, so typical STC block lengths are β per fading block β often below the threshold for even moderate antenna counts ( for ).
- Correlation. Real MIMO channels have substantial spatial correlation β between transmit antennas due to proximity, between receive antennas due to angular spread limits, and cross-correlation through scatterer geometry. A typical 5G Tx correlation coefficient is β for antenna spacing .
This section states and proves two refinements:
- Block-length truncation. For , the DMT curve is truncated at : beyond that, no code can achieve positive diversity.
- Correlation invariance. For spatially correlated Rayleigh fading with full-rank correlation matrix, the DMT exponent is unchanged: remains the Zheng-Tse curve. Correlation affects only the coding gain (multiplicative constant), not the exponent.
The second result is striking: correlation is a coding-gain penalty, not a DMT penalty. A code that is DMT-optimal on i.i.d. Rayleigh remains DMT-optimal on correlated Rayleigh (as long as ), losing only a constant factor in BER.
Theorem: DMT Truncation for Short Block Length
Consider an i.i.d. Rayleigh MIMO channel with coherent detection and space-time codeword block length . If , the DMT curve is the full Zheng-Tse curve of Thm. TZheng-Tse Diversity-Multiplexing Tradeoff. If , the DMT is truncated beyond the -th corner: (piecewise-linear as before, between integer corners). In particular, for , multiplexing gains are not achievable with positive diversity.
The practical consequence: short block lengths (= short coherence time) impose a multiplexing ceiling , independent of antenna count. A channel with has β the channel cannot support more than streams reliably per block, even though the antenna counts support .
The Zheng-Tse proof uses Gaussian random codewords of length . For the error matrix to be full-rank (rank ), we need β but to fully exploit the Wishart eigenvalue pool we actually need , which ensures that the joint rate function in the proof's LP is unconstrained. For shorter the LP acquires an extra constraint , which caps the achievable multiplexing gain at .
Rank of is .
For , the Zheng-Tse LP has an active constraint and the optimum is cut off at .
Zheng-Tse Thm. 3 formalises this via a careful analysis of the rank-deficient codeword ensemble.
Rank constraint on codewords
A space-time codeword matrix is . If , then , so the error matrix can have rank at most . The rank- constraint limits the number of independent streams the code can transmit per block to .
LP with block-length constraint
In the Zheng-Tse eigenvalue-exponent LP of Thm. TZheng-Tse Diversity-Multiplexing Tradeoff, the relevant eigenvalues are those of β which has rank . For , only the smallest-exponent eigenvalues contribute to the outage event; the extra directions cannot carry any rate. The LP rate-function becomes and the constraint caps at .
Truncation formula
For , the LP optimum is the same as in the unconstrained case (Zheng-Tse): at integer corners. For , the LP is infeasible (cannot achieve positive diversity with rate above per block). The DMT is therefore truncated at .
Special case . Single-channel-use codes (no time dimension) can support only multiplexing. The DMT curve is for and otherwise β a two-corner curve regardless of how many antennas.
Special case . The full Zheng-Tse curve is recovered. This is the "thick codeword" regime β Chapter 13's CDA codes need (just enough to support rank- error matrices with the algebraic structure), and the extra "wiggle room" up to is for Gaussian random codes to achieve the exponent with probability .
DMT Truncation as Block Length Varies
The DMT curve for a channel as block length varies from to . At the full Zheng-Tse curve is recovered; at the curve is truncated to the segment . Intermediate values give truncated curves at . Practical coherence-time constraints (β for 5G NR subcarriers, or for non-coherent STC) directly read off this plot.
Parameters
Coherence Time and DMT Block Length in 5G NR
The "block length" in the DMT theorem is the number of channel uses over which the fading is constant β i.e., the coherence time measured in channel uses. For 5G NR OFDM systems:
- Sub-6 GHz, pedestrian users ( km/h at GHz carrier): coherence time ms, which at 14 OFDM symbols per 1-ms slot is OFDM symbols. Across the 273 allocated subcarriers this is enormous β effectively infinite . DMT fully applies.
- Sub-6 GHz, vehicular users ( km/h at GHz): coherence time ms, so OFDM symbols. Still for ; DMT applies.
- mmWave (28 GHz), vehicular users ( km/h at GHz): coherence time ΞΌs, or OFDM symbols at 7 symbols/ slot. This is too short for full-DMT operation of a code; the truncation theorem bites.
- Wi-Fi 7 indoor ( m/s at GHz): coherence time ms, OFDM symbols. DMT fully applies.
Operational implication. For mmWave high-mobility scenarios (V2X, drones), the effective is capped by the coherence-time , not by the antenna count. This is one driver for non-coherent space-time codes in high-mobility mmWave links β when is so short that even training overhead would kill the DMT, the code itself must work without channel estimation.
- β’
5G NR OFDM symbol duration: ΞΌs (15 kHz sub-6 GHz), ΞΌs (120 kHz mmWave).
- β’
Coherence time at 100 km/h: ms at 3.5 GHz, ΞΌs at 28 GHz.
- β’
threshold: for , for β sometimes exceeds mmWave coherence time.
Measured Tx Correlation in LTE/5G:
Measured Tx correlation coefficients in 3GPP urban microcell scenarios (UMi) range from β for typical -spaced antenna arrays at the base station. At the UE, smaller form factors and mutual coupling yield β on handset devices.
What the DMT theorem says about this. For any (full- rank correlation), the DMT exponent is unchanged from the i.i.d. Rayleigh case. This is reassuring: real channels that are not i.i.d. do not lose their fundamental multiplexing structure. The coding gain, however, does degrade by , which for with gives on (a dB loss) and on ( dB).
Design implication. A 5G base station with closely-spaced antennas should expect β dB coding-gain loss vs i.i.d. Rayleigh, but the DMT exponent is robust. Spatial separation (e.g., polarisation diversity, distributed antennas) reduces and recovers the coding gain β which is why massive MIMO arrays use non-adjacent element positioning and mixed polarisations.
- β’
3GPP UMi high-correlation scenario: to per 38.901 spatial channel model.
- β’
Coding-gain penalty from full-rank correlation: β dB.
- β’
DMT exponent unchanged for any .
Common Mistake: Rank-Deficient Correlation Does Break DMT
Mistake:
Concluding from Thm. TDMT Invariance under Full-Rank Tx Correlation that "correlation doesn't matter for DMT" β in particular for rank-deficient correlation matrices ().
Correction:
The theorem requires (full-rank). For rank- deficient correlation, the DMT does degrade. Specifically, if , then only transmit directions are active and the effective MIMO channel is : the DMT reduces to with .
When this matters. Rank-deficient correlation arises in:
- Line-of-sight (LOS) channels with few scatterers β the channel has only a handful of dominant directions.
- Analog-beamforming / hybrid-precoding architectures, where the RF phase shifters constrain the effective transmit subspace to a small rank.
- Keyhole channels (classical, rarely encountered) with rank- spatial correlation.
In all these cases the "effective " is smaller than the physical , and the DMT is governed by the reduced dimensionality. This connects Chapter 12 to Chapter 10's discussion of the keyhole effect and to Chapter 21's treatment of high-mobility correlation.
Quick Check
On a i.i.d. Rayleigh channel with block length , what is ?
(the full spatial degrees of freedom)
(= )
(no multiplexing possible)
Thm. TDMT Truncation for Short Block Length: for , the DMT is truncated at . At , the achievable multiplexing gain is capped at β beyond that, no positive diversity is possible.
Quick Check
A channel has Tx correlation matrix with (full rank). At multiplexing, the DMT exponent is:
(diversity scales with )
(any correlation kills the DMT)
Cannot determine without knowing the correlation eigenvalues
Thm. TDMT Invariance under Full-Rank Tx Correlation: full-rank correlation preserves the DMT exponent. is the same as on i.i.d. Rayleigh: . The coding gain is degraded by the factor (a dB loss), but this is invisible to the DMT exponent.
Why This Matters: Next: ARQ-DMT and Cooperative Diversity
Chapter 14 extends the DMT framework to ARQ-based MIMO systems, where the transmitter can retransmit after receiving a NACK. Each retransmission adds both diversity (the new fading realization is statistically independent) and rate flexibility (the effective code rate adapts to the channel). The ARQ-DMT of El Gamal-Caire- Damen 2004 gives a closed-form tradeoff curve that strictly exceeds the Zheng-Tse DMT β at the cost of feedback latency. 5G NR HARQ is a practical implementation of the ARQ-DMT.
Chapter 22 discusses the open problems: non-coherent DMT (when is too short for training), short-packet DMT (when asymptotic analysis breaks at finite blocklength), and finite-alphabet DMT (when Gaussian codebooks must be replaced with QAM/PSK for implementability).
Historical Note: CommIT Group Contributions to DMT Theory
2004β2006The CommIT research group β led by Giuseppe Caire across appointments at Eurecom (1998β2010) and USC/TU Berlin (2010β) β made three foundational contributions to post-Zheng-Tse DMT theory:
-
Lattice codes achieve the DMT. El Gamal, Caire, and Damen (2004 IEEE Trans. IT) proved that LAST codes β lattice space-time codes built from dense lattice packings β achieve the entire Zheng-Tse DMT curve. This was the first explicit (albeit high-complexity) construction of DMT-optimal codes for arbitrary channels. The proof uses the MMSE-GDFE receiver as the lattice-domain counterpart of the MMSE-SIC receiver for Gaussian codes. Chapter 17 builds on this.
-
Explicit DMT-optimal CDA codes. Elia, Kumar, Pawar, Kumar, Lu, and Caire (2006 IEEE Trans. IT) constructed explicit DMT- optimal space-time codes from cyclic division algebras (CDA) over number fields. These are shorter and simpler than the lattice codes of El Gamal-Caire-Damen 2004, with bounded-complexity sphere- decoder reception. The Golden code of Belfiore-Rekaya-Viterbo (2005) is the instance; Elia et al. proved the general construction achieves the full DMT. Chapter 13 covers the CDA construction.
-
ARQ-DMT. El Gamal, Caire, and Damen (2006 IEEE Trans. IT) extended the DMT framework to incremental-redundancy HARQ systems, computing the tradeoff gains from each retransmission. The ARQ-DMT is the information-theoretic foundation of 5G NR HARQ. Chapter 14 covers this.
Collectively these three papers closed the circle: from Zheng-Tse's existence proof (Gaussian random codes) to explicit constructions (CDA, LAST) to practical ARQ extensions. The CDA / Perfect codes of Chapter 13 are the most mature of these and underlie the 3GPP Release 8 LTE-Advanced "tall" MIMO codes.
Key Takeaway
Diversity and multiplexing trade off along a precise curve. For an i.i.d. Rayleigh MIMO channel with block length , the Zheng-Tse DMT is the piecewise-linear interpolation of for . Full-rank Tx correlation affects coding gain but not the DMT exponent. Short block length truncates the curve at . Alamouti achieves full diversity only at ; V-BLAST achieves full multiplexing only at ; CDA / Golden codes (Chapter 13) achieve the entire curve.