Block Length and Correlation: DMT Refinements

Refining the DMT: When the Theorem's Hypotheses Break

The Zheng-Tse theorem assumes two things we have glossed over: (i) block length Lβ‰₯nt+nrβˆ’1L \ge n_t + n_r - 1 (so that the error-matrix product ΔΔH\boldsymbol{\Delta}\boldsymbol{\Delta}^H is full-rank under Gaussian random coding), and (ii) i.i.d. Rayleigh fading (so that the eigenvalues of HHH\mathbf{H}\mathbf{H}^{H} follow the unbiased Wishart distribution). In real systems both assumptions often fail:

  • Block length. 5G NR slot duration is 1414 OFDM symbols at sub-6 GHz, 77 at mmWave. On a fast-fading channel the coherence time may be as short as a handful of OFDM symbols, so typical STC block lengths are L=2L = 2–88 per fading block β€” often below the nt+nrβˆ’1n_t + n_r - 1 threshold for even moderate antenna counts (nt+nrβˆ’1=7n_t + n_r - 1 = 7 for 4Γ—44 \times 4).
  • Correlation. Real MIMO channels have substantial spatial correlation β€” between transmit antennas due to proximity, between receive antennas due to angular spread limits, and cross-correlation through scatterer geometry. A typical 5G Tx correlation coefficient ρ\rho is 0.30.3–0.70.7 for antenna spacing ≀λ/2\le \lambda/2.

This section states and proves two refinements:

  1. Block-length truncation. For L<nt+nrβˆ’1L < n_t + n_r - 1, the DMT curve is truncated at r=Lr = L: beyond that, no code can achieve positive diversity.
  2. Correlation invariance. For spatially correlated Rayleigh fading with full-rank correlation matrix, the DMT exponent is unchanged: dβˆ—(r)d^*(r) remains the Zheng-Tse curve. Correlation affects only the coding gain (multiplicative constant), not the exponent.

The second result is striking: correlation is a coding-gain penalty, not a DMT penalty. A code that is DMT-optimal on i.i.d. Rayleigh remains DMT-optimal on correlated Rayleigh (as long as det⁑Rt>0\det \mathbf{R}_t > 0), losing only a constant factor det⁑(Rt)βˆ’1/nt\det(\mathbf{R}_t)^{-1/n_t} in BER.

,

Theorem: DMT Truncation for Short Block Length

Consider an ntΓ—nrn_t \times n_r i.i.d. Rayleigh MIMO channel with coherent detection and space-time codeword block length LL. If Lβ‰₯nt+nrβˆ’1L \ge n_t + n_r - 1, the DMT curve is the full Zheng-Tse curve of Thm. TZheng-Tse Diversity-Multiplexing Tradeoff. If L<nt+nrβˆ’1L < n_t + n_r - 1, the DMT is truncated beyond the LL-th corner: dβˆ—(r)β€…β€Š=β€…β€Š{(ntβˆ’r)(nrβˆ’r)r∈[0,L]0r>Ld^*(r) \;=\; \begin{cases} (n_t - r)(n_r - r) & r \in [0, L] \\ 0 & r > L \end{cases} (piecewise-linear as before, between integer corners). In particular, for L<nt+nrβˆ’1L < n_t + n_r - 1, multiplexing gains r>Lr > L are not achievable with positive diversity.

The practical consequence: short block lengths (= short coherence time) impose a multiplexing ceiling r≀Lr \le L, independent of antenna count. A 4Γ—44 \times 4 channel with L=2L = 2 has rmax⁑eff=2r_{\max}^{\rm eff} = 2 β€” the channel cannot support more than 22 streams reliably per block, even though the antenna counts support 44.

The Zheng-Tse proof uses Gaussian random codewords of length LL. For the error matrix Ξ”=\ntnXβˆ’\ntnX^\boldsymbol{\Delta} = \ntn{X} - \hat{\ntn{X}} to be full-rank (rank ntn_t), we need Lβ‰₯ntL \ge n_t β€” but to fully exploit the nrΓ—ntn_r \times n_t Wishart eigenvalue pool we actually need Lβ‰₯nt+nrβˆ’1L \ge n_t + n_r - 1, which ensures that the joint rate function in the proof's LP is unconstrained. For shorter LL the LP acquires an extra constraint βˆ‘i(1βˆ’Ξ±i)+≀L\sum_i (1 - \alpha_i)^+ \le L, which caps the achievable multiplexing gain at LL.

,

DMT Truncation as Block Length LL Varies

The DMT curve dβˆ—(r)d^*(r) for a 4Γ—44 \times 4 channel as block length LL varies from 11 to nt+nrβˆ’1=7n_t + n_r - 1 = 7. At L=7L = 7 the full Zheng-Tse curve is recovered; at L=1L = 1 the curve is truncated to the segment r∈[0,1]r \in [0, 1]. Intermediate LL values give truncated curves at r=Lr = L. Practical coherence-time constraints (L=2L = 2–88 for 5G NR subcarriers, or L=1L = 1 for non-coherent STC) directly read off this plot.

Parameters
4
4
4

Theorem: DMT Invariance under Full-Rank Tx Correlation

Consider an ntΓ—nrn_t \times n_r correlated Rayleigh MIMO channel H=HwRt1/2\mathbf{H} = \mathbf{H}_{w} \mathbf{R}_t^{1/2}, where Hw\mathbf{H}_{w} has i.i.d. CN(0,1)\mathcal{CN}(0, 1) entries and Rt\mathbf{R}_t is the Hermitian positive-definite transmit correlation matrix with det⁑Rt>0\det \mathbf{R}_t > 0. Assume Lβ‰₯nt+nrβˆ’1L \ge n_t + n_r - 1 for full-rank codeword ensembles.

The DMT curve of the correlated channel is identical to the i.i.d. case: dcorrβˆ—(r)β€…β€Š=β€…β€Š(ntβˆ’r)(nrβˆ’r)atΒ integerΒ r∈{0,1,…,min⁑(nt,nr)},d^*_{\rm corr}(r) \;=\; (n_t - r)(n_r - r) \quad \text{at integer } r \in \{0, 1, \ldots, \min(n_t, n_r)\}, with piecewise-linear interpolation. Full-rank Tx correlation does not change the DMT exponent; only the coding gain is affected by the factor det⁑(Rt)βˆ’1/nt\det(\mathbf{R}_t)^{-1/n_t} (rate-rr outage probability multiplicatively scaled, invisibly to ≐\doteq).

det⁑(Rt)>0\det(\mathbf{R}_t) > 0 means Rt\mathbf{R}_t is non-degenerate β€” every transmit direction carries some fading. The eigenvalues of HHH=HwRtHwH\mathbf{H}\mathbf{H}^{H} = \mathbf{H}_{w} \mathbf{R}_t \mathbf{H}_{w}^{H} are those of Rt1/2HwHHwRt1/2\mathbf{R}_t^{1/2} \mathbf{H}_{w}^{H} \mathbf{H}_{w} \mathbf{R}_t^{1/2}, which is a scaled Wishart. Scaling the eigenvalues by the fixed factors (eigenvalues of Rt\mathbf{R}_t) changes the outage threshold multiplicatively but doesn't change the exponent: the rate-function in the large-deviations computation picks up only a constant shift, which is lost under ≐\doteq.

,

DMT under Tx Spatial Correlation

The DMT curve dβˆ—(r)d^*(r) for a 2Γ—22 \times 2 MIMO channel with exponential transmit correlation matrix Rt=[[1,ρ],[ρ,1]]\mathbf{R}_t = [[1, \rho], [\rho, 1]], as ρ\rho varies from 00 (i.i.d.) to 0.950.95 (nearly rank-deficient). The curve is invariant for any ρ∈[0,1)\rho \in [0, 1) (full-rank regime): full-rank correlation is a coding-gain penalty, not a DMT penalty. The outage probability at a fixed SNR is plotted in a secondary panel to show the coding-gain degradation as a function of ρ\rho β€” a constant offset that depends on det⁑(Rt)=1βˆ’Ο2\det(\mathbf{R}_t) = 1 - \rho^2. At ρ=1\rho = 1 the correlation becomes rank-deficient and the DMT does degrade (not shown, since the slider stops at 0.950.95).

Parameters
2
2
0.7
⚠️Engineering Note

Coherence Time and DMT Block Length in 5G NR

The "block length" LL in the DMT theorem is the number of channel uses over which the fading is constant β€” i.e., the coherence time measured in channel uses. For 5G NR OFDM systems:

  • Sub-6 GHz, pedestrian users (33 km/h at 3.53.5 GHz carrier): coherence time ∼30\sim 30 ms, which at 14 OFDM symbols per 1-ms slot is ∼400\sim 400 OFDM symbols. Across the 273 allocated subcarriers this is enormous β€” effectively infinite LL. DMT fully applies.
  • Sub-6 GHz, vehicular users (100100 km/h at 3.53.5 GHz): coherence time ∼1\sim 1 ms, so ∼14\sim 14 OFDM symbols. Still Lβ‰₯7=nt+nrβˆ’1L \ge 7 = n_t + n_r - 1 for 4Γ—44 \times 4; DMT applies.
  • mmWave (28 GHz), vehicular users (100100 km/h at 2828 GHz): coherence time ∼100\sim 100 ΞΌs, or ∼1.4\sim 1.4 OFDM symbols at 7 symbols/ slot. This is too short for full-DMT operation of a 4Γ—44 \times 4 code; the truncation theorem bites.
  • Wi-Fi 7 indoor (11 m/s at 55 GHz): coherence time ∼30\sim 30 ms, ∼1500\sim 1500 OFDM symbols. DMT fully applies.

Operational implication. For mmWave high-mobility scenarios (V2X, drones), the effective rmax⁑r_{\max} is capped by the coherence-time LL, not by the antenna count. This is one driver for non-coherent space-time codes in high-mobility mmWave links β€” when LL is so short that even training overhead would kill the DMT, the code itself must work without channel estimation.

Practical Constraints
  • β€’

    5G NR OFDM symbol duration: 71.471.4 ΞΌs (15 kHz sub-6 GHz), 8.98.9 ΞΌs (120 kHz mmWave).

  • β€’

    Coherence time at 100 km/h: ∼1\sim 1 ms at 3.5 GHz, ∼100\sim 100 μs at 28 GHz.

  • β€’

    nt+nrβˆ’1n_t + n_r - 1 threshold: 77 for 4Γ—44 \times 4, 1515 for 8Γ—88 \times 8 β€” sometimes exceeds mmWave coherence time.

πŸ“‹ Ref: 3GPP TS 38.211 (physical channels and modulation), Β§4.3 (frame structure)
πŸ”§Engineering Note

Measured Tx Correlation in LTE/5G: ρ∈[0.3,0.7]\rho \in [0.3, 0.7]

Measured Tx correlation coefficients in 3GPP urban microcell scenarios (UMi) range from ρ=0.3\rho = 0.3–0.70.7 for typical Ξ»/2\lambda/2-spaced antenna arrays at the base station. At the UE, smaller form factors and mutual coupling yield ρ=0.5\rho = 0.5–0.90.9 on handset devices.

What the DMT theorem says about this. For any ρ<1\rho < 1 (full- rank correlation), the DMT exponent is unchanged from the i.i.d. Rayleigh case. This is reassuring: real channels that are not i.i.d. do not lose their fundamental multiplexing structure. The coding gain, however, does degrade by (det⁑Rt)βˆ’1/nt(\det \mathbf{R}_t)^{-1/n_t}, which for Rt=Toeplitz([1,ρ,ρ2,…])\mathbf{R}_t = \mathrm{Toeplitz}([1, \rho, \rho^2, \ldots]) with ρ=0.5\rho = 0.5 gives detβ‘βˆ’1/nt∼1.2\det^{-1/n_t} \sim 1.2 on 2Γ—22 \times 2 (a ∼1\sim 1 dB loss) and ∼1.8\sim 1.8 on 4Γ—44 \times 4 (∼2.5\sim 2.5 dB).

Design implication. A 5G base station with closely-spaced antennas should expect 11–33 dB coding-gain loss vs i.i.d. Rayleigh, but the DMT exponent is robust. Spatial separation (e.g., polarisation diversity, distributed antennas) reduces ρ\rho and recovers the coding gain β€” which is why massive MIMO arrays use non-adjacent element positioning and mixed polarisations.

Practical Constraints
  • β€’

    3GPP UMi high-correlation scenario: ρ=0.6\rho = 0.6 to 0.90.9 per 38.901 spatial channel model.

  • β€’

    Coding-gain penalty from full-rank correlation: ∼1\sim 1–33 dB.

  • β€’

    DMT exponent unchanged for any det⁑(Rt)>0\det(\mathbf{R}_t) > 0.

πŸ“‹ Ref: 3GPP TR 38.901 (channel model), Β§7.5

Common Mistake: Rank-Deficient Correlation Does Break DMT

Mistake:

Concluding from Thm. TDMT Invariance under Full-Rank Tx Correlation that "correlation doesn't matter for DMT" β€” in particular for rank-deficient correlation matrices (det⁑Rt=0\det \mathbf{R}_t = 0).

Correction:

The theorem requires det⁑Rt>0\det \mathbf{R}_t > 0 (full-rank). For rank- deficient correlation, the DMT does degrade. Specifically, if rank(Rt)=k<nt\mathrm{rank}(\mathbf{R}_t) = k < n_t, then only kk transmit directions are active and the effective MIMO channel is kΓ—nrk \times n_r: the DMT reduces to deffβˆ—(r)=(kβˆ’r)(nrβˆ’r)d^*_{\rm eff}(r) = (k - r)(n_r - r) with rmax⁑=min⁑(k,nr)r_{\max} = \min(k, n_r).

When this matters. Rank-deficient correlation arises in:

  • Line-of-sight (LOS) channels with few scatterers β€” the channel has only a handful of dominant directions.
  • Analog-beamforming / hybrid-precoding architectures, where the RF phase shifters constrain the effective transmit subspace to a small rank.
  • Keyhole channels (classical, rarely encountered) with rank-11 spatial correlation.

In all these cases the "effective ntn_t" is smaller than the physical ntn_t, and the DMT is governed by the reduced dimensionality. This connects Chapter 12 to Chapter 10's discussion of the keyhole effect and to Chapter 21's treatment of high-mobility correlation.

Quick Check

On a 4Γ—44 \times 4 i.i.d. Rayleigh channel with block length L=2L = 2, what is rmax⁑effr_{\max}^{\rm eff}?

rmax⁑eff=4r_{\max}^{\rm eff} = 4 (the full spatial degrees of freedom)

rmax⁑eff=2r_{\max}^{\rm eff} = 2

rmax⁑eff=7r_{\max}^{\rm eff} = 7 (= nt+nrβˆ’1n_t + n_r - 1)

rmax⁑eff=0r_{\max}^{\rm eff} = 0 (no multiplexing possible)

Quick Check

A 4Γ—44 \times 4 channel has Tx correlation matrix Rt\mathbf{R}_t with det⁑Rt=0.5>0\det \mathbf{R}_t = 0.5 > 0 (full rank). At r=2r = 2 multiplexing, the DMT exponent is:

dβˆ—(2)=(4βˆ’2)(4βˆ’2)=4d^*(2) = (4 - 2)(4 - 2) = 4

dβˆ—(2)=4β‹…0.5=2d^*(2) = 4 \cdot 0.5 = 2 (diversity scales with det⁑Rt\det \mathbf{R}_t)

dβˆ—(2)=0d^*(2) = 0 (any correlation kills the DMT)

Cannot determine without knowing the correlation eigenvalues

Why This Matters: Next: ARQ-DMT and Cooperative Diversity

Chapter 14 extends the DMT framework to ARQ-based MIMO systems, where the transmitter can retransmit after receiving a NACK. Each retransmission adds both diversity (the new fading realization is statistically independent) and rate flexibility (the effective code rate adapts to the channel). The ARQ-DMT of El Gamal-Caire- Damen 2004 gives a closed-form tradeoff curve that strictly exceeds the Zheng-Tse DMT β€” at the cost of feedback latency. 5G NR HARQ is a practical implementation of the ARQ-DMT.

Chapter 22 discusses the open problems: non-coherent DMT (when LL is too short for training), short-packet DMT (when asymptotic analysis breaks at finite blocklength), and finite-alphabet DMT (when Gaussian codebooks must be replaced with QAM/PSK for implementability).

Historical Note: CommIT Group Contributions to DMT Theory

2004–2006

The CommIT research group β€” led by Giuseppe Caire across appointments at Eurecom (1998–2010) and USC/TU Berlin (2010–) β€” made three foundational contributions to post-Zheng-Tse DMT theory:

  1. Lattice codes achieve the DMT. El Gamal, Caire, and Damen (2004 IEEE Trans. IT) proved that LAST codes β€” lattice space-time codes built from dense lattice packings β€” achieve the entire Zheng-Tse DMT curve. This was the first explicit (albeit high-complexity) construction of DMT-optimal codes for arbitrary ntΓ—nrn_t \times n_r channels. The proof uses the MMSE-GDFE receiver as the lattice-domain counterpart of the MMSE-SIC receiver for Gaussian codes. Chapter 17 builds on this.

  2. Explicit DMT-optimal CDA codes. Elia, Kumar, Pawar, Kumar, Lu, and Caire (2006 IEEE Trans. IT) constructed explicit DMT- optimal space-time codes from cyclic division algebras (CDA) over number fields. These are shorter and simpler than the lattice codes of El Gamal-Caire-Damen 2004, with bounded-complexity sphere- decoder reception. The Golden code of Belfiore-Rekaya-Viterbo (2005) is the 2Γ—22 \times 2 instance; Elia et al. proved the general construction achieves the full DMT. Chapter 13 covers the CDA construction.

  3. ARQ-DMT. El Gamal, Caire, and Damen (2006 IEEE Trans. IT) extended the DMT framework to incremental-redundancy HARQ systems, computing the tradeoff gains from each retransmission. The ARQ-DMT is the information-theoretic foundation of 5G NR HARQ. Chapter 14 covers this.

Collectively these three papers closed the circle: from Zheng-Tse's existence proof (Gaussian random codes) to explicit constructions (CDA, LAST) to practical ARQ extensions. The CDA / Perfect codes of Chapter 13 are the most mature of these and underlie the 3GPP Release 8 LTE-Advanced "tall" MIMO codes.

,

Key Takeaway

Diversity and multiplexing trade off along a precise curve. For an ntΓ—nrn_t \times n_r i.i.d. Rayleigh MIMO channel with block length Lβ‰₯nt+nrβˆ’1L \ge n_t + n_r - 1, the Zheng-Tse DMT is the piecewise-linear interpolation of (k,(ntβˆ’k)(nrβˆ’k))(k, (n_t - k)(n_r - k)) for k=0,1,…,min⁑(nt,nr)k = 0, 1, \ldots, \min(n_t, n_r). Full-rank Tx correlation affects coding gain but not the DMT exponent. Short block length L<nt+nrβˆ’1L < n_t + n_r - 1 truncates the curve at r=Lr = L. Alamouti achieves full diversity only at r=0r = 0; V-BLAST achieves full multiplexing only at r=rmax⁑r = r_{\max}; CDA / Golden codes (Chapter 13) achieve the entire curve.