Interpreting the DMT Curve

From Formula to Engineering: What the DMT Actually Says

Armed with the Zheng-Tse formula dβˆ—(r)=(ntβˆ’r)(nrβˆ’r)d^*(r) = (n_t - r)(n_r - r), we can now read the curve as an engineer. Three operational messages emerge.

Message 1 β€” the tradeoff is quadratic at r=0r = 0 and linear at r=rmax⁑r = r_{\max}. The initial slope βˆ’ddβˆ—/dr∣r=0=nt+nr-dd^*/dr|_{r = 0} = n_t + n_r is large: sacrificing the first unit of diversity buys you a lot of multiplexing. The final slope βˆ’ddβˆ—/dr∣r=rmaxβ‘βˆ’=1-dd^*/dr|_{r = r_{\max}^-} = 1 is small: sacrificing the last unit of multiplexing gets you only one unit of diversity back. The curve is concave on each linear segment, with shrinking segment slopes: the first r=0β†’1r = 0 \to 1 step costs nt+nrβˆ’1n_t + n_r - 1 units of dβˆ—d^*, the second r=1β†’2r = 1 \to 2 step costs nt+nrβˆ’3n_t + n_r - 3 units, …, the last r=mβˆ’1β†’mr = m-1 \to m step costs nt+nrβˆ’(2mβˆ’1)n_t + n_r - (2m - 1) units (= ∣ntβˆ’nr∣+1|n_t - n_r| + 1).

Message 2 β€” the curve is symmetric in (nt,nr)(n_t, n_r). A 4Γ—24 \times 2 and a 2Γ—42 \times 4 channel have identical DMT curves, even though their physical characteristics differ. Nonetheless, the tradeoff between reliability and rate is the same: you cannot tell by looking at an (r,dβˆ—)(r, d^*) plot whether the transmitter or the receiver has more antennas. This is a deep consequence of the outage-exponent computation: the Wishart density of HHH\mathbf{H}\mathbf{H}^{H} vs HHH\mathbf{H}^{H}\mathbf{H} differ only in the normalising constant, not in the eigenvalue-exponent rate function.

Message 3 β€” rank adaptation = DMT climbing. LTE and 5G NR contain an explicit rank indicator that the UE reports to the gNB, specifying the number of spatial streams k∈{1,…,min⁑(nt,nr)}k \in \{1, \ldots, \min(n_t, n_r)\} to be used for the next slot. Conceptually, kk is exactly the operating multiplexing gain β€” and the choice of kk navigates the DMT curve in response to channel conditions. The engineering note below makes this precise.

,

Definition:

DMT Slope and Segment Structure

The DMT curve has min⁑(nt,nr)\min(n_t, n_r) linear segments; on the kk-th segment r∈[k,k+1]r \in [k, k+1] the slope is ddβˆ—dr∣r∈(k,k+1)=(ntβˆ’kβˆ’1)(nrβˆ’kβˆ’1)βˆ’(ntβˆ’k)(nrβˆ’k)=βˆ’(nt+nrβˆ’2kβˆ’1).\left.\frac{dd^*}{dr}\right|_{r \in (k, k+1)} = (n_t - k - 1)(n_r - k - 1) - (n_t - k)(n_r - k) = -(n_t + n_r - 2k - 1). The initial slope (at r=0+r = 0^+) is βˆ’(nt+nrβˆ’1)-(n_t + n_r - 1); the final slope (at r=rmaxβ‘βˆ’r = r_{\max}^-) is βˆ’(∣ntβˆ’nr∣+1)-(|n_t - n_r| + 1).

Consequence. The DMT curve is concave and piecewise linear, not the continuous quadratic. It lies strictly above the continuous (ntβˆ’r)(nrβˆ’r)(n_t - r)(n_r - r) between corner points. The gap is achieved by time-sharing between two adjacent integer-rate codes: a code at r=kr = k run a fraction 1βˆ’(rβˆ’k)1 - (r - k) of the time, and a code at r=k+1r = k+1 run a fraction rβˆ’kr - k of the time, achieves multiplexing gain rr and diversity gain equal to the linear interpolation. This is the operational meaning of the "piecewise-linear interpolation".

,

Theorem: DMT Symmetry in (nt,nr)(n_t, n_r)

Under the assumptions of Thm. TZheng-Tse Diversity-Multiplexing Tradeoff, dβˆ—(r)Β forΒ anΒ ntΓ—nrΒ channelβ€…β€Š=β€…β€Šdβˆ—(r)Β forΒ anΒ nrΓ—ntΒ channeld^*(r) \text{ for an } n_t \times n_r \text{ channel} \;=\; d^*(r) \text{ for an } n_r \times n_t \text{ channel} for all r∈[0,min⁑(nt,nr)]r \in [0, \min(n_t, n_r)]. In particular, the tradeoff curves of an (nt,nr)(n_t, n_r) channel and its transpose (nr,nt)(n_r, n_t) are identical, independent of which side has more antennas.

The DMT only depends on the eigenvalues of the Wishart matrix. The Wishart matrices HHH\mathbf{H}\mathbf{H}^{H} (size nrΓ—nrn_r \times n_r) and HHH\mathbf{H}^{H}\mathbf{H} (size ntΓ—ntn_t \times n_t) have identical nonzero eigenvalues. The DMT exponent sees only those shared eigenvalues β€” the transmitter-vs-receiver asymmetry washes out.

,

Example: Slope Cost per Unit of rr: 4Γ—44 \times 4 vs 2Γ—22 \times 2

For 2Γ—22 \times 2 and 4Γ—44 \times 4 channels, tabulate the DMT segment slopes and interpret the change in tradeoff structure as antennas scale.

,

DMT Corner Points for Common MIMO Configurations

Configuration (nt,nr)(n_t, n_r)rmax⁑r_{\max}dβˆ—(0)d^*(0)Corner pointsInitial slope βˆ’(nt+nrβˆ’1)-(n_t + n_r - 1)
(1,1)(1, 1)1111(0,1),(1,0)(0, 1), (1, 0)βˆ’1-1
(1,2)(1, 2)1122(0,2),(1,0)(0, 2), (1, 0)βˆ’2-2
(2,2)(2, 2)2244(0,4),(1,1),(2,0)(0, 4), (1, 1), (2, 0)βˆ’3-3
(2,4)(2, 4) or (4,2)(4, 2)2288(0,8),(1,3),(2,0)(0, 8), (1, 3), (2, 0)βˆ’5-5
(3,3)(3, 3)3399(0,9),(1,4),(2,1),(3,0)(0, 9), (1, 4), (2, 1), (3, 0)βˆ’5-5
(4,4)(4, 4)441616(0,16),(1,9),(2,4),(3,1),(4,0)(0, 16), (1, 9), (2, 4), (3, 1), (4, 0)βˆ’7-7
(8,8)(8, 8)886464(0,64),(1,49),…,(7,1),(8,0)(0, 64), (1, 49), \ldots, (7, 1), (8, 0)βˆ’15-15
🚨Critical Engineering Note

Rank Adaptation in LTE and 5G NR: Walking the DMT Curve

Every LTE and 5G NR receiver computes and reports a rank indicator (RI) β€” an integer k∈{1,2,…,min⁑(nt,nr)}k \in \{1, 2, \ldots, \min(n_t, n_r)\} telling the base station how many spatial streams to transmit on the next slot. The RI is computed by the UE from measured channel coefficients and SNR: roughly, the UE picks the largest kk such that the expected throughput at rank kk exceeds the expected throughput at rank kβˆ’1k - 1.

This is literally walking the DMT curve. Each rank kk corresponds to a target multiplexing gain r=kr = k; rank adaptation selects the corner point (k,(ntβˆ’k)(nrβˆ’k))(k, (n_t - k)(n_r - k)) on the DMT curve that maximises throughput at the current SNR. At low SNR the optimal rank is 11 β€” the corner (1,(ntβˆ’1)(nrβˆ’1))(1, (n_t - 1)(n_r - 1)) gives maximum diversity consistent with a rate that grows. At high SNR the optimal rank is min⁑(nt,nr)\min(n_t, n_r) β€” the corner (min⁑(nt,nr),0)(\min(n_t, n_r), 0) gives maximum rate but no outage protection (the latter comes from HARQ retransmissions, which is how 5G NR closes the reliability gap at high rank).

5G NR specifics. The RI is reported over PUCCH / PUSCH with a periodicity of 55–8080 ms. The reported RI is fed to the scheduler, which allocates precoded resource elements. This is the adaptation loop that keeps the system operating on the DMT curve in real time.

Practical Constraints
  • β€’

    RI feedback periodicity: 55–8080 ms in 5G NR (type-I codebook).

  • β€’

    Rank ∈{1,2,…,min⁑(nt,nr,8)}\in \{1, 2, \ldots, \min(n_t, n_r, 8)\} for NR Rel-17 (up to 88 layers in PDSCH).

  • β€’

    Rank adaptation is per-UE, per-TTI; jointly optimised with modulation-coding-scheme (MCS) index.

  • β€’

    Massive MIMO (ntβ‰₯32n_t \ge 32) uses rank adaptation with kβ‰ͺntk \ll n_t; the diversity buffer (ntβˆ’k)(nrβˆ’k)(n_t - k)(n_r - k) stays large.

πŸ“‹ Ref: 3GPP TS 38.214 (physical layer procedures for data), Β§5.2.1.4 β€” CSI reporting
,

Common Mistake: The min⁑\min-Corner Is (min⁑(nt,nr),0)(\min(n_t, n_r), 0), Not (nt,0)(n_t, 0)

Mistake:

On an asymmetric MIMO channel (e.g., 4Γ—24 \times 2 or 2Γ—42 \times 4), locating the right endpoint of the DMT curve at r=ntr = n_t β€” i.e., assuming one can multiplex up to the number of transmit antennas.

Correction:

The maximum multiplexing gain is rmax⁑=min⁑(nt,nr)r_{\max} = \min(n_t, n_r), not ntn_t or nrn_r individually. On a 4Γ—24 \times 2 channel one can only support 22 parallel streams, because the receiver has only 22 antennas to disentangle them. The fourth-antenna excess at the transmitter provides diversity (available through precoding) but not multiplexing (which would require receive-side spatial resolution that doesn't exist).

Correspondingly, the DMT right endpoint is always at (min⁑(nt,nr),0)(\min(n_t, n_r), 0): for 4Γ—24 \times 2 it is (2,0)(2, 0), not (4,0)(4, 0). The left endpoint at dβˆ—(0)=ntnrd^*(0) = n_t n_r uses the full product β€” excess transmit antennas do contribute diversity, just not multiplexing.

Mnemonic. Multiplexing is receiver-limited as well as transmitter- limited; diversity is only limited by the product ntβ‹…nrn_t \cdot n_r. The asymmetry enters through the Wishart rank min⁑(nt,nr)\min(n_t, n_r) β€” the number of nonzero eigenvalues of HHH\mathbf{H}\mathbf{H}^{H}.

Quick Check

What is the DMT curve of a 6Γ—26 \times 2 i.i.d. Rayleigh MIMO channel?

dβˆ—(r)d^*(r) has corners at (0,12),(1,5),(2,0)(0, 12), (1, 5), (2, 0)

dβˆ—(r)d^*(r) has corners at (0,12),(1,5),(2,0),(3,0),…,(6,0)(0, 12), (1, 5), (2, 0), (3, 0), \ldots, (6, 0)

dβˆ—(r)d^*(r) has corners at (0,8),(1,3),(2,0)(0, 8), (1, 3), (2, 0)

dβˆ—(r)d^*(r) is the same as for 2Γ—22 \times 2

Three Ways to Read the DMT Curve

The DMT curve has three equivalent operational readings:

  1. Information-theoretic. dβˆ—(r)d^*(r) is the outage-diversity exponent at rate-scaling rlog⁑2SNRr \log_2 \text{SNR}. Every code's error probability decays no faster than SNRβˆ’dβˆ—(r)\text{SNR}^{-d^*(r)} at that rate. Tight via Zheng-Tse 2003.

  2. Code-design. dβˆ—(r)d^*(r) is the diversity order achieved by the best possible space-time code at multiplexing gain rr. Families sitting at (r,d)(r, d) with d<dβˆ—(r)d < d^*(r) are sub-optimal; families on the curve are DMT-optimal (Β§4).

  3. Scheduler. dβˆ—(r)d^*(r) is the reliability-vs-throughput budget that a rank-adaptive scheduler navigates in real time. The rank kk is the operating corner point; rank adaptation corresponds to jumping between adjacent corners.

All three readings are compatible and refer to the same asymptotic curve. The distinction matters when you are interpreting measurements: a code's actual BER at finite SNR depends on both its DMT operating point and its coding gain, which the DMT does not constrain.

,