DMT Optimality of LAST Codes
Closing the Loop: LAST + MMSE-GDFE Achieves the Zheng-Tse DMT
Section 1 built the LAST codebook. Section 2 built the MMSE-GDFE receiver. This section glues them together and proves the theorem that puts the "L" and the "ST" in "LAST" on equal footing: LAST codes with MMSE-GDFE decoding achieve the Zheng-Tse DMT curve for every and every .
The proof is an exercise in composition. The MMSE-GDFE (Β§2, Thm. TMMSE-GDFE Preserves Mutual Information) converts the MIMO channel into an equivalent effective-AWGN channel with aggregate capacity equal to the MIMO mutual information. The Erez-Zamir analysis (Ch. 16) says that on this effective channel, random- lattice codes achieve the capacity. So the error event of the LAST code on the equivalent channel is the outage event β the event that the channel mutual information falls below the rate. But the outage exponent is the Zheng-Tse DMT (by Zheng-Tse's definition). Thus the LAST code's error exponent equals .
The only subtlety is the interaction between random-coding averaging (over and over the dither ) and channel-outage averaging (over ). The point is that these two sources of randomness can be separated: first condition on (giving an Erez-Zamir lattice-AWGN channel), then average over to get the channel outage. The chain rule keeps the bookkeeping straight, and the Laplace method of Ch. 12 gives the Wishart eigenvalue exponent .
Theorem: LAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004)
Let be a sequence of LAST codes of block length with rate scaling as for fixed multiplexing gain , and let the fine lattice be drawn from the Minkowski-Hlawka random-lattice ensemble. Let the receiver be the MMSE-GDFE + layer-by-layer lattice decoder of AMMSE-GDFE + Layer-by-Layer Lattice Decoder. Then on the i.i.d. Rayleigh block-fading channel, the average codeword error probability satisfies for every (interpolated piecewise-linearly between integers). In words, LAST codes achieve the Zheng-Tse DMT curve.
The point is that the MMSE-GDFE reduces the MIMO decoding problem to lattice decoding on an equivalent AWGN channel; the Erez-Zamir analysis says lattices achieve capacity on that AWGN channel; and the channel outage β the event that the random has mutual information less than the rate β has probability decaying as by the Zheng-Tse Wishart-Laplace analysis (Ch. 12). The LAST error exponent equals the outage exponent, which is .
Decompose the error event into channel outage (the MIMO mutual information is below the rate) and lattice error given non-outage (the Erez-Zamir lattice fails on a typical channel).
For non-outage realisations, Erez-Zamir gives sub-exponential lattice-decoding error β negligible compared to the outage.
The channel-outage probability is by Zheng-Tse's Wishart eigenvalue analysis (Ch. 12).
Minkowski-Hlawka averaging over random converts the existence argument into a concrete error bound.
Step 1 β MMSE-GDFE preserves the MIMO mutual information
By Thm. TMMSE-GDFE Preserves Mutual Information, the MMSE-GDFE filter is a sufficient statistic for decoding from . Hence the mutual information between the transmitted codeword and the filtered observation equals the full MIMO mutual information . The LAST code's error probability on the MMSE-GDFE output equals the error probability on the raw MIMO output β no capacity is lost.
Step 2 β Erez-Zamir on the effective AWGN channel
Conditioned on , the MMSE-GDFE output with of covariance is equivalent (via Thm. TLAST Codebook Realises a Nested-Lattice Code over the MIMO Channel) to an Erez-Zamir lattice-AWGN channel with per-dimension SNR proportional to . By Erez-Zamir (Ch. 16), random nested-lattice codes at rate achieve arbitrarily small error probability in the limit of large block-length (lattice dimension).
Step 3 β Channel outage event dominates
The LAST error probability decomposes as On non-outage channel realisations, Erez-Zamir (Step 2) gives sub-exponentially as , so the second term is for any . The first term β the outage probability β is the asymptotic dominant contribution.
Step 4 β Zheng-Tse outage exponent
The Zheng-Tse analysis (Ch. 12) of the outage probability on the i.i.d. Rayleigh channel uses the eigenvalue decomposition (with at high SNR). The outage event becomes , and the Wishart joint density plus Laplace's method gives
Step 5 β Minkowski-Hlawka averaging
The random-lattice averaging (Minkowski-Hlawka, Ch. 15) turns "for the random " into "for a random draw, with concentration". Since the Erez-Zamir step is an average over , the ensemble-averaged error exponent is exactly , and by concentration there exists a deterministic sequence of lattices achieving the same exponent.
Step 6 β Matching Zheng-Tse converse
The Zheng-Tse converse (Ch. 12) states that every code on the i.i.d. Rayleigh MIMO channel with rate has error exponent at most . Combining with the achievability of Steps 1-5: the LAST code's error exponent equals .
Operational Interpretation of the Theorem
The theorem tells us something more than "LAST codes are DMT-optimal." It tells us that any lattice with adequate coding gain, when combined with MMSE-GDFE, achieves the DMT. The designer is free to pick the lattice β and the 2008 Kumar-Caire result (Β§4) says: pick the densest one for the best finite-SNR performance.
To see why this is the operational message, notice that the proof uses only two properties of the lattice ensemble: (i) Minkowski-Hlawka existence, i.e., there are lattices achieving density (Ch. 15); (ii) the crypto-lemma dither argument, which works for any shaping lattice with round Voronoi regions. These are not restrictive β they are satisfied by , , , the Leech lattice, Barnes-Wall, and indeed any "generic" lattice. Therefore the theorem applies immediately to structured LAST codes with inner lattice or Leech β which is what Β§4 proves formally.
This is a design lesson worth pausing on. The algebraic CDA-NVD proof of Ch. 13 works for one lattice β the one embedded in the cyclic division algebra β and breaks if you try to substitute a different one. The lattice-theoretic LAST proof works for every sufficiently dense lattice. As a designer, you get to optimise the lattice for finite-SNR coding gain without losing DMT-optimality. That is precisely the freedom Kumar and Caire exploit in Β§4.
Theorem: The DMT Is the Outage Exponent of the MIMO Channel
On the i.i.d. Rayleigh block-fading channel of block length , the outage-probability exponent with respect to rate equals the Zheng-Tse DMT: In words, the outage-exponent lower bound is tight: the best code achieves exactly the outage probability asymptotically.
This is the converse side of LAST's achievability. It says that no code can do better than outage β the channel is the fundamental bottleneck, and the code's job is only to realise the outage-floor performance. Every DMT-optimal code (Alamouti at , CDA-NVD, LAST, etc.) realises the same exponent via a different mechanism, but the ceiling is the same. The theorem is why "DMT-optimality" is a universal benchmark and not a code-specific quantity.
Use Zheng-Tse's Wishart eigenvalue parametrisation .
Express the outage event as .
Laplace's method on the joint Wishart density gives the SNR exponent subject to the outage constraint.
Minimise this exponent over the feasible β the minimum equals .
Step 1 β Eigenvalue parametrisation
Write for . At high SNR, for all with probability . The MIMO mutual information becomes . Hence outage is .
Step 2 β Wishart joint density
The joint density of the eigenvalues under i.i.d. Rayleigh has the form . In the -parametrisation and at high SNR, this becomes up to constants.
Step 3 β Laplace's method
Integrate over the outage region . By Laplace's method, the dominant contribution comes from minimising the exponent subject to the outage constraint.
Step 4 β Minimisation
For an integer, the optimum is for and for . The resulting exponent is . For non-integer , linear interpolation applies. Hence .
Example: DMT Curves for , , and MIMO
Tabulate the Zheng-Tse DMT curve at integer multiplexing gains for the following configurations and sketch the piecewise-linear curves: (a) ; (b) ; (c) . For a LAST code at , what diversity order does it achieve in each case?
Part (a): 2x2
. At the diversity is β a single-antenna-equivalent diversity order, even though we have 2 transmit and 2 receive antennas. This is the classical Zheng-Tse surprise: at the maximum multiplexing gain there is essentially no diversity advantage.
Part (b): 2x4
. The extra receive antennas help at every : at the diversity is (vs. for ), reflecting the receive-diversity gain encoded in the product.
Part (c): 4x4
. At (half the maximum multiplexing) a achieves diversity β the same as a at . A LAST code at on achieves diversity .
Common Mistake: DMT Is an Asymptotic Benchmark β Do Not Over-Interpret at Finite SNR
Mistake:
A reader may conclude that a DMT-optimal code (CDA-NVD, LAST, or any other) automatically outperforms a non-DMT-optimal code (plain V-BLAST, zero-forcing) at every SNR and every rate. In particular, one might expect a structured LAST code at on a channel to outperform ZF-V-BLAST at any operating SNR.
Correction:
DMT is an asymptotic () benchmark. At finite SNR, the coding gain β the shift of the error- probability-vs-SNR curve β matters more than the slope (the DMT). A structured LAST code with inner lattice may have a dB coding-gain advantage over ZF-V-BLAST but only start showing this advantage at . Below that SNR, ZF-V-BLAST (which is simpler and has lower decoder complexity) may be preferable. The DMT tells you the slope of the log-BER vs. log-SNR curve at asymptotic SNR; the coding gain tells you the intercept. Both matter in deployment, and a good design optimises both. This is exactly the reason the 2008 Kumar-Caire paper (Β§4) was needed: the 2004 LAST paper gave DMT-optimality (the slope), but structured LAST gives coding gain (the intercept). Together they are the full story.