Exercises
ex-ch17-01
EasyState the definition of a LAST codebook. Given , , block length , and , write down the codebook cardinality and the rate in bits per channel use.
Cardinality is .
Rate is .
Cardinality
.
Rate
bits/ch.use.
ex-ch17-02
EasyExplain why the common random dither is essential for the LAST construction, and what breaks if the dither is replaced by .
Recall the crypto lemma (Erez-Zamir): modulo a lattice, a dithered value is uniform.
Without dither, the codeword is deterministic โ what property fails?
Role of the dither
The dither makes the transmit signal uniform on regardless of the message (Erez-Zamir crypto lemma). This uniformity is the lattice analog of i.i.d. Gaussian random coding: it lets us compute ensemble- averaged error probabilities.
Without dither
If , the transmit signal is , which depends deterministically on and does not average to uniform. The DMT proof via Minkowski-Hlawka averaging then breaks: the error probability cannot be bounded by without the uniformity. Practically, undithered LAST codes have larger finite-SNR BER and worse asymptotic slope.
ex-ch17-03
EasyCompute the augmented channel matrix for , , .
Use .
.
MMSE coefficient
, .
Augmented matrix
, a real matrix.
ex-ch17-04
MediumProve that the MMSE-GDFE filter satisfies โ i.e., is not unitary.
Use with .
Partition ; use .
; also .
Setup
From and the block partition: .
Compute $\mathbf{Q}_2^H \mathbf{Q}_2$
The bottom block of is , so and .
Recover $\mathbf{F}^H \mathbf{F}$
, i.e., a non-unitary positive-definite operator.
ex-ch17-05
MediumState the three ingredients of the LAST-DMT-optimality proof (El Gamal-Caire-Damen 2004, Thm. 1). Then, for each ingredient, identify the chapter of this book where it was established.
The ingredients are in the proof-steps of Thm. TLAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004).
Ch. 12 gave the outage-exponent; Ch. 16 gave the Erez-Zamir AWGN result; Ch. 15 gave Minkowski-Hlawka.
Three ingredients
(1) MMSE-GDFE preserves mutual information (this chapter, ยง2 Thm. TMMSE-GDFE Preserves Mutual Information). (2) Erez-Zamir lattice coding achieves the AWGN capacity on the effective channel. (3) Zheng-Tse Wishart-Laplace analysis giving .
Cross-references
(1) is this chapter's ยง2. (2) is Ch. 16 (Erez-Zamir 2004) plus Ch. 15 (Minkowski-Hlawka random-lattice averaging). (3) is Ch. 12 (Zheng-Tse 2003). Composition of three prior results gives the LAST theorem.
ex-ch17-06
MediumFor , tabulate the Zheng-Tse DMT curve at . At , compute the achievable diversity order. What slope does the BER-vs-SNR curve of a LAST code achieve at ?
Piecewise-linear interpolation between .
BER slope equals the diversity order.
Tabulation
, , , , .
Diversity and slope at $r = 1.5$
Diversity . BER-vs-SNR slope is , i.e., BER at high SNR. This is a very shallow slope โ at on a channel, the diversity advantage of the LAST code is minimal; we are essentially multiplexing at the full MIMO rate.
ex-ch17-07
MediumWrite down the Erez-Zamir "equivalent-AWGN channel" seen by the lattice decoder after MMSE-GDFE. Specifically, for the LAST code over a MIMO channel , describe the effective noise variance per layer and the effective SNR aggregate.
After MMSE-GDFE the per-layer signal is with .
Per-layer SNR is .
Use Thm. TMMSE-GDFE Triangularises the MIMO Channel for the product formula.
Per-layer effective SNR
Per layer , the signal is , the effective noise has unit variance, and the effective SNR is .
Aggregate SNR
(by Thm. TMMSE-GDFE Triangularises the MIMO Channel). Taking log: , which equals the MIMO mutual information (up to the MMSE bias offset). The equivalent channel is a product of scalar AWGN channels with varying per-layer SNR, whose aggregate capacity equals the full MIMO capacity.
ex-ch17-08
MediumCompute the normalised coding gain of (the densest 4D lattice, with , ) relative to . Translate this into the BER shift for a structured LAST code on with block length .
.
BER shift factor is .
Coding gain
. . Ratio , or dB.
BER shift
, i.e., dB of SNR shift at the same BER target. At the BER operating point, structured--LAST operates dB below -LAST.
ex-ch17-09
MediumState and prove that MMSE-GDFE is a linear sufficient statistic for decoding the transmitted codeword from the received vector on a MIMO channel.
A linear transformation with full column rank is a sufficient statistic.
Use Thm. TMMSE-GDFE Preserves Mutual Information and the structure of .
Statement
The MMSE-GDFE filter applied to gives that is a sufficient statistic for , i.e., and form a Markov chain (trivially, as is a function of ).
Proof
From Exercise 4, , which is positive-definite (hence full column rank) whenever has full rank. A full-column-rank linear transformation of preserves mutual information with (data-processing equality for invertible linear maps). Hence is a sufficient statistic for .
ex-ch17-10
MediumGiven the MMSE-GDFE decoder's complexity and the sphere decoder's average complexity , determine the value of at which the sphere decoder becomes cheaper than MMSE-GDFE for .
Set and solve for .
With , cubic cost is . Sphere cost is .
Equation
.
Interpretation
At (e.g., BPSK/QPSK), sphere decoding is cheaper than MMSE-GDFE. At (16-QAM), MMSE-GDFE wins: sphere decoder cost is , much larger than MMSE-GDFE's . For standard MIMO rates (16-QAM and above), MMSE-GDFE is the clear winner.
ex-ch17-11
HardConsider a LAST code with rate bits/ch.use on a i.i.d. Rayleigh channel. At SNR dB, the target BER is . (a) What is the required multiplexing gain? (b) What is the DMT exponent at this ? (c) Is structured--LAST feasible in this setting? (d) What coding-gain advantage would it provide over random LAST?
Multiplexing gain .
Check if for an -compatible configuration.
Part (a): Multiplexing gain
.
Part (b): DMT exponent
. BER decays as .
Part (c): $E_8$ compatibility
lives in , so we need , i.e., . With , we need . Yes, structured- -LAST with fits the setting.
Part (d): Coding-gain advantage
From Ex. 8's pattern: (i.e., dB); BER shift factor , i.e., dB of SNR shift. Structured--LAST operates dB below random LAST at BER .
ex-ch17-12
HardDerive the per-layer effective SNR of the MMSE-GDFE for a i.i.d. Rayleigh channel at . Average the result over the channel distribution to obtain the ensemble-averaged aggregate SNR.
Per-layer SNR is .
Use via the Wishart moments.
where is the -principal minor.
Per-layer SNR (formal)
are the squared diagonals of the QR factor of . By QR properties, . For : where are eigenvalues of .
Ensemble average
Under i.i.d. Rayleigh, (for normalised), . With : .
Aggregate log
(per block, ). The lattice decoder sees an ensemble-averaged aggregate SNR in this range โ specific block realisations may be higher or lower depending on channel eigenvalues.
ex-ch17-13
HardExplain why zero-forcing V-BLAST does NOT achieve the full DMT on an i.i.d. Rayleigh channel with . Specifically: derive the diversity order of ZF-V-BLAST at the maximum multiplexing gain .
ZF decorrelates via , converting to per-stream Gaussian channels.
Per-stream effective SNR is .
The diagonal entry has the inverse-Wishart distribution, whose tail gives the diversity.
ZF equivalent channel
After ZF, each stream sees a Gaussian channel with effective noise variance . The per-stream SNR is .
Distribution of the diagonal
is inverse-Wishart; its diagonals are distributed as . Hence the per-stream SNR is distributed as , which has diversity order .
DMT conclusion
At , each of the streams carries rate , and the probability of per-stream outage is . This is strictly smaller than the Zheng-Tse (degenerate case, where we should have no outage asymptotically); but at intermediate the ZF diversity is stuck at , below the Zheng-Tse . Hence ZF-V-BLAST is not DMT-optimal. This is precisely the pitfall that MMSE-GDFE fixes.
ex-ch17-14
HardProve that MMSE-GDFE + lattice decoding and MMSE-SIC + Gaussian- random-code ML decoding achieve the same aggregate capacity on a MIMO channel. This is the formal statement of the "MMSE-GDFE is the lattice analog of MMSE-SIC" remark.
Both receivers triangularise via QR and decode layer by layer.
Per-layer aggregate capacity: .
Show this equals .
Per-layer capacity
Per layer of the triangular system, the achievable rate with a Gaussian random code is ; with a lattice code, the achievable rate is (Erez-Zamir gives per real dimension, or at high SNR per complex dimension).
Aggregate
.
Conclusion
Both MMSE-GDFE (lattice) and MMSE-SIC (Gaussian) achieve the same aggregate capacity on the triangularised channel. The receivers differ only in which codebook they assume โ Gaussian random code versus lattice code. This is the precise sense in which MMSE-GDFE is the lattice analog of MMSE-SIC.
ex-ch17-15
MediumDesign a structured LAST code for a MIMO channel with block length , using the Leech lattice as the inner lattice. (a) Verify the dimension matching. (b) Compute the codebook cardinality for . (c) State the coding-gain advantage over random LAST.
, so we need .
is the shaping factor: .
, i.e., dB over .
Part (a): Dimension check
. Matches Leech dim.
Part (b): Cardinality
codewords, i.e., a rate of bits per channel use across 3 transmit antennas ( bits/ch.use per antenna, equivalent to -QAM per layer).
Part (c): Coding-gain advantage
( dB), so BER shift factor is dB. Structured-Leech-LAST operates dB below random LAST at the same BER target โ an enormous finite-SNR advantage from the Leech lattice's extreme density.
ex-ch17-16
MediumGiven that MMSE-GDFE achieves the MIMO mutual information per Thm. TMMSE-GDFE Preserves Mutual Information, would it be correct to say "MMSE-GDFE achieves the AWGN capacity on each scalar layer"? Explain why or why not.
Each layer individually has a scalar AWGN capacity .
The sum of per-layer capacities equals the MIMO capacity โ this is a conservation statement.
Yes and no
Yes โ each layer achieves its own AWGN capacity with a sufficiently dense lattice code; no โ the per-layer rates are not chosen to match the per-layer capacities (they are chosen to match the overall MIMO rate). What MMSE-GDFE actually achieves is aggregate capacity: . If the code rate is fixed at bits/ch.use, the per-layer rate is per scalar channel use, which may be below or above the per-layer AWGN capacity depending on the channel realisation.
Implication for DMT
The DMT-achievability of LAST does not rely on per-layer rate-matching; it relies on the aggregate capacity being above the aggregate rate (the non-outage event). On non-outage channels the code succeeds; on outage channels it fails. This is the fundamental trade-off being optimised in Thm. TLAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004).
ex-ch17-17
Hard(Open-ended.) Argue that for on a i.i.d. Rayleigh channel, structured LAST codes (if a dense lattice of dimension or exists) will outperform CDA-NVD codes at moderate SNR and moderate rate. Consider decoder complexity and coding gain.
CDA sphere decoding: for .
For : use or Barnes-Wall (for ) with .
For : use , a known dense lattice in dimension 12.
MMSE-GDFE complexity: for .
Complexity comparison
CDA sphere decoder at : per block โ , intractable at . MMSE-GDFE
- structured LAST: . For (gives , matching after padding), complexity . MMSE-GDFE wins by 12 orders of magnitude.
Coding gain
(Coxeter-Todd lattice in dim 12) has โ modest. Still, it gives dB over . Combined with polynomial decoding, this gives structured LAST the edge at moderate SNR.
Conclusion
For , CDA codes are impractical (exponential decoding); structured LAST is practical with some coding-gain penalty. At , structured LAST is the clear winner. This is the scaling-robustness argument for lattice codes over algebraic codes.
ex-ch17-18
MediumState whether the following modifications of the LAST construction preserve DMT-optimality, and explain why. (a) Replace the common random dither by . (b) Replace the fine lattice by a half-density sub-lattice. (c) Replace MMSE-GDFE by plain MMSE (no backsubstitution). (d) Replace the inner lattice by in appropriate dimension.
Think about which step of Thm. TLAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004) each modification breaks.
Part (a): No dither
Breaks DMT. Without the dither, the crypto-lemma (uniform codeword distribution) fails, and the Minkowski-Hlawka averaging argument cannot be applied. The Erez-Zamir step of the proof breaks. Undithered LAST has strictly worse DMT than .
Part (b): Half-density sublattice
Preserves DMT. As long as has density for some (Thm. TStructured LAST with Dense Inner Lattice Achieves the DMT (Kumar-Caire 2008)), DMT is preserved. Half-density is still asymptotically adequate; only the finite-SNR coding gain degrades.
Part (c): Plain MMSE
Breaks DMT. Plain MMSE achieves only diversity (ZF baseline), not the full . The backsubstitution is what extracts the transmit diversity. See pitfall โ MMSE-GDFE Is Not Plain MMSE โ The Feedback Structure Is Essential.
Part (d): $E_8$
Preserves DMT and gains coding gain. Thm. TStructured LAST with Dense Inner Lattice Achieves the DMT (Kumar-Caire 2008) (Kumar-Caire 2008). DMT preserved; dB coding gain over .
ex-ch17-19
MediumWrite the MATLAB/Python pseudocode for the MMSE-GDFE receiver of a LAST code with an explicit inner lattice. Assume: (so is doubled , or use two blocks), dB, channel matrix given. Include: augmented matrix construction, QR decomposition, filter application, dither removal, and layer-by-layer -decoding.
Use numpy.linalg.qr for QR decomposition.
-decoding is a lookup against 240 nearest neighbours or a standard decoder (Leech-style).
Pseudocode
import numpy as np
def last_decode(y_vec, H, snr_lin, dither, E8_lattice_points):
# 1. Augment channel
alpha = 1.0 / snr_lin
H_tilde = np.kron(np.eye(T), H) # n_r T x n_t T
sqrt_alpha_I = np.sqrt(alpha) * np.eye(H_tilde.shape[1])
H_aug = np.vstack([H_tilde, sqrt_alpha_I])
# 2. QR
Q, R = np.linalg.qr(H_aug)
F = Q[:H_tilde.shape[0], :].conj().T # upper block
# 3. Filter and remove dither
z = F @ y_vec - F @ H_tilde @ dither
# 4. Layer-by-layer lattice decode
x_hat = np.zeros_like(z)
for i in range(len(z) - 1, -1, -1):
z_i_tilde = (z[i] - R[i, i+1:] @ x_hat[i+1:]) / R[i, i]
# E8 nearest-neighbour via inner-lattice table
x_hat[i] = nearest_in_E8_coord(z_i_tilde, E8_lattice_points)
# 5. Undo modulo shaping
u_hat = inverse_G @ ((x_hat + dither) % Lambda_s)
return u_hat
def nearest_in_E8_coord(z_i, E8_points):
# Return the nearest coordinate from precomputed E8 table
return E8_points[np.argmin(np.abs(E8_points - z_i))]
Notes
In production one precomputes the 240 nearest-neighbour offsets of for fast per-layer lookup. For Leech, the Conway-Sloane efficient decoder uses a tree-based algorithm that avoids the full -point search. The overall structure โ augment, QR, filter, backsubstitute โ is identical across inner lattices.
ex-ch17-20
HardConsider the Zheng-Tse DMT on the asymmetric channel. (a) Plot the DMT curve for . (b) Identify the multiplexing gain at which the diversity is exactly . (c) What LAST block length is required to approach in a single channel block?
.
Solve .
Part (a): Plot
. Piecewise linear between integer points.
Part (b): $r^*$ where $d = 2$
or . Only is in range . So .
Part (c): Block length
Zheng-Tse requires for the DMT to be achievable. At , suffices. Standard choice: (, so or inner lattice if complex-to- real). At , LAST achieves the full DMT at .
ex-ch17-21
Hard(Conceptual.) The chapter distinguishes between two CommIT contributions โ El Gamal-Caire-Damen 2004 (existence of DMT- optimal LAST) and Kumar-Caire 2008 (structured LAST from dense lattices). Discuss whether these two contributions should be viewed as (a) two steps of a single research programme, (b) two independent contributions, or (c) a single contribution distributed over two papers. Use the content of ยงยง1-5 to support your answer.
A research programme involves setting out a long-term goal; two steps of the same programme may span years.
Independent contributions address unrelated questions.
A single contribution artificially split would use different papers for the same result.
Two steps of a single research programme
The most defensible reading is (a), two steps of a single programme. The 2004 paper established the existence of DMT-optimal LAST codes via a random-lattice argument; its limitation (non-constructive) was explicitly noted in the paper's Section IV. The 2008 paper addressed the constructive question left open in 2004: it replaced the random inner lattice by dense structured lattices and showed that (i) DMT is preserved, (ii) finite-SNR coding gain is gained. This is the natural follow-up.
Evidence from the content
(1) The 2008 paper cites the 2004 paper as its starting point and uses the same DMT framework (Zheng-Tse 2003) and the same decoder (MMSE-GDFE). (2) The authors overlap (G. Caire on both). (3) The 2008 paper proves its theorem by reducing to the 2004 theorem โ an internal dependency (Thm. TStructured LAST with Dense Inner Lattice Achieves the DMT (Kumar-Caire 2008), Step 1). (4) The chapter's ยง1-5 structure is built around the two papers as complementary pieces of the LAST story, with ยง4 explicitly bridging theory and practice.
Counter-argument
A reading favoring (b) or (c) would note that the two papers have four years between them and use different technical tools (Minkowski-Hlawka vs. Conway-Sloane catalogue). But given Caire's consistent authorship and the explicit forward/ backward references between the papers, (a) is the most accurate summary.
ex-ch17-22
HardDerive the Wishart-Laplace channel outage exponent for the i.i.d. Rayleigh channel and verify it equals at (full diversity) and at integer .
Eigenvalues of have the Wishart joint density .
Reparametrise via ; high-SNR density is .
Outage constraint: .
Minimise exponent subject to outage.
Wishart density in $\alpha$-coordinates
At high SNR, the joint density of (sorted so ) is under i.i.d. Rayleigh.
Outage region
Outage: . Feasible region: , sorted. For integer , the most probable point in the outage region has for (no outage from the first eigenvalues) and for .
Minimisation at $r = k$
Minimise subject to . The linear minimum is for , giving exponent (after summation algebra).
Verify at $r = 0$
At : โ the full-diversity order, correct. At : โ consistent with the case's and 's . At : โ the full-multiplexing degenerate case. All consistent with Zheng-Tse.
ex-ch17-23
MediumExplain in your own words why the MMSE-GDFE is the lattice analog of MMSE-SIC, but not a literal "lattice SIC."
MMSE-SIC: iterative. Receive, demodulate strongest stream, subtract, recurse.
MMSE-GDFE: non-iterative. Augment, QR, filter, backsubstitute.
The difference is in how the triangularisation is done.
The analogy
Both MMSE-SIC and MMSE-GDFE triangularise the MIMO channel: they convert the channel into scalar channels, decoded sequentially, with earlier decisions feeding back into later ones. In both cases the aggregate rate equals the full MIMO mutual information (no capacity lost). In both cases the order of decoding matters (strongest first for MMSE-SIC; the QR structure determines order for MMSE-GDFE).
The difference
MMSE-SIC is iterative: decode stream 1, make a hard decision, subtract its reconstructed contribution from the channel output, redo the MMSE filter on the residual, decode stream 2, and so on. MMSE-GDFE is non-iterative: form the augmented matrix, do one QR decomposition, filter through it once, and then backsubstitute through the triangular system. The feedback is implicit in the triangularisation rather than explicit in an iterative loop.
Why the non-iterative form for lattices?
For Gaussian random codes, iterative MMSE-SIC makes sense because each stream is independently coded โ soft information flows cleanly between the demodulator and the outer-code decoder. For lattice codes, the codewords span multiple layers simultaneously (they are structured jointly on the lattice), so iterative per-stream decoding with hard decisions would lose the lattice structure. The non-iterative QR-backsubstitution form of MMSE-GDFE preserves the lattice structure throughout.
ex-ch17-24
ChallengeOpen problem (circa 2026, based on the current state of the literature): for very large MIMO (, such as 5G massive MIMO), is structured LAST competitive with codebook-based precoding? Sketch the relevant trade-offs and state what would need to be shown for a structured-LAST-based receiver to be adopted in a 6G standard.
Massive MIMO: , . Dimension .
Dense lattices at these dimensions: Barnes-Wall BW_64, BW_128, ...
Complexity: for .
Codebook precoding: discrete set, low receiver complexity.
Trade-off for massive MIMO
For : , Barnes-Wall BW_128 has dim (after complex-to-real), โ strong coding gain. MMSE-GDFE complexity flops per block. Single-block latency at 100 GFLOPS. Feasible.
For (to get compatibility): cubed is or โ pushing against real-time limits.
What would need to be shown
(1) Finite-SNR performance: structured-BW-LAST must beat Type-II codebook precoding at typical 5G operating points (SNR 10-30 dB, rate bits/ch.use/user). (2) Real-time implementation: the MMSE-GDFE at must fit in the receiver latency budget ( ms) across realistic user configurations. (3) Robustness: the code must be robust to CSI estimation errors, which are significant in massive-MIMO regimes. (4) Standards inertia: 3GPP has significant investment in the codebook paradigm; switching would require a compelling gain ( dB at standard operating points).
None of these have been demonstrated at the scale required for a 6G standard. Research in this direction is active but has not crossed the threshold for standardisation adoption. This is an open problem with significant practical implications.