Ferkans — Interactive Telecom Tutor

ex-ch17-01

Easy

State the definition of a LAST codebook. Given $\Lambda_c = \mathbb{Z}^4$ , $\Lambda_s = 4 \mathbb{Z}^4$ , block length $T = 2$ , and $n_t = 2$ , write down the codebook cardinality $|\mathcal{C}|$ and the rate $R$ in bits per channel use.

Show Hint

Cardinality is $|\Lambda_c / \Lambda_s| = V(\Lambda_s)/V(\Lambda_c)$ .

Rate is $R = T^{-1} \log_2 |\mathcal{C}|$ .

Solution

Cardinality

$|\mathcal{C}| = |\Lambda_c / \Lambda_s| = V(4 \mathbb{Z}^4)/V(\mathbb{Z}^4) = 4^4 = 256$ .

Rate

$R = T^{-1} \log_2 256 = (1/2) \cdot 8 = 4$ bits/ch.use. $\blacksquare$

ex-ch17-02

Easy

Explain why the common random dither $\mathbf{d} \sim \mathrm{Unif} (\mathcal{V}(\Lambda_s))$ is essential for the LAST construction, and what breaks if the dither is replaced by $\mathbf{d} = \mathbf{0}$ .

Show Hint

Recall the crypto lemma (Erez-Zamir): modulo a lattice, a dithered value is uniform.

Without dither, the codeword is deterministic — what property fails?

Solution

Role of the dither

The dither makes the transmit signal $\mathbf{x} = [\mathbf{G} \mathbf{u} + \mathbf{d}] \bmod \Lambda_s$ uniform on $\mathcal{V}(\Lambda_s)$ regardless of the message $\mathbf{u}$ (Erez-Zamir crypto lemma). This uniformity is the lattice analog of i.i.d. Gaussian random coding: it lets us compute ensemble- averaged error probabilities.

Without dither

If $\mathbf{d} = \mathbf{0}$ , the transmit signal is $\mathbf{G} \mathbf{u} \bmod \Lambda_s$ , which depends deterministically on $\mathbf{u}$ and does not average to uniform. The DMT proof via Minkowski-Hlawka averaging then breaks: the error probability cannot be bounded by $\gamma_c^{-n_r}$ without the uniformity. Practically, undithered LAST codes have larger finite-SNR BER and worse asymptotic slope. $\blacksquare$

ex-ch17-03

Easy

Compute the augmented channel matrix $\bar{\mathbf{H}}$ for $\mathbf{H} = \bigl(\begin{smallmatrix} 1 & 0.5 \\ 0.5 & 1 \end{smallmatrix}\bigr)$ , $T = 1$ , $\text{SNR} = 10$ .

Show Hint

Use $\alpha = 1/\text{SNR}$ .

$\bar{\mathbf{H}} = [\mathbf{H}^{T}, \sqrt{\alpha}\mathbf{I}]^T$ .

Solution

MMSE coefficient

$\alpha = 1/10 = 0.1$ , $\sqrt{\alpha} \approx 0.316$ .

Augmented matrix

$\bar{\mathbf{H}} = \begin{pmatrix} 1 & 0.5 \\ 0.5 & 1 \\ 0.316 & 0 \\ 0 & 0.316 \end{pmatrix}$ , a $4 \times 2$ real matrix. $\blacksquare$

ex-ch17-04

Medium

Prove that the MMSE-GDFE filter $\mathbf{F} = \mathbf{Q}_1^H$ satisfies $\mathbf{F}^H \mathbf{F} = (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1} \mathbf{H}^{H} \mathbf{H} (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1}$ — i.e., $\mathbf{F}$ is not unitary.

Show Hint

Use $\bar{\mathbf{H}} = \mathbf{Q} \mathbf{R}$ with $\mathbf{Q}^H \mathbf{Q} = \mathbf{I}$ .

Partition $\mathbf{Q} = (\mathbf{Q}_1^T, \mathbf{Q}_2^T)^T$ ; use $\mathbf{Q}_1^H \mathbf{Q}_1 + \mathbf{Q}_2^H \mathbf{Q}_2 = \mathbf{I}$ .

$\bar{\mathbf{H}}^H \bar{\mathbf{H}} = \mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I}$ ; also $\bar{\mathbf{H}}^H \bar{\mathbf{H}} = \mathbf{R}^H \mathbf{R}$ .

Solution

Setup

From $\mathbf{Q}^H \mathbf{Q} = \mathbf{I}$ and the block partition: $\mathbf{Q}_1^H \mathbf{Q}_1 + \mathbf{Q}_2^H \mathbf{Q}_2 = \mathbf{I}$ .

Compute $\mathbf{Q}_2^H \mathbf{Q}_2$

The bottom block of $\bar{\mathbf{H}} = \mathbf{Q} \mathbf{R}$ is $\sqrt{\alpha} \mathbf{I} = \mathbf{Q}_2 \mathbf{R}$ , so $\mathbf{Q}_2 = \sqrt{\alpha} \mathbf{R}^{-1}$ and $\mathbf{Q}_2^H \mathbf{Q}_2 = \alpha (\mathbf{R}^H \mathbf{R})^{-1} = \alpha (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1}$ .

Recover $\mathbf{F}^H \mathbf{F}$

$\mathbf{F}^H \mathbf{F} = \mathbf{Q}_1^H \mathbf{Q}_1 = \mathbf{I} - \alpha (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1} = (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I} - \alpha \mathbf{I})(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1} = \mathbf{H}^{H} \mathbf{H} (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1}$ , i.e., a non-unitary positive-definite operator. $\blacksquare$

ex-ch17-05

Medium

State the three ingredients of the LAST-DMT-optimality proof (El Gamal-Caire-Damen 2004, Thm. 1). Then, for each ingredient, identify the chapter of this book where it was established.

Show Hint

The ingredients are in the proof-steps of Thm. TLAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004).

Ch. 12 gave the outage-exponent; Ch. 16 gave the Erez-Zamir AWGN result; Ch. 15 gave Minkowski-Hlawka.

Solution

Three ingredients

(1) MMSE-GDFE preserves mutual information (this chapter, §2 Thm. TMMSE-GDFE Preserves Mutual Information). (2) Erez-Zamir lattice coding achieves the AWGN capacity on the effective channel. (3) Zheng-Tse Wishart-Laplace analysis giving $\Pr(\text{outage}) \doteq \text{SNR}^{-d^*(r)}$ .

Cross-references

(1) is this chapter's §2. (2) is Ch. 16 (Erez-Zamir 2004) plus Ch. 15 (Minkowski-Hlawka random-lattice averaging). (3) is Ch. 12 (Zheng-Tse 2003). Composition of three prior results gives the LAST theorem. $\blacksquare$

ex-ch17-06

Medium

For $(n_t, n_r) = (2, 2)$ , tabulate the Zheng-Tse DMT curve $d^*(r)$ at $r = 0, 0.5, 1, 1.5, 2$ . At $r = 1.5$ , compute the achievable diversity order. What slope does the BER-vs-SNR curve of a LAST code achieve at $r = 1.5$ ?

Show Hint

Piecewise-linear interpolation between $(0, 4), (1, 1), (2, 0)$ .

BER slope equals the diversity order.

Solution

Tabulation

$d^*(0) = 4$ , $d^*(0.5) = 4 + (1-4) \cdot 0.5 = 2.5$ , $d^*(1) = 1$ , $d^*(1.5) = 1 + (0-1) \cdot 0.5 = 0.5$ , $d^*(2) = 0$ .

Diversity and slope at $r = 1.5$

Diversity $d^*(1.5) = 0.5$ . BER-vs-SNR slope is $-0.5$ , i.e., BER $\propto \text{SNR}^{-0.5}$ at high SNR. This is a very shallow slope — at $r = 1.5$ on a $2 \times 2$ channel, the diversity advantage of the LAST code is minimal; we are essentially multiplexing at the full MIMO rate. $\blacksquare$

ex-ch17-07

Medium

Write down the Erez-Zamir "equivalent-AWGN channel" seen by the lattice decoder after MMSE-GDFE. Specifically, for the LAST code over a MIMO channel $\ntn{Y} = \mathbf{H} \mathbf{X} + \mathbf{w}$ , describe the effective noise variance per layer and the effective SNR aggregate.

Show Hint

After MMSE-GDFE the per-layer signal is $R_{ii} x_i + w_i'$ with $\mathrm{Var}(w_i') = 1$ .

Per-layer SNR is $R_{ii}^2 / 1 = R_{ii}^2$ .

Use Thm. TMMSE-GDFE Triangularises the MIMO Channel for the product formula.

Solution

Per-layer effective SNR

Per layer $i \in \{1, \ldots, n_t T\}$ , the signal is $R_{ii} x_i$ , the effective noise has unit variance, and the effective SNR is $R_{ii}^2$ .

Aggregate SNR

$\prod_i R_{ii}^2 = \det(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^T$ (by Thm. TMMSE-GDFE Triangularises the MIMO Channel). Taking log: $\sum_i \log_2 R_{ii}^2 = T \log_2 \det(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})$ , which equals the MIMO mutual information (up to the MMSE bias offset). The equivalent channel is a product of $n_t T$ scalar AWGN channels with varying per-layer SNR, whose aggregate capacity equals the full MIMO capacity. $\blacksquare$

ex-ch17-08

Medium

Compute the normalised coding gain of $D_4$ (the densest 4D lattice, with $d_{\min}^2 = 2$ , $V(D_4) = 1/2$ ) relative to $\mathbb{Z}^4$ . Translate this into the BER shift for a structured LAST code on $(n_t, n_r) = (2, 2)$ with block length $T = 2$ .

Show Hint

$\gamma_c(\Lambda) = d_{\min}^2 / V^{2/n}$ .

BER shift factor is $\gamma_c^{-n_r}$ .

Solution

Coding gain

$\gamma_c(D_4) = 2 / (1/2)^{2/4} = 2 / (1/\sqrt{2}) = 2\sqrt{2} \approx 2.83$ . $\gamma_c(\mathbb{Z}^4) = 1 / 1^{1/2} = 1$ . Ratio $\gamma_c(D_4) / \gamma_c(\mathbb{Z}^4) = 2\sqrt{2}$ , or $\approx 4.5$ dB.

BER shift

$\gamma_c^{-n_r} = (2\sqrt{2})^{-2} = 1/8$ , i.e., $9$ dB of SNR shift at the same BER target. At the BER $10^{-4}$ operating point, structured- $D_4$ -LAST operates $9$ dB below $\mathbb{Z}^4$ -LAST. $\blacksquare$

ex-ch17-09

Medium

State and prove that MMSE-GDFE is a linear sufficient statistic for decoding the transmitted codeword $\mathbf{x}$ from the received vector $\mathbf{y}$ on a MIMO channel.

Show Hint

A linear transformation with full column rank is a sufficient statistic.

Use Thm. TMMSE-GDFE Preserves Mutual Information and the structure of $\mathbf{F}$ .

Solution

Statement

The MMSE-GDFE filter $\mathbf{F} = \mathbf{Q}_1^H$ applied to $\mathbf{y}$ gives $\mathbf{z} = \mathbf{F} \mathbf{y}$ that is a sufficient statistic for $\mathbf{x}$ , i.e., $I(\mathbf{x}; \mathbf{z}) = I(\mathbf{x}; \mathbf{y})$ and $\mathbf{x} \to \mathbf{z} \to \mathbf{y}$ form a Markov chain (trivially, as $\mathbf{z}$ is a function of $\mathbf{y}$ ).

Proof

From Exercise 4, $\mathbf{F}^H \mathbf{F} = \mathbf{H}^{H} \mathbf{H} (\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})^{-1}$ , which is positive-definite (hence full column rank) whenever $\mathbf{H}$ has full rank. A full-column-rank linear transformation of $\mathbf{y}$ preserves mutual information with $\mathbf{x}$ (data-processing equality for invertible linear maps). Hence $\mathbf{z}$ is a sufficient statistic for $\mathbf{x}$ . $\blacksquare$

ex-ch17-10

Medium

Given the MMSE-GDFE decoder's complexity $O((n_t T)^3)$ and the sphere decoder's average complexity $O(M^{n_t T / 2})$ , determine the value of $M$ at which the sphere decoder becomes cheaper than MMSE-GDFE for $n_t T = 8$ .

Show Hint

Set $M^{n_t T / 2} = (n_t T)^3$ and solve for $M$ .

With $n_t T = 8$ , cubic cost is $8^3 = 512$ . Sphere cost is $M^4$ .

Solution

Equation

$M^{n_t T / 2} = (n_t T)^3 \implies M^4 = 512 \implies M = 512^{1/4} = 2^{9/4} \approx 4.76$ .

Interpretation

At $M \lesssim 5$ (e.g., BPSK/QPSK), sphere decoding is cheaper than MMSE-GDFE. At $M = 16$ (16-QAM), MMSE-GDFE wins: sphere decoder cost is $16^4 = 65536$ , much larger than MMSE-GDFE's $\sim 512$ . For standard MIMO rates (16-QAM and above), MMSE-GDFE is the clear winner. $\blacksquare$

ex-ch17-11

Hard

Consider a LAST code with rate $R = 2$ bits/ch.use on a $(2, 2)$ i.i.d. Rayleigh channel. At SNR $= 20$ dB, the target BER is $10^{-3}$ . (a) What is the required multiplexing gain? (b) What is the DMT exponent at this $r$ ? (c) Is structured- $E_8$ -LAST feasible in this setting? (d) What coding-gain advantage would it provide over random LAST?

Show Hint

Multiplexing gain $r = R / \log_2(\text{SNR}) = 2 / \log_2(100)$ .

Check if $n_t T = 8$ for an $E_8$ -compatible configuration.

Solution

Part (a): Multiplexing gain

$r = R / \log_2(\text{SNR}) = 2 / \log_2(100) = 2 / 6.64 \approx 0.30$ .

Part (b): DMT exponent

$d^*(0.30) = (2 - 0.30)(2 - 0.30) = 1.70 \cdot 1.70 = 2.89$ . BER decays as $\text{SNR}^{-2.89}$ .

Part (c): $E_8$ compatibility

$E_8$ lives in $\mathbb{R}^8$ , so we need $2 n_t T = 8$ , i.e., $n_t T = 4$ . With $n_t = 2$ , we need $T = 2$ . Yes, structured- $E_8$ -LAST with $T = 2$ fits the $(2, 2)$ setting.

Part (d): Coding-gain advantage

From Ex. 8's pattern: $\gamma_c(E_8) = 2$ (i.e., $3$ dB); BER shift factor $\gamma_c^{-n_r} = 2^{-2}$ , i.e., $6$ dB of SNR shift. Structured- $E_8$ -LAST operates $6$ dB below random LAST at BER $10^{-3}$ . $\blacksquare$

ex-ch17-12

Hard

Derive the per-layer effective SNR of the MMSE-GDFE for a $(n_t, n_r) = (2, 2)$ i.i.d. Rayleigh channel at $\text{SNR} = 10$ . Average the result over the channel distribution to obtain the ensemble-averaged aggregate SNR.

Show Hint

Per-layer SNR is $R_{ii}^2$ .

Use $\mathbb{E}[\det(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})]$ via the Wishart moments.

$\mathbb{E}[\det(W + \alpha I)] = \sum_k \binom{n}{k} \alpha^{n-k} \mathbb{E}[\det(W_k)]$ where $W_k$ is the $k$ -principal minor.

Solution

Per-layer SNR (formal)

$R_{ii}^2$ are the squared diagonals of the QR factor of $[\mathbf{H}^{T}, \sqrt{\alpha}\mathbf{I}]^T$ . By QR properties, $\prod_i R_{ii}^2 = \det(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})$ . For $(2,2)$ : $\det = \lambda_1 \lambda_2 + \alpha(\lambda_1 + \lambda_2) + \alpha^2$ where $\lambda_i$ are eigenvalues of $\mathbf{H}^{H} \mathbf{H}$ .

Ensemble average

Under i.i.d. Rayleigh, $\mathbb{E}[\lambda_1 \lambda_2] = \mathbb{E}[\det(\mathbf{H}^{H} \mathbf{H})] = 1$ (for $n_t = n_r = 2$ normalised), $\mathbb{E}[\lambda_1 + \lambda_2] = \mathbb{E}[\mathrm{tr}(\mathbf{H}^{H} \mathbf{H})] = n_t \cdot n_r \cdot 1 = 4$ . With $\alpha = 0.1$ : $\mathbb{E}[\det(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I})] = 1 + 0.1 \cdot 4 + 0.01 = 1.41$ .

Aggregate log

$\mathbb{E}[\sum_i \log_2 R_{ii}^2] = \mathbb{E}[\log_2 \det] \approx \log_2 1.41 = 0.50$ (per block, $T = 1$ ). The lattice decoder sees an ensemble-averaged aggregate SNR in this range — specific block realisations may be higher or lower depending on channel eigenvalues. $\blacksquare$

ex-ch17-13

Hard

Explain why zero-forcing V-BLAST does NOT achieve the full DMT on an $(n_t, n_r)$ i.i.d. Rayleigh channel with $n_t \le n_r$ . Specifically: derive the diversity order of ZF-V-BLAST at the maximum multiplexing gain $r_{\max} = n_t$ .

Show Hint

ZF decorrelates via $(\mathbf{H}^{H} \mathbf{H})^{-1} \mathbf{H}^{H}$ , converting to per-stream Gaussian channels.

Per-stream effective SNR is $\text{SNR} / [(\mathbf{H}^{H} \mathbf{H})^{-1}]_{ii}$ .

The diagonal entry has the inverse-Wishart distribution, whose tail gives the diversity.

Solution

ZF equivalent channel

After ZF, each stream sees a Gaussian channel with effective noise variance $[(\mathbf{H}^{H} \mathbf{H})^{-1}]_{ii}$ . The per-stream SNR is $\text{SNR} \cdot [(\mathbf{H}^{H} \mathbf{H})^{-1}]_{ii}^{-1}$ .

Distribution of the diagonal

$(\mathbf{H}^{H} \mathbf{H})^{-1}$ is inverse-Wishart; its diagonals are distributed as $1/\chi^2_{n_r - n_t + 1}$ . Hence the per-stream SNR is distributed as $\text{SNR} \cdot \chi^2_{n_r - n_t + 1}$ , which has diversity order $n_r - n_t + 1$ .

DMT conclusion

At $r = n_t$ , each of the $n_t$ streams carries rate $\log_2 \text{SNR}$ , and the probability of per-stream outage is $\text{SNR}^{-(n_r - n_t + 1)}$ . This is strictly smaller than the Zheng-Tse $d^*(n_t) = 0$ (degenerate case, where we should have no outage asymptotically); but at intermediate $r < n_t$ the ZF diversity is stuck at $n_r - n_t + 1$ , below the Zheng-Tse $(n_t - r)(n_r - r)$ . Hence ZF-V-BLAST is not DMT-optimal. This is precisely the pitfall that MMSE-GDFE fixes. $\blacksquare$

ex-ch17-14

Hard

Prove that MMSE-GDFE + lattice decoding and MMSE-SIC + Gaussian- random-code ML decoding achieve the same aggregate capacity on a MIMO channel. This is the formal statement of the "MMSE-GDFE is the lattice analog of MMSE-SIC" remark.

Show Hint

Both receivers triangularise via QR and decode layer by layer.

Per-layer aggregate capacity: $\sum_i \log_2(1 + R_{ii}^2/\sigma_{\text{eff}}^2)$ .

Show this equals $I_{\text{MIMO}}(\mathbf{H}) = \log_2 \det(\mathbf{I} + \text{SNR} \mathbf{H}^{H} \mathbf{H})$ .

Solution

Per-layer capacity

Per layer $i$ of the triangular system, the achievable rate with a Gaussian random code is $\log_2(1 + R_{ii}^2 / \sigma_{\text{eff}}^2)$ ; with a lattice code, the achievable rate is $\log_2(R_{ii}^2 / \sigma_{\text{eff}}^2)$ (Erez-Zamir gives $\tfrac12 \log_2(1 + \text{SNR})$ per real dimension, or $\log_2 \text{SNR}$ at high SNR per complex dimension).

Aggregate

$\sum_i \log_2 R_{ii}^2 = T \log_2 \det(\mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I}) = T \log_2 \det(\mathbf{I} + \text{SNR} \mathbf{H}^{H} \mathbf{H}) - n_t T \log_2 \alpha + \text{(offset)} = T \cdot I_{\text{MIMO}}(\mathbf{H}) - O(1)$ .

Conclusion

Both MMSE-GDFE (lattice) and MMSE-SIC (Gaussian) achieve the same aggregate capacity $I_{\text{MIMO}}(\mathbf{H})$ on the triangularised channel. The receivers differ only in which codebook they assume — Gaussian random code versus lattice code. This is the precise sense in which MMSE-GDFE is the lattice analog of MMSE-SIC. $\blacksquare$

ex-ch17-15

Medium

Design a structured LAST code for a $3 \times 3$ MIMO channel with block length $T = 4$ , using the Leech lattice $\Lambda_{24}$ as the inner lattice. (a) Verify the dimension matching. (b) Compute the codebook cardinality for $k = 8$ . (c) State the coding-gain advantage over random LAST.

Show Hint

$\Lambda_{24} \subset \mathbb{R}^{24}$ , so we need $2 n_t T = 24$ .

$k$ is the shaping factor: $\Lambda_s = k \Lambda_c$ .

$\gamma_c(\Lambda_{24}) = 4$ , i.e., $6$ dB over $\mathbb{Z}^{24}$ .

Solution

Part (a): Dimension check

$2 n_t T = 2 \cdot 3 \cdot 4 = 24$ . Matches Leech dim.

Part (b): Cardinality

$|\mathcal{C}| = k^{24} = 8^{24} = 2^{72}$ codewords, i.e., a rate of $(1/T) \log_2 |\mathcal{C}| = 72/4 = 18$ bits per channel use across 3 transmit antennas ( $= 6$ bits/ch.use per antenna, equivalent to $64$ -QAM per layer).

Part (c): Coding-gain advantage

$\gamma_c(\Lambda_{24}) = 4$ ( $6$ dB), so BER shift factor is $\gamma_c^{-n_r} = 4^{-3} = 1/64 = -18$ dB. Structured-Leech-LAST operates $18$ dB below random LAST at the same BER target — an enormous finite-SNR advantage from the Leech lattice's extreme density. $\blacksquare$

ex-ch17-16

Medium

Given that MMSE-GDFE achieves the MIMO mutual information $I(\mathbf{x}; \mathbf{y})$ per Thm. TMMSE-GDFE Preserves Mutual Information, would it be correct to say "MMSE-GDFE achieves the AWGN capacity on each scalar layer"? Explain why or why not.

Show Hint

Each layer individually has a scalar AWGN capacity $\log_2(1 + R_{ii}^2 / \sigma^2)$ .

The sum of per-layer capacities equals the MIMO capacity — this is a conservation statement.

Solution

Yes and no

Yes — each layer achieves its own AWGN capacity with a sufficiently dense lattice code; no — the per-layer rates are not chosen to match the per-layer capacities (they are chosen to match the overall MIMO rate). What MMSE-GDFE actually achieves is aggregate capacity: $\sum_i \log_2(1 + \text{per-layer SNR}_i) = I(\mathbf{x}; \mathbf{y})$ . If the code rate is fixed at $R$ bits/ch.use, the per-layer rate is $R / (n_t T)$ per scalar channel use, which may be below or above the per-layer AWGN capacity depending on the channel realisation.

Implication for DMT

The DMT-achievability of LAST does not rely on per-layer rate-matching; it relies on the aggregate capacity being above the aggregate rate (the non-outage event). On non-outage channels the code succeeds; on outage channels it fails. This is the fundamental trade-off being optimised in Thm. TLAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004). $\blacksquare$

ex-ch17-17

Hard

(Open-ended.) Argue that for $n_t \ge 5$ on a $5 \times 5$ i.i.d. Rayleigh channel, structured LAST codes (if a dense lattice of dimension $2 n_t T = 10$ or $2 n_t T = 20$ exists) will outperform CDA-NVD codes at moderate SNR and moderate rate. Consider decoder complexity and coding gain.

Show Hint

CDA sphere decoding: $O(M^{n_t^2 / 2}) = O(M^{12.5})$ for $n_t = 5$ .

For $n_t T = 10$ : use $K_{10}$ or Barnes-Wall $\Lambda_{16}^{BW}$ (for $n_t T = 8$ ) with $n_t = 4$ .

For $n_t T = 12$ : use $K_{12}$ , a known dense lattice in dimension 12.

MMSE-GDFE complexity: $O((n_t T)^3) = O(5^3) = 125$ for $T = 1$ .

Solution

Complexity comparison

CDA sphere decoder at $n_t = 5$ : $O(M^{12.5}) = 16^{12.5}$ per block — $\approx 10^{15}$ , intractable at $M = 16$ . MMSE-GDFE

structured LAST: $O((n_t T)^3) = 125 \cdot T^3$ . For $T = 2$ (gives $n_t T = 10$ , matching $K_{12}$ after padding), complexity $\approx 1000$ . MMSE-GDFE wins by 12 orders of magnitude.

Coding gain

$K_{12}$ (Coxeter-Todd lattice in dim 12) has $\gamma_c \approx 4/3 \cdot \sqrt[12]{3} \approx 1.5$ — modest. Still, it gives $\sim 1.7$ dB over $\mathbb{Z}^{12}$ . Combined with polynomial decoding, this gives structured LAST the edge at moderate SNR.

Conclusion

For $n_t \ge 5$ , CDA codes are impractical (exponential decoding); structured LAST is practical with some coding-gain penalty. At $n_t = 5$ , structured LAST is the clear winner. This is the scaling-robustness argument for lattice codes over algebraic codes. $\blacksquare$

ex-ch17-18

Medium

State whether the following modifications of the LAST construction preserve DMT-optimality, and explain why. (a) Replace the common random dither $\mathbf{d}$ by $\mathbf{d} = \mathbf{0}$ . (b) Replace the fine lattice $\Lambda_c$ by a half-density sub-lattice. (c) Replace MMSE-GDFE by plain MMSE (no backsubstitution). (d) Replace the inner lattice by $E_8$ in appropriate dimension.

Show Hint

Think about which step of Thm. TLAST + MMSE-GDFE Achieves the Zheng-Tse DMT (El Gamal-Caire-Damen 2004) each modification breaks.

Solution

Part (a): No dither

Breaks DMT. Without the dither, the crypto-lemma (uniform codeword distribution) fails, and the Minkowski-Hlawka averaging argument cannot be applied. The Erez-Zamir step of the proof breaks. Undithered LAST has strictly worse DMT than $d^*(r)$ .

Part (b): Half-density sublattice

Preserves DMT. As long as $\Lambda_c$ has density $\ge 2^{-c n_t T}$ for some $c < \log_2 e$ (Thm. TStructured LAST with Dense Inner Lattice Achieves the DMT (Kumar-Caire 2008)), DMT is preserved. Half-density is still asymptotically adequate; only the finite-SNR coding gain degrades.

Part (c): Plain MMSE

Breaks DMT. Plain MMSE achieves only diversity $n_r - n_t + 1$ (ZF baseline), not the full $(n_t - r)(n_r - r)$ . The backsubstitution is what extracts the transmit diversity. See pitfall ⚠MMSE-GDFE Is Not Plain MMSE — The Feedback Structure Is Essential.

Part (d): $E_8$

Preserves DMT and gains coding gain. Thm. TStructured LAST with Dense Inner Lattice Achieves the DMT (Kumar-Caire 2008) (Kumar-Caire 2008). DMT preserved; $+3$ dB coding gain over $\mathbb{Z}^8$ . $\blacksquare$

ex-ch17-19

Medium

Write the MATLAB/Python pseudocode for the MMSE-GDFE receiver of a LAST code with an explicit $E_8$ inner lattice. Assume: $n_t = 4, T = 2$ (so $2 n_t T = 16$ is doubled $E_8$ , or use two $E_8$ blocks), $\text{SNR} = 15$ dB, channel matrix given. Include: augmented matrix construction, QR decomposition, filter application, dither removal, and layer-by-layer $E_8$ -decoding.

Show Hint

Use numpy.linalg.qr for QR decomposition.

$E_8$ -decoding is a lookup against 240 nearest neighbours or a standard $E_8$ decoder (Leech-style).

Solution

Pseudocode

import numpy as np

def last_decode(y_vec, H, snr_lin, dither, E8_lattice_points):
    # 1. Augment channel
    alpha = 1.0 / snr_lin
    H_tilde = np.kron(np.eye(T), H)        # n_r T x n_t T
    sqrt_alpha_I = np.sqrt(alpha) * np.eye(H_tilde.shape[1])
    H_aug = np.vstack([H_tilde, sqrt_alpha_I])
    # 2. QR
    Q, R = np.linalg.qr(H_aug)
    F = Q[:H_tilde.shape[0], :].conj().T   # upper block
    # 3. Filter and remove dither
    z = F @ y_vec - F @ H_tilde @ dither
    # 4. Layer-by-layer lattice decode
    x_hat = np.zeros_like(z)
    for i in range(len(z) - 1, -1, -1):
        z_i_tilde = (z[i] - R[i, i+1:] @ x_hat[i+1:]) / R[i, i]
        # E8 nearest-neighbour via inner-lattice table
        x_hat[i] = nearest_in_E8_coord(z_i_tilde, E8_lattice_points)
    # 5. Undo modulo shaping
    u_hat = inverse_G @ ((x_hat + dither) % Lambda_s)
    return u_hat

def nearest_in_E8_coord(z_i, E8_points):
    # Return the nearest coordinate from precomputed E8 table
    return E8_points[np.argmin(np.abs(E8_points - z_i))]

Notes

In production one precomputes the 240 nearest-neighbour offsets of $E_8$ for fast per-layer lookup. For Leech, the Conway-Sloane efficient decoder uses a tree-based algorithm that avoids the full $196560$ -point search. The overall structure — augment, QR, filter, backsubstitute — is identical across inner lattices. $\blacksquare$

ex-ch17-20

Hard

Consider the Zheng-Tse DMT $d^*(r) = (n_t - r)(n_r - r)$ on the $(n_t, n_r) = (2, 3)$ asymmetric channel. (a) Plot the DMT curve for $r \in [0, 2]$ . (b) Identify the multiplexing gain $r^*$ at which the diversity is exactly $2$ . (c) What LAST block length $T$ is required to approach $d^*(r^*)$ in a single channel block?

Show Hint

$d^*(r) = (2 - r)(3 - r)$ .

Solve $(2 - r)(3 - r) = 2$ .

Solution

Part (a): Plot

$d^*(0) = 6, d^*(1) = 2, d^*(2) = 0$ . Piecewise linear between integer points.

Part (b): $r^*$ where $d = 2$

$(2 - r)(3 - r) = 2 \implies 6 - 5 r + r^2 = 2 \implies r^2 - 5 r + 4 = 0 \implies r = 1$ or $r = 4$ . Only $r = 1$ is in range $[0, 2]$ . So $r^* = 1$ .

Part (c): Block length

Zheng-Tse requires $T \ge n_t$ for the DMT to be achievable. At $n_t = 2$ , $T \ge 2$ suffices. Standard choice: $T = 2$ ( $n_t T = 4$ , so $E_8$ or $D_4$ inner lattice if complex-to- real). At $T = 2$ , LAST achieves the full DMT $d^* = 2$ at $r = 1$ . $\blacksquare$

ex-ch17-21

Hard

(Conceptual.) The chapter distinguishes between two CommIT contributions — El Gamal-Caire-Damen 2004 (existence of DMT- optimal LAST) and Kumar-Caire 2008 (structured LAST from dense lattices). Discuss whether these two contributions should be viewed as (a) two steps of a single research programme, (b) two independent contributions, or (c) a single contribution distributed over two papers. Use the content of §§1-5 to support your answer.

Show Hint

A research programme involves setting out a long-term goal; two steps of the same programme may span years.

Independent contributions address unrelated questions.

A single contribution artificially split would use different papers for the same result.

Solution

Two steps of a single research programme

The most defensible reading is (a), two steps of a single programme. The 2004 paper established the existence of DMT-optimal LAST codes via a random-lattice argument; its limitation (non-constructive) was explicitly noted in the paper's Section IV. The 2008 paper addressed the constructive question left open in 2004: it replaced the random inner lattice by dense structured lattices and showed that (i) DMT is preserved, (ii) finite-SNR coding gain is gained. This is the natural follow-up.

Evidence from the content

(1) The 2008 paper cites the 2004 paper as its starting point and uses the same DMT framework (Zheng-Tse 2003) and the same decoder (MMSE-GDFE). (2) The authors overlap (G. Caire on both). (3) The 2008 paper proves its theorem by reducing to the 2004 theorem — an internal dependency (Thm. TStructured LAST with Dense Inner Lattice Achieves the DMT (Kumar-Caire 2008), Step 1). (4) The chapter's §1-5 structure is built around the two papers as complementary pieces of the LAST story, with §4 explicitly bridging theory and practice.

Counter-argument

A reading favoring (b) or (c) would note that the two papers have four years between them and use different technical tools (Minkowski-Hlawka vs. Conway-Sloane catalogue). But given Caire's consistent authorship and the explicit forward/ backward references between the papers, (a) is the most accurate summary. $\blacksquare$

ex-ch17-22

Hard

Derive the Wishart-Laplace channel outage exponent for the $(n_t, n_r)$ i.i.d. Rayleigh channel and verify it equals $d^*(r) = (n_t - r)(n_r - r)$ at $r = 0$ (full diversity) and at integer $r$ .

Show Hint

Eigenvalues of $\mathbf{H}^{H} \mathbf{H}$ have the Wishart joint density $\prod_i \lambda_i^{n_r - n_t} \prod_{i<j} (\lambda_i - \lambda_j)^2 e^{-\sum \lambda_i}$ .

Reparametrise via $\alpha_i = -\log_\text{SNR}(\lambda_i)$ ; high-SNR density is $\doteq \text{SNR}^{-\sum_i (n_r - n_t + 2 i - 1) \alpha_i / 2}$ .

Outage constraint: $\sum_i (1 - \alpha_i)^+ < r$ .

Minimise exponent subject to outage.

Solution

Wishart density in $\alpha$-coordinates

At high SNR, the joint density of $\{\alpha_i\}$ (sorted so $\alpha_1 \le \ldots \le \alpha_{n_t}$ ) is $\doteq \text{SNR}^{-\sum_i (n_t + n_r - 2 i + 1) \alpha_i / 2}$ under i.i.d. Rayleigh.

Outage region

Outage: $\sum_i (1 - \alpha_i)^+ < r$ . Feasible region: $\alpha_i \ge 0$ , sorted. For integer $r = k$ , the most probable point in the outage region has $\alpha_i = 0$ for $i \le k$ (no outage from the first $k$ eigenvalues) and $\alpha_i > 0$ for $i > k$ .

Minimisation at $r = k$

Minimise $\sum_{i > k} (n_t + n_r - 2 i + 1) \alpha_i / 2$ subject to $\alpha_i \ge 0$ . The linear minimum is $\alpha_i = 1$ for $i > k$ , giving exponent $\sum_{i > k} (n_t + n_r - 2 i + 1) / 2 = (n_t - k)(n_r - k)$ (after summation algebra).

Verify at $r = 0$

At $r = 0$ : $(n_t - 0)(n_r - 0) = n_t n_r$ — the full-diversity order, correct. At $r = 1$ : $(n_t - 1)(n_r - 1)$ — consistent with the $(2, 2)$ case's $d^*(1) = 1$ and $(3, 3)$ 's $d^*(1) = 4$ . At $r = \min(n_t, n_r)$ : $0$ — the full-multiplexing degenerate case. All consistent with Zheng-Tse. $\blacksquare$

ex-ch17-23

Medium

Explain in your own words why the MMSE-GDFE is the lattice analog of MMSE-SIC, but not a literal "lattice SIC."

Show Hint

MMSE-SIC: iterative. Receive, demodulate strongest stream, subtract, recurse.

MMSE-GDFE: non-iterative. Augment, QR, filter, backsubstitute.

The difference is in how the triangularisation is done.

Solution

The analogy

Both MMSE-SIC and MMSE-GDFE triangularise the MIMO channel: they convert the $n_t \to n_r$ channel into $n_t$ scalar channels, decoded sequentially, with earlier decisions feeding back into later ones. In both cases the aggregate rate equals the full MIMO mutual information (no capacity lost). In both cases the order of decoding matters (strongest first for MMSE-SIC; the QR structure determines order for MMSE-GDFE).

The difference

MMSE-SIC is iterative: decode stream 1, make a hard decision, subtract its reconstructed contribution from the channel output, redo the MMSE filter on the residual, decode stream 2, and so on. MMSE-GDFE is non-iterative: form the augmented matrix, do one QR decomposition, filter through it once, and then backsubstitute through the triangular system. The feedback is implicit in the triangularisation rather than explicit in an iterative loop.

Why the non-iterative form for lattices?

For Gaussian random codes, iterative MMSE-SIC makes sense because each stream is independently coded — soft information flows cleanly between the demodulator and the outer-code decoder. For lattice codes, the codewords span multiple layers simultaneously (they are structured jointly on the lattice), so iterative per-stream decoding with hard decisions would lose the lattice structure. The non-iterative QR-backsubstitution form of MMSE-GDFE preserves the lattice structure throughout. $\blacksquare$

ex-ch17-24

Challenge

Open problem (circa 2026, based on the current state of the literature): for very large MIMO ( $n_t = n_r = 64$ , such as 5G massive MIMO), is structured LAST competitive with codebook-based precoding? Sketch the relevant trade-offs and state what would need to be shown for a structured-LAST-based receiver to be adopted in a 6G standard.

Show Hint

Massive MIMO: $n_t = n_r = 64$ , $T = 1-4$ . Dimension $2 n_t T \in [128, 512]$ .

Dense lattices at these dimensions: Barnes-Wall BW_64, BW_128, ...

Complexity: $O((n_t T)^3) = 64^3 \approx 3 \cdot 10^5$ for $T = 1$ .

Codebook precoding: discrete set, low receiver complexity.

Solution

Trade-off for massive MIMO

For $n_t = n_r = 64, T = 1$ : $n_t T = 64$ , Barnes-Wall BW_128 has dim $128 = 2 n_t T$ (after complex-to-real), $\gamma_c \approx 2.9$ — strong coding gain. MMSE-GDFE complexity $O(64^3) = 2.6 \times 10^5$ flops per block. Single-block latency $\sim 10 \mu s$ at 100 GFLOPS. Feasible.

For $n_t T = 128, 256$ (to get $E_{8n}$ compatibility): $n_t T$ cubed is $2 \times 10^6$ or $1.6 \times 10^7$ — pushing against real-time limits.

What would need to be shown

(1) Finite-SNR performance: structured-BW-LAST must beat Type-II codebook precoding at typical 5G operating points (SNR 10-30 dB, rate $\sim 10$ bits/ch.use/user). (2) Real-time implementation: the MMSE-GDFE at $n_t = 64$ must fit in the receiver latency budget ( $\sim 1$ ms) across realistic user configurations. (3) Robustness: the code must be robust to CSI estimation errors, which are significant in massive-MIMO regimes. (4) Standards inertia: 3GPP has significant investment in the codebook paradigm; switching would require a compelling gain ( $\ge 3$ dB at standard operating points).

None of these have been demonstrated at the scale required for a 6G standard. Research in this direction is active but has not crossed the threshold for standardisation adoption. This is an open problem with significant practical implications. $\blacksquare$

Exercises

ex-ch17-01

Cardinality

Rate

ex-ch17-02

Role of the dither

Without dither

ex-ch17-03

MMSE coefficient

Augmented matrix

ex-ch17-04

Setup

Compute $\mathbf{Q}_2^H \mathbf{Q}_2$

Recover $\mathbf{F}^H \mathbf{F}$

ex-ch17-05

Three ingredients

Cross-references

ex-ch17-06

Tabulation

Diversity and slope at $r = 1.5$

ex-ch17-07

Per-layer effective SNR

Aggregate SNR

ex-ch17-08

Coding gain

BER shift

ex-ch17-09

Statement

Proof

ex-ch17-10

Equation

Interpretation

ex-ch17-11

Part (a): Multiplexing gain

Part (b): DMT exponent

Part (c): $E_8$ compatibility

Part (d): Coding-gain advantage

ex-ch17-12

Per-layer SNR (formal)

Ensemble average

Aggregate log

ex-ch17-13

ZF equivalent channel

Distribution of the diagonal

DMT conclusion

ex-ch17-14

Per-layer capacity

Aggregate

Conclusion

ex-ch17-15

Part (a): Dimension check

Part (b): Cardinality

Part (c): Coding-gain advantage

ex-ch17-16

Yes and no

Implication for DMT

ex-ch17-17

Complexity comparison

Coding gain

Conclusion

ex-ch17-18

Part (a): No dither

Part (b): Half-density sublattice

Part (c): Plain MMSE

Part (d): $E_8$

ex-ch17-19

Pseudocode

Notes

ex-ch17-20

Part (a): Plot

Part (b): $r^*$ where $d = 2$

Part (c): Block length

ex-ch17-21

Two steps of a single research programme

Evidence from the content

Counter-argument

ex-ch17-22

Wishart density in $\alpha$-coordinates

Outage region

Minimisation at $r = k$