Exercises
ex-ch16-01
EasyVerify that the standard 64-QAM constellation (after a unit-mean shift into an odd-integer lattice) can be expressed as a nested lattice code . Identify and , compute the codebook size, and give the rate in bits per real dimension.
Use with an appropriate coset offset.
The constellation fits in , the Voronoi region of .
Fine lattice
(odd-integer coset absorbed into the dither). .
Coarse lattice and codebook size
with . . Matches 64-QAM.
Rate
bits per real dimension (i.e., bits per complex symbol).
ex-ch16-02
EasyShow that the modulo-lattice reduction is a group homomorphism: for any and any lattice ,
Use the definition .
Show that (idempotence).
Setup
Let with , and similarly with .
Sum modulo
. Also , so .
Conclude
Since , translating a point by a lattice vector and then quantising to the nearest lattice point shifts the quantiser output by the same lattice vector: . Therefore , which equals .
ex-ch16-03
EasyDerive the MMSE coefficient by minimising where with , , and independent. Verify that the residual error variance is .
Take the derivative of and set to zero.
Use and .
For the residual, compute at the optimal .
Derivative
.
Stationary point
.
Residual variance
Plugging back: . Simplifying: .
ex-ch16-04
EasyFor the AWGN channel at SNR dB (), compute: (a) the MMSE coefficient ; (b) the Shannon capacity in bits per real dimension; (c) the per-dimension rate achievable with uncoded 16-QAM at bit error probability , and the gap to capacity.
.
Use the Q-function approximation for 16-QAM: .
MMSE coefficient
.
Shannon capacity
bits per real dimension.
Uncoded 16-QAM rate
Uncoded 16-QAM has bits per real dimension (fixed by the constellation). The SNR required for is dB, not . At dB, for 16-QAM is , so well below target. Gap to Shannon at : dB for the Shannon of dB, so uncoded 16-QAM at operates dB above the Shannon limit. Erez–Zamir mod- closes this dB gap.
ex-ch16-05
MediumProve the crypto-lemma rigorously: if is uniform on and independent of , then is uniform on and independent of .
Condition on and compute the conditional density.
Use the fact that shifting a uniform distribution on by gives a uniform distribution on the translated Voronoi region .
Show that mod- reduction of a uniform distribution on any fundamental region yields a uniform distribution on .
Conditional density
Condition on . The conditional density of is the uniform density on : for and elsewhere.
Translated fundamental region
The set is a translated Voronoi region, hence a fundamental region of (any translate of a fundamental region is a fundamental region). It tiles under -translation.
Modulo-reduction is bijective
The map is a bijection from any fundamental region to that preserves Lebesgue measure. So the conditional density of is on , i.e., uniform.
Independence
The conditional distribution is the same uniform distribution for every , so is independent of .
ex-ch16-06
MediumFor the Erez–Zamir scheme, verify the effective-noise variance calculation: , where , uniform on with per-dim second moment , and independent of .
Use independence of and to split the variance.
Second moment of the dither per dimension is by construction.
simplifies nicely using .
Expand
.
Substitute $\alpha$
, so . . Sum: .
ex-ch16-07
MediumShow that the normalised second moment of is for every , and that the shaping gain dB — confirming that cubic shaping (standard QAM) is the baseline.
Voronoi region of is the cube .
Per-dimension second moment of uniform on is .
Per-dimension second moment
Per-dimension second moment of uniform on is .
Normalised second moment
, so for every .
Shaping gain
, i.e., dB. The cube is the reference.
ex-ch16-08
MediumCompute the asymptotic shaping gain ceiling in dB, and compare it to the difference in bits per real dim between Shannon's capacity and Poltyrev's capacity at dB. Explain the connection.
.
Poltyrev capacity at : .
Asymptotic ceiling in dB
, so dB. Wait — this is closer to , not the often- quoted dB. The figure is the natural-log equivalent — hmm.
Let me redo: the standard dB is the factor expressed as where ... actually , , dB.
The key: , so and dB. This is the shaping gain ceiling.
Shannon vs Poltyrev at 20 dB
Shannon: bits/real dim. Poltyrev: bits/real dim. Gap: bits/real dim.
At dB, bits/real dim = dB of SNR gap at the same rate. This is the shaping gap — the exact dB by which a cubic-shaped (Poltyrev-optimal) lattice falls short of Shannon.
Connection
The gap between Shannon and Poltyrev capacities is exactly dB for any SNR (take the limit: Shannon Poltyrev bits as , which is dB). The Erez–Zamir scheme closes this dB by using a shaping lattice with — reducing the normalised second moment from the cube's to the ball's limit.
ex-ch16-09
MediumFor the dirty-paper channel , show that a naïve strategy (ignoring , decoding as if interference is noise) achieves rate , and compute the gap to the DPC capacity at , , .
Naïve strategy treats as Gaussian noise with combined variance.
.
Naïve rate
bits/real dim.
DPC rate
bits/real dim.
Gap
Rate gap: bits/real dim ≈ loss. The DPC advantage is massive in this regime. As , the naïve rate tends to while DPC remains at the clean-channel . The non-causal knowledge of at the transmitter is worth arbitrarily many bits when interference dominates.
ex-ch16-10
MediumRun the Schnorr–Euchner sphere decoder on with generator matrix and received vector . Identify the closest point and the number of sphere-decoder operations compared to brute force over a box.
QR-decompose first.
The lattice is the rotation of by , so after QR the search reduces to scaled .
QR decomposition
. Since 's columns have equal norm , is orthogonal and is upper-triangular. A direct computation gives (already orthogonal here after normalisation), .
Sphere decoder
Apply QR-rotated received: (after normalisation).
Babai: , . Then , . Total: . Closest point: . Verify: , norm . Hmm — might not be optimal. Try : distance — smaller!
Revise: at distance is closer than at . The correct closest point is .
Operation count
Sphere decoder: norm evaluations + QR overhead = operations. Brute force: candidates to check, each requiring a norm evaluation — operations. The sphere decoder is faster even at this tiny dimension; the advantage grows exponentially with .
ex-ch16-11
HardProve the "MMSE inflation lemma" from step 3 of the Erez–Zamir proof: show that the decoding error probability of a lattice on the effective channel with is upper-bounded (for large ) by the error probability on a pure Gaussian channel of the same variance .
Use the fact that the dither component has bounded support on , while the Gaussian component is unbounded.
Apply a large-deviation argument: the Gaussian tails dominate at high SNR.
For full rigour, use Erez–Zamir Lemma 6 (cumulant-generating-function domination).
Decompose $\mathbf{z}$
where has bounded norm ( where is the covering radius of ), and is Gaussian with per-dim variance .
Large-deviation argument
For the error event to occur, must leave the Voronoi region of . The Gaussian component alone has the tightest tails (Gaussian > bounded for a given variance), so the error probability is dominated by the Gaussian part: uniformly over .
Variance comparison
Erez–Zamir Lemma 6: the cumulant-generating function of is dominated (componentwise) by that of a Gaussian of per-dim variance . Hence The right-hand side vanishes exponentially for any rate below the Poltyrev capacity at effective noise variance .
ex-ch16-12
HardConsider a two-user Gaussian broadcast channel () with , , . The transmitter has a single antenna; User 1 is strong, User 2 is weak. Show how the DPC result of this chapter yields the capacity region of this two-user BC via sequential DPC (Cover–van der Meulen 1975; Weingarten–Steinberg–Shamai 2006), and compute the sum-capacity.
Split power: .
Sequential encoding: encode user 1's signal, then encode user 2's signal with user 1's signal known to the transmitter as interference.
Apply DPC to user 2's encoding: user 1's signal does not count as interference.
Setup
Two-user degraded Gaussian BC. User 1 (strong) sees ; User 2 (weak) sees .
Sequential DPC: encode user 2 first
Allocate power to user 2's signal . Rate: — the user 2 signal treats user 1's signal plus noise as interference.
Encode user 1 with DPC
User 1's signal is encoded via DPC against the (already-encoded) . By the DPC theorem, — no interference penalty from .
Sum-rate optimisation
Sum-rate: . Maximise over . The two-user BC sum-capacity is bits/real dim — achieved by (all power to the stronger user).
More interestingly, for general power allocations, the achievable rate region is , for . This is the full BC capacity region (Cover–van der Meulen).
Interpretation
The DPC allows the base station to support weak users without harming strong users — the weak-user interference is pre-cancelled via DPC for the strong user. This is exactly the principle behind MU-MIMO vector precoding (Weingarten–Steinberg–Shamai 2006).
ex-ch16-13
HardShow that in the Schnorr–Euchner zigzag order, at each layer the enumeration is optimal in the sense of minimum-cost branch-and-bound: for the same total search time, no other enumeration order prunes more of the tree.
At each layer, the partial-sum cost is increasing in where is the Babai estimate.
Zigzag ordering visits candidates in order of increasing cost: .
Any early termination via pruning is pessimised when the optimal candidate is reached first.
Partial-sum cost at a layer
At layer , the partial-sum cost is — increasing in .
Zigzag explores in cost order
The zigzag visits candidates in strictly increasing order of , hence in increasing order of layer cost.
Prune optimality
Branch-and-bound prunes when the partial sum exceeds the current best. Since costs are visited in increasing order, once a cost exceeds the best, all subsequent costs also do (they are monotonically larger). Hence the zigzag termination is the earliest possible: any other order visits some candidates out of cost order, wasting evaluations on costs larger than the eventual prune threshold.
Caveat
The optimality here is layer-local; global tree pruning depends on the initial sphere radius and on interactions across layers. Schnorr–Euchner's order is locally optimal but not globally optimal (exact global optimality would require an oracle for the optimal codeword's cost). In practice, initializing via the Babai estimate gets within a small constant of the optimal pruning.
ex-ch16-14
HardFor a rate bits/real dim nested lattice code with (Leech, shaping gain dB) and chosen to be Poltyrev-optimal in dimension (coding gain dB), compute the SNR gap to Shannon capacity and compare to the asymptotic dB + coding residual.
Shannon limit at is dB.
Erez–Zamir with shaping loses dB.
The Poltyrev-optimal in dimension achieves the full dB coding-gain target at the Shannon-minus-shaping-gap SNR.
Shannon limit
At : dB.
Shaping-gap loss
With shaping (gain dB vs cube's dB), the shaping gap is dB. An ideal Erez–Zamir scheme with shaping achieves at dB — dB above Shannon, due to the finite-dim shaping limit.
Coding-gap loss
A Poltyrev-optimal in dimension achieves the Poltyrev capacity, meaning it has zero coding-gap asymptotically. In practice, a dimension- lattice can be within dB of Poltyrev capacity. Total SNR gap: dB vs Shannon.
Comparison to the asymptotic $\pi e/6$
In the limit , both coding and shaping gaps close to zero; in finite dimension , roughly half of the dB shaping gap is recovered, and most of the coding gap is recovered, giving a total of dB from Shannon. This matches the published numerics for LDPC-lattice constructions in dimension – (Sommer–Feder–Shalvi 2008).
ex-ch16-15
MediumThe Erez–Zamir scheme uses a dither uniform on . Explain in operational terms what happens if the dither is wrong by (e.g., due to a synchronisation error between transmitter and receiver), and quantify the rate loss.
If the transmitter uses but the receiver uses , the decoder sees an effective dither error.
The mod- at the decoder maps the error into .
For small , the effective noise variance increases by per dimension.
Dither error propagation
Suppose the transmitter uses dither and the receiver uses . The decoder's mod reduction is instead of the correct .
Effective noise with dither error
The effective noise , where is the original effective noise. Assuming independent of : .
Rate loss
Rate loss bits/real dim. For small errors (typical synchronisation), this is negligible. For catastrophic errors ( on the scale of the constellation), the scheme breaks down entirely — error probability .
Practical implication
Dither synchronisation is a real concern in practice. It is typically handled via a shared seed (as in 3GPP scrambling codes) or via a pilot preamble that the receiver uses to lock onto the pseudo-random dither sequence. A dB synchronisation budget is typical in LAST-code implementations (Ch. 17).
ex-ch16-16
MediumCompare the information-theoretic gains of three modulation strategies at bits/real dim: (i) uncoded 256-QAM; (ii) Erez–Zamir with cubic shaping (no shaping gain); (iii) Erez–Zamir with shaping ( dB shaping gain). Use the Shannon limit as the reference.
Shannon at : dB.
Uncoded 256-QAM at BER : need dB.
Cubic-shaping Erez–Zamir: dB above Shannon (loses the shaping gain).
-shaping Erez–Zamir: dB above Shannon.
Shannon limit
: dB.
Uncoded 256-QAM
At BER : dB. Gap: dB. But: this is with no coding. With a rate-1 FEC code (no error correction), uncoded 256-QAM is bounded by its own constellation; to reach the Shannon limit we'd need infinite modulation order or perfect coding.
Erez–Zamir cubic
Cubic shaping: , dB. Erez–Zamir with perfect : dB dB. Gap to Shannon: dB (the unresolved shaping gap).
Erez–Zamir $E_8$-shaping
shaping: dB. Erez–Zamir with perfect : dB. Gap to Shannon: dB.
Summary
Moving from uncoded 256-QAM to cubic-shaping Erez–Zamir saves ~ dB; moving to -shaping saves another dB, for total ~ dB improvement. The remaining dB requires Ch. 15's Leech-lattice shaping and asymptotic-dimension arguments — the Rogers/Poltyrev limit.
ex-ch16-17
HardProve that a lattice decoder (argmin over ) applied to the Erez–Zamir mod- channel achieves the SAME rate as a message-set decoder (argmin over the nested code ) for any rate strictly below the Erez–Zamir capacity.
The lattice decoder returns ; the message-set decoder returns .
After mod-, the received signal already lies in ; closest point lies in this Voronoi cell.
Show that at rates below Erez–Zamir capacity, the lattice decoder's output is (w.h.p.) in .
Lattice decoder output
The lattice decoder finds with . The closest point in to a point in is typically in itself — i.e., in .
When does it leave $\mathcal{V}(\Lambda_s)$?
Only when the effective noise exceeds the covering radius , in which case the closest point may lie outside . Probability of this: .
At rates below capacity
At any rate , vanishes exponentially in (by the Erez–Zamir MMSE inflation lemma). Hence lattice decoder and message-set decoder agree with high probability; their rates are the same.
Near capacity
At the capacity, the two decoders diverge in their asymptotic error probability — the lattice decoder's error probability does not quite match the message-set decoder's. The gap, however, is sub-exponentially small. Erez–Litsyn–Zamir (2005) showed the lattice decoder achieves the capacity of the unconstrained AWGN (Poltyrev) channel, which coincides with the message-set capacity of the mod- channel.
ex-ch16-18
ChallengeExtend the Erez–Zamir theorem to the parallel AWGN channels setting: sub-channels with per-sub-channel noise variances , , and a total-power constraint . Show that a nested lattice code with appropriately-scaled per-sub- channel dithers achieves the water-filling capacity.
Water-filling: for Lagrange .
Per-sub-channel MMSE coefficients .
Use a separate nested-lattice pair per sub-channel, or a joint -dimensional lattice with scaled Voronoi.
Water-filling solution
The capacity-achieving power allocation is where is chosen to meet the total power constraint . The capacity is .
Per-sub-channel Erez–Zamir
Apply the Erez–Zamir scheme to each sub-channel independently: nested lattices with second moment , MMSE coefficient , dither uniform on .
Achievable rate
Each sub-channel achieves bits/real dim (Erez–Zamir). Summing: total rate = water-filling capacity.
Alternative: joint lattice
A single -dimensional lattice with Voronoi region anisotropically scaled (second-moment profile matching the water-filling profile) also works. In practice, the per-sub-channel scheme is simpler; the joint scheme can give marginal shaping-gain improvements. This construction is the lattice-code version of OFDM's adaptive bit-loading — see Ch. 9 of this book for BICM-OFDM, or Ch. 17 of the MIMO book for the MIMO- parallel-stream equivalent.
ex-ch16-19
ChallengeShow that the computational cost of the Erez–Zamir scheme per encoded symbol is polynomial in , specifically: encoding plus mod-; decoding plus mod-, assuming a fast quantiser for (e.g., ) and a sphere decoder for .
Encoding: map to (lookup, ), subtract dither and reduce mod-.
Decoding: QR + Schnorr–Euchner on .
For , the quantiser is per application (Forney 1989).
Encoder complexity
Step 1: : lookup via the enumeration (typically via a finite-state machine). Step 2: : requires . For or , the quantiser is (Forney 1989). Total encoder: .
Decoder complexity
Step 1: : . Step 2: : via quantiser. Step 3: : This is the CLP problem, dominated by QR () + Schnorr- Euchner zigzag ( at high SNR on average). Total decoder: .
LLL pre-processing
For poorly-conditioned , pre-process once with LLL: one-time cost. Per-symbol amortised cost: unchanged .
Comparison
Random Gaussian codebook decoding: per symbol — exponential. Erez–Zamir: per symbol — polynomial. The lattice-code formulation provides an exponential speedup, and this is the practical reason the entire theory exists.
ex-ch16-20
HardDescribe how the Erez–Zamir MMSE scalar appears in the Tomlinson–Harashima precoding (THP) for known ISI. Specifically: given an FIR channel with , , , derive the THP encoder and the effective noise variance after decoding.
Scalar version of lattice DPC: for integer .
THP: .
After scalar mod, effective noise is (as in Erez–Zamir).
THP encoder
. The past tap is the known ISI to be pre-cancelled. Mod- confines the transmit power.
Channel output
for some integer (absorbing the mod). After the receiver's own mod-: — the ISI is gone.
With MMSE $\alpha$
The full lattice DPC adds MMSE scaling: . Mod and reduce: effective noise variance per channel use (same as Erez–Zamir).
Capacity
Lattice DPC capacity: bits/real use. THP with scalar shaping: bits/real use (loses the dB shaping gain because is cubic). Non-cubic trellis shaping in V.34 recovers most of the bit/real dim.
ex-ch16-21
HardShow that the MU-MIMO broadcast channel capacity (with transmit antennas and single-antenna receivers, Gaussian channel) is achieved by sequential DPC applied in a user-encoding order that maximises the weighted sum-rate, where each user encodes against previously-encoded users as known interference.
Weingarten–Steinberg–Shamai (2006): sequential DPC achieves BC sum-capacity.
Each user's effective channel has interference from earlier users.
For each user, DPC pre-cancels this interference — no rate loss.
Setup
MIMO BC: for user , with the transmit vector and the user- channel.
Sequential DPC
Fix a permutation of . Encode users in order . When encoding user , the signals of users are known at the base station; they count as known interference. User 's encoder pre-cancels this interference via DPC.
Achievable rate for user $\pi(k)$
where is the user- transmit- covariance matrix.
The interference from users has been cancelled via DPC, so does not appear.
Sum-capacity
Optimising over transmit covariances and the encoding order : the Weingarten–Steinberg–Shamai 2006 theorem establishes this as the BC sum-capacity region. For symmetric-user cases, the encoding order does not matter; for asymmetric, different orders achieve different corner points of the capacity region.
Practical realisation
Vector perturbation (Hochwald–Peel–Swindlehurst 2005): each user's DPC is approximated via a lattice perturbation chosen to minimise the total transmit power. The perturbations live in and are found by a sphere decoder — the CLP algorithm of s05. This is fielded in 5G NR FR1 base stations (conceptually; the exact implementation is vendor-specific).
ex-ch16-22
ChallengeProve that the Erez–Zamir scheme is robust to SNR mismatch: if the receiver uses instead of the true , with mis-estimated by up to dB, the rate loss is at most dB.
With , the effective noise variance becomes .
At this is minimised at ; for it is slightly larger.
Quantify the degradation as a Taylor expansion around .
Effective noise with mismatch
Let for small . Effective noise variance: .
Taylor expand
. At , (this is the minimum). at any .
So .
Rate loss
Rate at mismatched : , shortfall from optimal: .
Using and : rate loss .
For ( dB SNR error = error of about at moderate SNR): rate loss bits/real dim, i.e., dB SNR gap. Rounded: dB.
Conclusion
The Erez–Zamir scheme is quadratically insensitive to mismatch — a error in (corresponding to a few dB of SNR estimation error) costs dB of rate loss. This is a key practical property: systems can use a stale or coarse SNR estimate and still reap most of the capacity benefit.