IR-LAST Codes Achieving the ARQ-DMT

From Existence to Construction: Explicit Codes for the ARQ-DMT

Section 2 proved that the ARQ-DMT dARQ(r,L)=Lβ‹…dβˆ—(r/L)d_\mathrm{ARQ}(r, L) = L \cdot d^{*}(r/L) is achievable β€” by a random Gaussian codebook and a joint ML decoder. But a random code is not a code one can deploy in a standard. To ship an ARQ-DMT-optimal scheme in LTE or NR, one needs an explicit construction: a deterministic code with a computable encoder, a computable (lattice-)decoder, and a structured puncturing pattern that supports incremental redundancy.

The question is the same one that drove Elia-Kumar-Pawar-Kumar-Lu- Caire 2006 (Ch. 13) and El Gamal-Caire-Damen 2004 (Ch. 17) in the static DMT setting: we need a code with the nonvanishing- determinant property, so that its codeword-difference matrix avoids near-singularity with probability one β€” the algebraic fingerprint of DMT optimality. The additional requirement for the ARQ setting is that the code must support puncturing: there must be a family of nested sub-codes C(1)βŠ‚C(2)βŠ‚β‹―βŠ‚C(L)C^{(1)} \subset C^{(2)} \subset \cdots \subset C^{(L)}, each a valid space-time code of shrinking rate, such that round β„“\ell's transmission Xβ„“\mathbf{X}_\ell consists of the symbols in C(β„“)βˆ–C(β„“βˆ’1)C^{(\ell)} \setminus C^{(\ell-1)}.

El Gamal, Caire, and Damen (2006) constructed exactly such a family: incremental-redundancy lattice space-time (IR-LAST) codes, obtained from nested lattice codes over a CDA-structured alphabet, with common random dithering across rounds. They proved that IR-LAST achieves the ARQ-DMT at every (r,L)(r, L). This was the first explicit DMT-optimal ARQ construction in the literature β€” and it remains the canonical reference construction twenty years later.

This section develops IR-LAST in three stages. First, we define nested lattice codes and the puncturing pattern (Β§3.1). Second, we state the IR-LAST DMT theorem (Β§3.2). Third, we illustrate the transmission timeline with an animated walk-through and a throughput simulation (Β§3.3).

,

Definition:

Nested Lattice Codes

A nested lattice code is a pair of lattices Ξ›fβŠƒΞ›c\Lambda_f \supset \Lambda_c in CntN\mathbb{C}^{n_t N}, where Ξ›f\Lambda_f is the fine lattice (the codebook) and Ξ›c\Lambda_c is the coarse lattice (the shaping region). The codebook is the set of cosets Cβ€…β€Š=β€…β€ŠΞ›f∩V(Ξ›c)β€…β€Š=β€…β€Š{Ξ»+Ξ›cβ€…β€Š:β€…β€ŠΞ»βˆˆΞ›f},C \;=\; \Lambda_f \cap \mathcal{V}(\Lambda_c) \;=\; \{\lambda + \Lambda_c \;:\; \lambda \in \Lambda_f\}, where V(Ξ›c)={x:βˆ₯xβˆ₯≀βˆ₯xβˆ’Ξ»βˆ₯β€‰βˆ€Ξ»βˆˆΞ›c}\mathcal{V}(\Lambda_c) = \{\mathbf{x} : \|\mathbf{x}\| \le \|\mathbf{x} - \lambda\|\, \forall \lambda \in \Lambda_c\} is the Voronoi region of Ξ›c\Lambda_c.

The rate of the nested code is R=log⁑2βˆ£Ξ›f/Ξ›c∣=log⁑2(det⁑Λc/det⁑Λf)R = \log_2 |\Lambda_f / \Lambda_c| = \log_2 (\det \Lambda_c / \det \Lambda_f) bits per codeword. For a LAST code with a CDA-structured generator matrix (as in Elia-Kumar-Pawar-Kumar-Lu-Caire 2006), the rate-rr code has nonvanishing determinant: inf⁑Xβ‰ X^det⁑(ΔΔH)β‰₯c(r)>0\inf_{\mathbf{X} \ne \hat{\mathbf{X}}} \det(\boldsymbol{\Delta}\boldsymbol{\Delta}^H) \ge c(r) > 0, where c(r)c(r) is an algebraic constant that does not decay with SNR. This nonvanishing-determinant property is the key to DMT optimality.

For the ARQ setting, we need a tower of nested lattices Ξ›f,1βŠƒΞ›f,2βŠƒβ‹―βŠƒΞ›f,LβŠƒΞ›c\Lambda_{f,1} \supset \Lambda_{f,2} \supset \cdots \supset \Lambda_ {f,L} \supset \Lambda_c, each pair DMT-optimal at its rate, with the decoding of round β„“\ell being equivalent to decoding in the coset Ξ›f,β„“/Ξ›c\Lambda_{f,\ell} / \Lambda_c of effective rate Rβ„“=log⁑2βˆ£Ξ›f,β„“/Ξ›c∣R_\ell = \log_2 |\Lambda_{f,\ell} / \Lambda_c|.

Two aspects of nested lattice codes matter for ARQ. First, the nesting gives a natural incremental-redundancy structure: each round unveils one more layer of the lattice hierarchy, and the decoder's search space at round β„“\ell is the coset Ξ›f,β„“/Ξ›c\Lambda_{f,\ell} / \Lambda_c. Second, the lattice structure admits efficient decoding via sphere decoding or MMSE-generalised-decision- feedback equalisation (MMSE-GDFE) β€” much cheaper than the Gaussian-random-code ML decoder assumed in the achievability proof of Β§2. Forward reference to Ch. 17 (LAST codes) for the lattice decoder side.

,

Definition:

Incremental-Redundancy Lattice Space-Time (IR-LAST) Codes

An IR-LAST code of mother-code rate RR per round and maximum round budget LL is a nested lattice code CβŠ‚CLntNC \subset \mathbb{C}^{L n_t N} constructed as follows.

  1. Pick a CDA (cyclic division algebra) A\mathcal{A} over a number field FF of degree ntn_t, giving an ntΓ—ntn_t \times n_t generator matrix GA\mathbf{G}_\mathcal{A} (Elia et al. 2006).
  2. Build the mother lattice Ξ›fβŠ‚CLntN\Lambda_f \subset \mathbb{C}^{L n_t N} by tensoring GA\mathbf{G}_\mathcal{A} with a scaled integer lattice Ξ²ZLN\beta \mathbb{Z}^{L N} of appropriate scale Ξ²\beta determined by the rate.
  3. Build the shaping lattice Ξ›c\Lambda_c as a scaled version of Ξ›f\Lambda_f so that Ξ›f/Ξ›c\Lambda_f / \Lambda_c has cardinality 2LRN2^{L R N} (total information bits =LRN= L R N).
  4. Partition X∈C\mathbf{X} \in C into LL fragments X1,…,XL∈CntΓ—N\mathbf{X}_1, \ldots, \mathbf{X}_L \in \mathbb{C}^{n_t \times N} along the temporal axis. Round β„“\ell transmits Xβ„“\mathbf{X}_\ell.
  5. Apply common random dithering: a dither vector d∼Unif(V(Ξ›c))\mathbf{d} \sim \text{Unif}(\mathcal{V}(\Lambda_c)) is shared with the receiver and added to the codeword before transmission, so that each transmitted fragment Xβ„“\mathbf{X}_\ell is effectively a uniformly-distributed element of the Voronoi region (matching the Gaussian-random-code statistics to leading order).
  6. Decoding at round β„“\ell is MMSE-GDFE lattice decoding on the accumulated observations (Y1,…,Yβ„“)(\mathbf{Y}_1, \ldots, \mathbf{Y}_\ell), operating over the coset Ξ›f,β„“/Ξ›c\Lambda_{f,\ell} / \Lambda_c induced by puncturing.

The construction delivers a code with (i) a CDA-structured nonvanishing-determinant property at every puncturing level, (ii) a decoder that is polynomial-time in the MIMO dimension ntn_t at fixed rate, and (iii) a natural incremental-redundancy schedule that lines up with the HARQ mechanism of LTE/NR.

The key novelty over static LAST codes is the common random dithering across rounds: a single dither vector d\mathbf{d} is shared across all LL rounds (not redrawn each round). This preserves the lattice structure required for the nonvanishing- determinant property while emulating the statistics of the Gaussian-random-code ensemble of Β§2. Without the common dither, the IR structure breaks the nonvanishing-determinant property and the code fails to achieve the ARQ-DMT at non-integer r/Lr/L.

Theorem: IR-LAST Codes Achieve the ARQ-DMT

Consider an ntΓ—nrn_t \times n_r i.i.d. Rayleigh block-fading MIMO channel with per-round block length Nβ‰₯ntN \ge n_t and at most LL ARQ rounds. An IR-LAST code of mother-code rate R(SNR)=rlog⁑2SNRR(\text{SNR}) = r \log_2 \text{SNR}, constructed from a CDA of degree ntn_t and operated with MMSE-GDFE lattice decoding and common random dithering, achieves the ARQ-DMT curve dIR-LAST(r,L)β€…β€Š=β€…β€ŠLβ‹…dβˆ—(r/L)β€…β€Š=β€…β€ŠdARQ(r,L).d_\mathrm{IR\text{-}LAST}(r, L) \;=\; L \cdot d^{*}(r/L) \;=\; d_\mathrm{ARQ}(r, L). for every (r,L)(r, L) with r∈[0,Lmin⁑(nt,nr)]r \in [0, L \min(n_t, n_r)] and Lβ‰₯1L \ge 1.

The argument stacks three existing pieces. First, the CDA construction (Elia et al. 2006, Ch. 13) gives a nonvanishing determinant at every puncturing level β€” so each of the LL incremental-redundancy layers individually achieves the static DMT at its effective rate. Second, the MMSE-GDFE lattice decoder (El Gamal-Caire-Damen 2004, Ch. 17) achieves the same exponent as the Gaussian-random-code ML decoder in Β§2. Third, the common random dither matches the code's statistics to the Gaussian ensemble, closing the gap between the structured-code exponent and the random-code exponent.

Putting the three pieces together: the LL rounds of IR-LAST are equivalent (in DMT exponent) to LL rounds of Gaussian random coding with joint ML decoding, which is the achievability construction of Thm. TARQ-DMT (El Gamal-Caire-Damen 2006). So IR-LAST achieves the ARQ-DMT.

,
πŸŽ“CommIT Contribution(2006)

IR-LAST Codes Achieving the ARQ Diversity-Multiplexing Tradeoff

H. El Gamal, G. Caire, M. O. Damen β€” IEEE Trans. Inf. Theory, vol. 52, no. 8, pp. 3601–3621

The 2006 El Gamal-Caire-Damen paper is a landmark contribution on two fronts.

(1) A new dimension for DMT. The paper generalises the Zheng-Tse DMT of Chapter 12 to the MIMO ARQ channel, introducing a third axis β€” the delay budget LL β€” into the diversity-multiplexing tradeoff. Before this paper, ARQ was studied as a throughput-smoothing mechanism (Wozencraft-Jacobs tradition) or a rate-flat retransmission protocol (Chase combining); the paper reframes ARQ as a genuine information-theoretic resource that reshapes the fundamental tradeoff curve.

(2) A clean closed-form ARQ-DMT. The central result is dARQ(r,L)=Lβ‹…dβˆ—(r/L)d_\mathrm{ARQ}(r, L) = L \cdot d^{*}(r/L) β€” a product structure so elegant it looks like it must be wrong. Each additional round gives a multiplicative factor in the diversity exponent: LL rounds deliver LL times the exponent of a single shot at the rate-stretched operating point r/Lr/L. The formula interpolates smoothly between the static DMT (L=1L = 1) and the ergodic-capacity regime (Lβ†’βˆžL \to \infty), making it a unifying result across the single-shot, multi-shot, and ergodic channel models.

(3) The first explicit ARQ-DMT-optimal code family. The paper constructs Incremental-Redundancy Lattice Space-Time (IR-LAST) codes β€” nested lattice codes over a CDA-structured alphabet with common random dithering β€” and proves they achieve dARQ(r,L)d_\mathrm{ARQ}(r, L) at every (r,L)(r, L). This was the first known code family to attain the ARQ-DMT and remains the canonical construction in the literature.

(4) An information-theoretic foundation for practical HARQ. The ARQ-DMT underpins the HARQ design of 3GPP LTE and 5G NR: the observation that every additional retransmission multiplies (rather than merely adds to) the diversity exponent is what motivates the circular-buffer IR-HARQ design of LTE Rel-8 and NR Rel-15. Every 3GPP link-level simulation reporting a BLER-vs-SNR slope for HARQ-IR is implicitly validating the ARQ-DMT prediction. The paper is cited repeatedly in 3GPP contributions on HARQ feedback timing, soft-buffer sizing, and URLLC latency budgeting.

This is a landmark result: it opens the delay dimension in the information-theoretic analysis of MIMO channels and establishes the mathematical foundation for HARQ protocols in every modern cellular standard.

arq-dmtir-lastharqmimodiversity-multiplexing-delayView Paper β†’

IR-LAST: Incremental Redundancy Across LL ARQ Rounds

Animated transmission timeline for an IR-LAST code with L=3L = 3 ARQ rounds. The video walks through the protocol step by step: round 1 transmits codeword fragment X1\mathbf{X}_1 through channel H1\mathbf{H}_{1}, the decoder attempts decoding, emits an ACK or NACK; on NACK, round 2 transmits fresh-parity fragment X2\mathbf{X}_2 through independent channel H2\mathbf{H}_{2}, the decoder now combines (Y1,Y2)(\mathbf{Y}_1, \mathbf{Y}_2) jointly against the nested lattice code Ξ›f,2/Ξ›c\Lambda_{f,2} / \Lambda_c, and so on up to round 3. The animation highlights the nested-lattice structure: each round unveils one more layer of the lattice hierarchy, and the decoder's search space shrinks (relative to the observation dimension) as more rounds accumulate.
The IR-LAST protocol delivers ARQ-DMT-optimal performance via three mechanisms: (i) CDA nonvanishing-determinant per round, (ii) nested-lattice puncturing across rounds, (iii) common random dithering for Gaussian-ensemble statistics. The diversity exponent after LL rounds is Lβ‹…dβˆ—(r/L)L \cdot d^{*}(r/L) β€” a multiplicative gain per round.

IR-LAST Effective Throughput vs SNR

Effective throughput Ξ·eff(SNR,L)=Rβ‹…(1βˆ’Pout(R,L))\eta_\mathrm{eff}(\text{SNR}, L) = R \cdot (1 - P_\mathrm{out}(R, L)) of an IR-LAST code at target per-round rate RR for several values of the round budget L∈{1,2,3,4}L \in \{1, 2, 3, 4\}. As LL grows, the BLER decays faster (ARQ-DMT scaling), so the throughput envelope rises toward the single-round capacity with a steeper slope. At high SNR all curves saturate at RR; the interesting region is the transition near the rate-RR outage threshold, where the ARQ-DMT slope benefit materialises.

Parameters
4
4

Example: IR-LAST BER at 1010 dB SNR on 2Γ—22\times 2, L=4L = 4

For an IR-LAST code on 2Γ—22 \times 2 MIMO with long-term effective multiplexing r=1r = 1 and delay budget L=4L = 4, estimate the decoding error exponent and the corresponding BER at SNR=10\text{SNR} = 10 dB. Compare with the L=1L = 1 (no ARQ) baseline at the same long-term rate.

⚠️Engineering Note

Finite-SNR Gap: ARQ-DMT Is Asymptotic

The ARQ-DMT exponent Lβ‹…dβˆ—(r/L)L \cdot d^{*}(r/L) is an asymptotic statement, exact only as SNRβ†’βˆž\text{SNR} \to \infty. At the moderate SNRs typical of cellular operation (1010–2525 dB), the empirical BLER-vs-SNR slope of an IR-LAST code typically lags the asymptotic slope by 2020–4040% β€” i.e., the empirical diversity is 0.6dARQ0.6 d_ \mathrm{ARQ} to 0.8dARQ0.8 d_\mathrm{ARQ} at 15 dB, reaching 0.9dARQ0.9 d_ \mathrm{ARQ} around 25 dB.

Three implications for a system designer.

  1. Coding gain matters as much as DMT at moderate SNR. The multiplicative constant in front of SNRβˆ’dARQ\text{SNR}^{-d_\mathrm{ARQ}} β€” the ARQ coding gain β€” depends on the CDA determinant constant and the lattice shaping. Two IR-LAST codes at the same dARQd_ \mathrm{ARQ} can differ by 22–44 dB at moderate SNR.
  2. The per-round rate r/Lr/L should be chosen so that the static DMT dβˆ—(r/L)d^{*}(r/L) is close to its maximum, i.e., r/Lβ†’0r/L \to 0. This means L≫rL \gg r β€” many rounds at low per-round rate. In LTE this translates to the L=4L = 4 default (4 HARQ rounds) at moderate rates and down to L=2L = 2 at high rates.
  3. Short block length NN degrades both the static DMT and the ARQ gain. If N<nt+nrβˆ’1N < n_t + n_r - 1, the static DMT is truncated (Ch. 12 Β§5). The ARQ-DMT formula assumes the per-round block length is long enough for the static DMT β€” if not, the formula overstates the achievable exponent.
Practical Constraints
  • β€’

    ARQ-DMT asymptotic SNR regime: typically SNRβ‰₯20\text{SNR} \ge 20 dB for 2Γ—22 \times 2, β‰₯25\ge 25 dB for 4Γ—44 \times 4.

  • β€’

    Per-round block length Nβ‰₯nt+nrβˆ’1N \ge n_t + n_r - 1 required for the static DMT to apply.

  • β€’

    Empirical diversity at 15 dB is ∼0.6\sim 0.6–0.8Γ—0.8 \times the asymptotic ARQ-DMT exponent.

πŸ“‹ Ref: El Gamal-Caire-Damen 2006, Β§VI, Fig. 3

Why Common Random Dither Is Needed

The common random dither d\mathbf{d} in IR-LAST (item 5 of Def. DIncremental-Redundancy Lattice Space-Time (IR-LAST) Codes) is a subtle but essential ingredient. Without dithering, the transmitted codeword is deterministic given the information bits, and the PEP bound on a particular codeword difference Ξ”\boldsymbol{\Delta} depends on the specific algebraic structure of Ξ”\boldsymbol{\Delta} β€” in particular on whether its determinant is the minimum allowed by the CDA nonvanishing-determinant property. Structured codes can have worst-case PEP exponents that are below the Gaussian-random-code exponent at finite SNR.

The dither fixes this by making the transmitted codeword uniformly distributed over the shaping Voronoi region. The PEP bound now averages over the dither, and the average matches the Gaussian-random-code bound β€” the exponent is determined by the distribution, not by a specific worst-case codeword pair. Dithering is the probabilistic ingredient that turns a deterministic algebraic code into a code whose ensemble-average performance matches the random-coding achievability proof.

The common (rather than per-round independent) dither is needed so that the lattice structure is preserved across rounds: if dβ„“\mathbf{d}_\ell were independent per round, the concatenated codeword (X1,…,XL)(\mathbf{X}_1, \ldots, \mathbf{X}_L) would no longer lie in the mother lattice Ξ›f\Lambda_f and the joint decoder would lose the lattice advantage. A single common dither shared across rounds reconciles the two requirements.

IR-HARQ Scheduler

Complexity: Per-round decoding complexity: O(nt3β‹…2rlog⁑2SNR)O(n_t^3 \cdot 2^{r \log_2 \text{SNR}}) for exact sphere decoding; O(nt3β‹…log⁑(SNR))O(n_t^3 \cdot \log(\text{SNR})) for MMSE-GDFE lattice decoding (polynomial at fixed rate).
Input: message W, mother code C, max rounds L, target rate R
Output: ACK (decoded) or error (block failure)
1. Encode W into long codeword c <- Encode(W, C)
2. Partition c into fragments (X_1, X_2, ..., X_L) of N channel uses each
3. Initialise accumulated observation buffer: Y_acc <- empty
4. For round ell = 1, 2, ..., L:
a. Transmitter sends X_ell over channel H_ell
b. Receiver obtains Y_ell = sqrt(snr/n_t) H_ell X_ell + noise_ell
c. Append Y_ell to Y_acc; update accumulated channel H_acc
d. Run MMSE-GDFE lattice decoder on (Y_acc, H_acc) over coset Lambda_{f,ell}/Lambda_c
e. If CRC passes: send ACK, return DECODED
f. Else: send NACK, continue
5. Return ERROR (block failure after L rounds)

This is the information-theoretic reference scheduler. Real implementations (LTE/NR) make several simplifications: (a) the decoder is LDPC-based rather than lattice-based (simpler hardware); (b) puncturing is specified by redundancy-version indices rather than explicit nested-lattice construction (see Β§5); (c) ACK/NACK is conveyed via PUCCH with its own error probability budget; (d) parallel HARQ processes (up to 16 in NR) prevent head-of-line blocking β€” see Β§5.

Quick Check

The nonvanishing-determinant property of IR-LAST means

inf⁑Δ≠0det⁑(ΔΔH)β‰₯c(r)>0\inf_{\boldsymbol{\Delta} \ne 0} \det(\boldsymbol{\Delta}\boldsymbol{\Delta}^H) \ge c(r) > 0 uniformly in SNR

The codeword-difference determinant vanishes linearly in SNR

All codewords have norm β‰₯c(r)\ge c(r)

The mother-code rate RR grows with SNR as R=rlog⁑2SNRR = r \log_2 \text{SNR}

Quick Check

IR-LAST codes achieve the ARQ-DMT because

CDA nonvanishing determinant + MMSE-GDFE + common random dither reproduce the Gaussian-random-code exponent

They repeat the same codeword LL times with different powers

They use a systematic LDPC code with rate matching

They avoid incremental redundancy in favour of retransmission of the systematic bits

IR-LAST Code

Incremental-Redundancy Lattice Space-Time code: a nested lattice code over a cyclic-division-algebra alphabet with common random dithering, constructed by El Gamal-Caire-Damen (2006) to achieve the ARQ-DMT Lβ‹…dβˆ—(r/L)L \cdot d^{*}(r/L) at every (r,L)(r, L). The first explicit ARQ-DMT-optimal code family in the literature.

Related: ARQ Diversity-Multiplexing-Delay Tradeoff dARQ(r,L)d_\mathrm{ARQ}(r, L), CDA Codewords Are Full Rank, Nested Lattice Codes, MMSE-GDFE Lattice Decoder

Nonvanishing Determinant (NVD)

Property of a space-time code CC that infβ‘Ξ”βˆˆCβˆ’C,Ξ”β‰ 0det⁑(ΔΔH)β‰₯c>0\inf_{\boldsymbol{\Delta} \in C - C, \boldsymbol{\Delta} \ne 0}\det(\boldsymbol{\Delta} \boldsymbol{\Delta}^H) \ge c > 0 uniformly in the SNR and the rate parameter. The NVD property is the algebraic fingerprint of DMT optimality, both in the static (Ch. 12) and ARQ (this chapter) settings. Achieved by CDA-structured codes (Elia et al. 2006).

Related: Incremental-Redundancy Lattice Space-Time (IR-LAST) Codes, Diversity-Multiplexing Tradeoff Curve dβˆ—(r)d^*(r), CDA Codewords Are Full Rank, The Golden Code

MMSE-GDFE Lattice Decoder

Minimum-mean-squared-error generalised-decision-feedback equaliser followed by a lattice closest-point decoder. On a MIMO channel, the MMSE-GDFE achieves the same DMT exponent as the joint maximum-likelihood decoder while running in polynomial time at fixed rate (El Gamal-Caire-Damen 2004, Ch. 17). The reference decoder for IR-LAST codes.

Related: Incremental-Redundancy Lattice Space-Time (IR-LAST) Codes, Lattice Decoding, MMSE Coefficient Ξ±\alpha, Pohst / Schnorr–Euchner Sphere Decoder