ARQ over MIMO Channels

Every Real Wireless System Retransmits

Chapter 12 gave us the DMT for a single block of a block-fading MIMO channel. A single-shot transmission is a mathematical fiction: it is the right abstraction for analysing capacity, but no operational wireless system on Earth actually works that way. LTE, 5G NR, Wi-Fi, DVB-S2X, and WiMAX all retransmit on decoding failure. The receiver sends back an ACK or a NACK, and the transmitter reacts — re-sending the same codeword, or sending fresh parity bits, until either the packet is decoded or a deadline is hit.

The point is that each retransmission happens on a new channel realisation — the fading coefficients in H2\mathbf{H}_{2} are (to a very good approximation) independent of those in H1\mathbf{H}_{1}, provided the gap between rounds exceeds the coherence time. That independence is operationally a free resource. Zheng and Tse showed what you can extract from one Wishart matrix; the question of this chapter is what you can extract from LL of them.

The answer, due to El Gamal, Caire, and Damen (2006), is elegant and surprising: with optimal incremental redundancy, the diversity exponent scales linearly in LL at fixed effective rate. The tradeoff generalises from the two-dimensional curve d(r)d^{*}(r) of Ch. 12 to a three-dimensional surface dARQ(r,L)d_\mathrm{ARQ}(r, L), with the beautiful product structure dARQ(r,L)  =  Ld(r/L).d_\mathrm{ARQ}(r, L) \;=\; L \cdot d^{*}(r/L). Each additional round multiplies the diversity exponent by 1/L1/L of the static curve evaluated at the same rate — each round earns its keep, and the earnings compound.

This chapter develops the protocol (§1), proves the ARQ-DMT theorem (§2), constructs the incremental-redundancy lattice codes that achieve it (§3), and maps the whole story onto the HARQ mechanism used in 3GPP LTE and 5G NR (§§4–5).

, ,

Definition:

ARQ Protocol over a Block-Fading MIMO Channel

An LL-round ARQ protocol over an nt×nrn_t \times n_r block-fading MIMO channel is the following sequence of operations, parametrised by a target rate RR and a maximum number of rounds L1L \ge 1:

  1. The transmitter has a single information message W{1,2,,2LNR}W \in \{1, 2, \ldots, 2^{LNR}\} of rate RR per round (so total rate LRLR over the LL-round block).
  2. In round =1\ell = 1, the transmitter sends a codeword matrix X1Cnt×N\mathbf{X}_1 \in \mathbb{C}^{n_t \times N} through channel H1\mathbf{H}_{1}; the receiver gets Y1  =  SNRntH1X1+w1,H1CN(0,I).\mathbf{Y}_1 \;=\; \sqrt{\tfrac{\text{SNR}}{n_t}}\,\mathbf{H}_{1} \mathbf{X}_1 + \mathbf{w}_{1}, \qquad \mathbf{H}_{1} \sim \mathcal{CN}(0, \mathbf{I}).
  3. The receiver attempts to decode using all observations so far (Y1,,Y)(\mathbf{Y}_1, \ldots, \mathbf{Y}_\ell). If decoding succeeds, it sends an ACK and the protocol halts. Otherwise, it sends a NACK on an (assumed error-free, zero-delay) feedback link.
  4. On NACK, the transmitter sends X+1\mathbf{X}_{\ell+1} — a new codeword matrix, possibly a function of the message WW alone or of WW together with (X1,,X)(\mathbf{X}_1, \ldots, \mathbf{X}_\ell). The channel H+1\mathbf{H}_{\ell+1} is drawn independently.
  5. The protocol halts at round LL whether or not decoding has succeeded; if it has not, the block is declared in error.

The protocol is characterised by the round codebooks X1,X2,,XL\mathbf{X}_1, \mathbf{X}_2, \ldots, \mathbf{X}_L and the decoding rule. Chase combining (CC) uses X=X1\mathbf{X}_\ell = \mathbf{X}_1 for all \ell — the same codeword retransmitted. Incremental redundancy (IR) uses X\mathbf{X}_\ell containing fresh parity symbols of a common mother code. The mother-code rate is RR per round, and after \ell rounds the effective rate seen by the decoder is Reff()=LR/R_\mathrm{eff}(\ell) = LR/\ell (the same information bits carried by Nnt\ell N n_t instead of LNntL N n_t channel uses).

Three subtleties. First, the channel matrices H\mathbf{H}_\ell are assumed i.i.d. across rounds — this is the independence assumption that makes ARQ buy diversity. If the coherence time is longer than the HARQ round-trip time, consecutive rounds see correlated fading and the diversity claim weakens (see ARQ Diversity Requires Independent Retransmissions). Second, the feedback link is idealised: error-free, zero-delay, single-bit ACK/NACK. In practice, NACK-to-ACK misinterpretation rates are 103\sim 10^{-3} and ACK/NACK travel over PUCCH/PDCCH with their own reliability budget. Third, the decoder is a joint ML decoder across all received rounds — it does not decode round-by-round, except as a complexity-limited approximation.

,

Definition:

Chase Combining (CC-HARQ)

A Chase-combining HARQ protocol transmits X=X1\mathbf{X}_\ell = \mathbf{X}_1 at every round {1,,L}\ell \in \{1, \ldots, L\} — i.e., it repeats the same codeword. The receiver combines the received signals at the log-likelihood level. For a binary-input AWGN round model y=hx+wy_\ell = h_\ell x + w_\ell, the combined LLR after \ell rounds is LLR(x)  =  k=1LLRk(x)  =  k=12hk2σ22Re(hkyk).\mathrm{LLR}_\ell(x) \;=\; \sum_{k=1}^\ell \mathrm{LLR}_k(x) \;=\; \sum_{k=1}^\ell \frac{2|h_k|^2}{{\sigma^2}^{2}}\,\mathrm{Re}(h_k^* y_k). The combined LLR is equivalent to coherent combining of \ell independent copies of the same symbol, so the effective SNR grows linearly in \ell: SNReff()=SNR\text{SNR}_\mathrm{eff}(\ell) = \ell \cdot \text{SNR}. In the MIMO setting, round \ell sees HX1+w\mathbf{H}_\ell \mathbf{X}_1 + \mathbf{w}_\ell, and the combined decoder sees a virtual nr×nt\ell n_r \times n_t channel stacking the individual Hk\mathbf{H}_{k}.

Chase combining is simple, legacy, and still the workhorse of low-complexity HARQ. Its name honours David Chase's 1985 IEEE Trans. Comm. paper which proved that summing LLRs across repetitions is asymptotically the ML decoder for the concatenated-packet ensemble. The rate after \ell rounds is R/R/\ell — the decoder is trying to decode the same RR bits spread across \ell times as many channel uses.

,

Definition:

Incremental Redundancy (IR-HARQ)

An incremental-redundancy HARQ protocol transmits a different codeword X\mathbf{X}_\ell at each round, typically obtained by puncturing a common mother code of low rate Rm=R/LR_m = R/L into LL consecutive parity fragments. Round \ell sends the \ell-th fragment. After \ell rounds, the decoder has observed an effective code of rate R=R(L/)R_\ell = R (L / \ell) — the systematic bits plus 1\ell - 1 additional parity fragments.

In the block-fading MIMO setting, round \ell uses codeword XCnt×N\mathbf{X}_\ell \in \mathbb{C}^{n_t \times N} and the receiver sees Y  =  SNRntHX+w.\mathbf{Y}_\ell \;=\; \sqrt{\tfrac{\text{SNR}}{n_t}}\,\mathbf{H}_\ell \mathbf{X}_\ell + \mathbf{w}_\ell. A joint ML decoder treats the concatenation (X1,,X)(\mathbf{X}_1, \ldots, \mathbf{X}_\ell) as a single long codeword passed through the block-diagonal channel diag(H1,,H)\mathrm{diag}(\mathbf{H}_{1}, \ldots, \mathbf{H}_\ell). The mutual information accumulated over \ell rounds is I  =  k=1log2det ⁣(I+SNRntHkHkH),I_\ell \;=\; \sum_{k=1}^\ell \log_2 \det\!\left(\mathbf{I} + \tfrac{\text{SNR}}{n_t} \mathbf{H}_{k} \mathbf{H}_{k}^{H}\right), independently over the \ell rounds. Decoding succeeds when ILRI_\ell \ge LR, i.e., when the cumulative mutual information exceeds the total number of information bits LRLR.

The IR flavour earns its name from the fact that round >1\ell > 1 carries no new information bits (the message WW is already fully determined by X1\mathbf{X}_1 in principle) but does carry new redundancy — fresh parity on the same underlying bits. The decoder strictly benefits from each additional fragment.

Two equivalent pictures of IR-HARQ are worth holding in mind. First, puncturing: round \ell unpunctures additional positions of a fixed rate-RmR_m mother code, lowering the effective rate. Second, rate matching via a circular buffer: the encoded bits of the mother code are laid out along a circular buffer, and each round reads out a fragment of length NntN n_t starting at a round-dependent offset. LTE and 5G NR adopt the circular-buffer formulation — see §5.

, ,

Chase Combining vs Incremental Redundancy vs No HARQ

PropertyNo HARQ (one-shot)Chase combining (CC)Incremental redundancy (IR)
Rounds allowedL=1L = 1L1L \ge 1L1L \ge 1
Round \ell payloadCodeword X1\mathbf{X}_1Repeat X1\mathbf{X}_1Fresh parity fragment X\mathbf{X}_\ell
Effective rate after \ell roundsRRR/R/\ellR(L/)R(L/\ell) of a rate-R/LR/L mother code
Combining at receiverNoneSum LLRs (coherent)Joint ML across (X1,,X)(\mathbf{X}_1, \ldots, \mathbf{X}_\ell)
Diversity per round at fixed rated(r)d^{*}(r)Ld(r)\le L \cdot d^{*}(r) (strictly)=Ld(r/L)= L \cdot d^{*}(r/L) (achieves ARQ-DMT)
Buffer at transmitterNoneNone (resend)Mother-code circular buffer
Use caseURLLC one-shotLegacy / low-complexityLTE, 5G NR eMBB, Wi-Fi
Computational complexityBaselineLow (LLR sum)Higher (full decoder on each round)

Theorem: Chase Combining Achieves at Most Ld(r)L \cdot d^{*}(r) Diversity

Consider an LL-round CC-HARQ protocol on an nt×nrn_t \times n_r i.i.d. Rayleigh MIMO channel with fixed per-round rate R(SNR)=rlog2SNRR(\text{SNR}) = r \log_2 \text{SNR} for all rounds. The LL-round error probability satisfies PeCC(SNR,L)    SNRdCC(r,L),dCC(r,L)    Ld(r).P_e^{\mathrm{CC}}(\text{SNR}, L) \;\doteq\; \text{SNR}^{-d^\mathrm{CC}(r, L)}, \qquad d^\mathrm{CC}(r, L) \;\le\; L \cdot d^{*}(r). In words: CC-HARQ can achieve at most LL times the static DMT evaluated at the original rate rr. The bound is attained only when the decoder is ML and the LL round channels are independent.

Comparison to IR. For the same LL and the same effective long-term rate rˉ=r\bar r = r, IR achieves diversity Ld(rˉ/L)=Ld(r/L)L \cdot d^{*}(\bar r / L) = L \cdot d^{*}(r/L), which is strictly greater than Ld(r)L \cdot d^{*}(r) for all r>0r > 0 because d()d^{*}(\cdot) is strictly decreasing on [0,min(nt,nr)][0, \min(n_t, n_r)].

CC resends the same codeword LL times. Each round adds an independent fading realisation, so the effective SNR after LL rounds is SNReffLSNR\text{SNR}_\mathrm{eff} \doteq L \cdot \text{SNR} — i.e., the combined LL-round channel is like a single round at SNR LSNRL \cdot \text{SNR}. But the rate has stayed at RR, not LRLR: CC trades the per-round rate for SNR, it does not use the extra channel uses to lower the rate slope. In DMT language, CC operates the original curve at a shifted SNR — which gives diversity Ld(r)L \cdot d^{*}(r), not Ld(r/L)L \cdot d^{*}(r/L). Since d(r/L)d(r)d^{*}(r/L) \ge d^{*}(r) with equality only at r=0r = 0, IR is strictly better.

,

ARQ Diversity Contribution vs Number of Rounds LL

Sweep the number of ARQ rounds L{1,2,,6}L \in \{1, 2, \ldots, 6\} and plot the ARQ-DMT diversity dARQ(r,L)=Ld(r/L)d_\mathrm{ARQ}(r, L) = L \cdot d^{*}(r/L) at several effective rates r{0.5,1,1.5,2}r \in \{0.5, 1, 1.5, 2\} for an nt×nrn_t \times n_r channel. The curves are strictly increasing in LL for r>0r > 0 — each round earns positive diversity — and the marginal gain per round is largest at high rr (where dd^{*} is steep) and smallest at low rr (where dd^{*} is flat). This is the operational illustration of the ARQ-DMT product rule.

Parameters
2
2

Example: 2×22\times 2 MIMO with L=3L = 3: CC vs IR at r=1r = 1

Compare the diversity exponents of CC-HARQ and IR-HARQ on a 2×22 \times 2 i.i.d. Rayleigh channel with L=3L = 3 ARQ rounds at long-term effective rate r=1r = 1. Use the 2×22 \times 2 static DMT d(r)=(2r)2d^{*}(r) = (2 - r)^2 at integer corners with piecewise-linear interpolation between.

,

Common Mistake: ARQ Diversity Requires Independent Retransmissions

Mistake:

Assuming that the ARQ-DMT diversity Ld(r/L)L \cdot d^{*}(r/L) is delivered simply because LL retransmissions took place. E.g., reading off the ARQ-DMT curve as if LL were the number of retransmitted copies rather than the number of independent fading realisations the decoder has access to.

Correction:

The ARQ-DMT presumes the LL channel realisations H1,,HL\mathbf{H}_{1}, \ldots, \mathbf{H}_{L} are independent. This requires the ARQ round-trip time TrttT_\mathrm{rtt} (the delay between round \ell and round +1\ell + 1) to exceed the channel coherence time TcohT_\mathrm{coh}. In practice:

  • Slow-moving users (pedestrian speeds, v1v \approx 1 m/s at 33 GHz give Tcoh100T_\mathrm{coh} \sim 100 ms): consecutive HARQ rounds Trtt4T_\mathrm{rtt} \sim 488 ms fall inside the coherence time; the effective number of independent realisations LeffL_\mathrm{eff} is closer to 11 than to the nominal LL.
  • Vehicular users (v30v \approx 30 m/s, Tcoh3T_\mathrm{coh} \sim 3 ms at 33 GHz): HARQ rounds are approximately independent; LeffLL_\mathrm{eff} \approx L.
  • mmWave / high-mobility (sub-millisecond coherence time): HARQ rounds are strongly independent; LeffLL_\mathrm{eff} \approx L.

The effective diversity in the correlated regime is Leffd(r/Leff)L_\mathrm{eff} \cdot d^{*}(r / L_\mathrm{eff}), not Ld(r/L)L \cdot d^{*}(r/L). Standards (LTE/NR) sometimes insert deliberate frequency hopping across HARQ retransmissions to decorrelate rounds even at low mobility — see §5.

,
⚠️Engineering Note

Latency Cost of ARQ

ARQ buys diversity but it costs latency. Each retransmission adds one HARQ round-trip time TrttT_\mathrm{rtt}, which in 5G NR depends on numerology:

Numerology μ\mu Slot length TrttT_\mathrm{rtt} (typical)
μ=0\mu = 0 (15 kHz) 11 ms 4488 ms
μ=1\mu = 1 (30 kHz) 0.50.5 ms 2244 ms
μ=3\mu = 3 (120 kHz, FR2) 0.1250.125 ms 0.50.511 ms
Mini-slot (URLLC) 2\sim 2 symbols 0.250.250.50.5 ms

For eMBB traffic (mobile broadband, L4L \le 4 HARQ rounds), a worst-case 4-round episode on μ=1\mu = 1 costs 16\le 16 ms of latency — inside the typical web-application latency budget. For URLLC traffic with a 11 ms end-to-end latency target, a single HARQ retransmission is already on the edge of the budget, and no-HARQ one-shot transmission (with a KK-repetition transmit-side diversity instead) is sometimes preferred. This is one of the reasons URLLC designs often skip HARQ entirely; the diversity gain of the ARQ-DMT is valuable only when the delay cost is tolerable.

Practical Constraints
  • HARQ RTT in 5G NR FR1 is 4\sim 488 ms; FR2 is 0.5\sim 0.511 ms.

  • Each HARQ round adds one RTT of end-to-end latency.

  • URLLC latency budget (1\sim 1 ms) allows at most one HARQ retransmission or no HARQ at all.

📋 Ref: 3GPP TS 38.214 §5.3 (scheduling and HARQ timing)

Historical Note: Wozencraft & Jacobs 1961: The Original ARQ

1961

The idea of automatic-repeat-request (ARQ) predates digital wireless. John Wozencraft and Irwin Mark Jacobs, in their 1965 textbook Principles of Communication Engineering (based on MIT lecture notes dating to 1961), formalised the three canonical ARQ protocols — stop-and-wait, go-back-NN, and selective-repeat — together with a rigorous analysis of their throughput and reliability. At the time, the target was wireline and satellite data links, not wireless cellular, and the channels of interest were binary symmetric or AWGN, not block-fading MIMO.

The Wozencraft-Jacobs formulation is rate-flat: every retransmission re-sends the same codeword. Diversity on a fading channel was not yet a concept in 1961 — the term "diversity order" emerged from wireless in the 1980s (Jakes, Turin). It was not until Caire-Tuninetti 2001 and especially El Gamal-Caire-Damen 2006 that the question "does ARQ earn diversity on a fading channel?" was answered with a closed-form tradeoff curve. The 45-year gap reflects the evolution of the relevant channel models: ARQ over AWGN is about throughput; ARQ over fading is about reliability-diversity-delay tradeoff.

Quick Check

For Chase-combining HARQ at per-round rate RR with LL independent rounds, the long-term effective rate delivered to the application is

R/LR/L, because the same bits are retransmitted LL times

RR, because each round can in principle decode

LRL \cdot R, because the decoder sees LL independent looks

R(L1)/LR \cdot (L - 1) / L, as with IR-HARQ

Quick Check

For the ARQ-DMT diversity Ld(r/L)L \cdot d^{*}(r/L) to be delivered, the LL channel realisations H1,,HL\mathbf{H}_{1}, \ldots, \mathbf{H}_{L} must be

Approximately independent — i.e., the HARQ RTT must exceed the channel coherence time

Identical — CC relies on the channel being the same across rounds

Correlated with a specific correlation matrix chosen by the standard

Block-diagonal in some eigenbasis of the covariance matrix

Hybrid ARQ (HARQ)

An ARQ protocol combined with forward-error correction: the receiver stores soft observations (LLRs) from failed rounds and combines them with fresh observations in subsequent rounds. The two canonical HARQ flavours are Chase combining (CC) — retransmit the same codeword, sum LLRs — and incremental redundancy (IR) — transmit fresh parity bits, run the decoder on the accumulated observations.

Related: Chase Combining (CC-HARQ), Incremental Redundancy (IR-HARQ), ARQ Diversity-Multiplexing-Delay Tradeoff dARQ(r,L)d_\mathrm{ARQ}(r, L), Redundancy Version (RV)

Chase Combining (CC-HARQ)

A HARQ flavour in which every round retransmits the same codeword X1\mathbf{X}_1. The receiver combines the LL rounds at the log-likelihood-ratio level, giving an effective SNR of LSNRL \cdot \text{SNR}. CC achieves at most Ld(r)L \cdot d^{*}(r) diversity at per-round rate rr (Chase 1985, Caire-Tuninetti 2001).

Related: 5G NR HARQ Process, Incremental Redundancy (IR-HARQ), ARQ Diversity-Multiplexing-Delay Tradeoff dARQ(r,L)d_\mathrm{ARQ}(r, L)

Incremental Redundancy (IR-HARQ)

A HARQ flavour in which round >1\ell > 1 transmits fresh parity obtained by puncturing a common mother code. The effective rate decreases with each round. IR achieves the ARQ-DMT Ld(r/L)L \cdot d^{*}(r/L) — strictly better than CC at every r>0r > 0 (El Gamal-Caire-Damen 2006).

Related: 5G NR HARQ Process, Chase Combining (CC-HARQ), ARQ Diversity-Multiplexing-Delay Tradeoff dARQ(r,L)d_\mathrm{ARQ}(r, L), Redundancy Version (RV)