ARQ over MIMO Channels
Every Real Wireless System Retransmits
Chapter 12 gave us the DMT for a single block of a block-fading MIMO channel. A single-shot transmission is a mathematical fiction: it is the right abstraction for analysing capacity, but no operational wireless system on Earth actually works that way. LTE, 5G NR, Wi-Fi, DVB-S2X, and WiMAX all retransmit on decoding failure. The receiver sends back an ACK or a NACK, and the transmitter reacts — re-sending the same codeword, or sending fresh parity bits, until either the packet is decoded or a deadline is hit.
The point is that each retransmission happens on a new channel realisation — the fading coefficients in are (to a very good approximation) independent of those in , provided the gap between rounds exceeds the coherence time. That independence is operationally a free resource. Zheng and Tse showed what you can extract from one Wishart matrix; the question of this chapter is what you can extract from of them.
The answer, due to El Gamal, Caire, and Damen (2006), is elegant and surprising: with optimal incremental redundancy, the diversity exponent scales linearly in at fixed effective rate. The tradeoff generalises from the two-dimensional curve of Ch. 12 to a three-dimensional surface , with the beautiful product structure Each additional round multiplies the diversity exponent by of the static curve evaluated at the same rate — each round earns its keep, and the earnings compound.
This chapter develops the protocol (§1), proves the ARQ-DMT theorem (§2), constructs the incremental-redundancy lattice codes that achieve it (§3), and maps the whole story onto the HARQ mechanism used in 3GPP LTE and 5G NR (§§4–5).
Definition: ARQ Protocol over a Block-Fading MIMO Channel
ARQ Protocol over a Block-Fading MIMO Channel
An -round ARQ protocol over an block-fading MIMO channel is the following sequence of operations, parametrised by a target rate and a maximum number of rounds :
- The transmitter has a single information message of rate per round (so total rate over the -round block).
- In round , the transmitter sends a codeword matrix through channel ; the receiver gets
- The receiver attempts to decode using all observations so far . If decoding succeeds, it sends an ACK and the protocol halts. Otherwise, it sends a NACK on an (assumed error-free, zero-delay) feedback link.
- On NACK, the transmitter sends — a new codeword matrix, possibly a function of the message alone or of together with . The channel is drawn independently.
- The protocol halts at round whether or not decoding has succeeded; if it has not, the block is declared in error.
The protocol is characterised by the round codebooks and the decoding rule. Chase combining (CC) uses for all — the same codeword retransmitted. Incremental redundancy (IR) uses containing fresh parity symbols of a common mother code. The mother-code rate is per round, and after rounds the effective rate seen by the decoder is (the same information bits carried by instead of channel uses).
Three subtleties. First, the channel matrices are assumed i.i.d. across rounds — this is the independence assumption that makes ARQ buy diversity. If the coherence time is longer than the HARQ round-trip time, consecutive rounds see correlated fading and the diversity claim weakens (see ⚠ARQ Diversity Requires Independent Retransmissions). Second, the feedback link is idealised: error-free, zero-delay, single-bit ACK/NACK. In practice, NACK-to-ACK misinterpretation rates are and ACK/NACK travel over PUCCH/PDCCH with their own reliability budget. Third, the decoder is a joint ML decoder across all received rounds — it does not decode round-by-round, except as a complexity-limited approximation.
Definition: Chase Combining (CC-HARQ)
Chase Combining (CC-HARQ)
A Chase-combining HARQ protocol transmits at every round — i.e., it repeats the same codeword. The receiver combines the received signals at the log-likelihood level. For a binary-input AWGN round model , the combined LLR after rounds is The combined LLR is equivalent to coherent combining of independent copies of the same symbol, so the effective SNR grows linearly in : . In the MIMO setting, round sees , and the combined decoder sees a virtual channel stacking the individual .
Chase combining is simple, legacy, and still the workhorse of low-complexity HARQ. Its name honours David Chase's 1985 IEEE Trans. Comm. paper which proved that summing LLRs across repetitions is asymptotically the ML decoder for the concatenated-packet ensemble. The rate after rounds is — the decoder is trying to decode the same bits spread across times as many channel uses.
Definition: Incremental Redundancy (IR-HARQ)
Incremental Redundancy (IR-HARQ)
An incremental-redundancy HARQ protocol transmits a different codeword at each round, typically obtained by puncturing a common mother code of low rate into consecutive parity fragments. Round sends the -th fragment. After rounds, the decoder has observed an effective code of rate — the systematic bits plus additional parity fragments.
In the block-fading MIMO setting, round uses codeword and the receiver sees A joint ML decoder treats the concatenation as a single long codeword passed through the block-diagonal channel . The mutual information accumulated over rounds is independently over the rounds. Decoding succeeds when , i.e., when the cumulative mutual information exceeds the total number of information bits .
The IR flavour earns its name from the fact that round carries no new information bits (the message is already fully determined by in principle) but does carry new redundancy — fresh parity on the same underlying bits. The decoder strictly benefits from each additional fragment.
Two equivalent pictures of IR-HARQ are worth holding in mind. First, puncturing: round unpunctures additional positions of a fixed rate- mother code, lowering the effective rate. Second, rate matching via a circular buffer: the encoded bits of the mother code are laid out along a circular buffer, and each round reads out a fragment of length starting at a round-dependent offset. LTE and 5G NR adopt the circular-buffer formulation — see §5.
Chase Combining vs Incremental Redundancy vs No HARQ
| Property | No HARQ (one-shot) | Chase combining (CC) | Incremental redundancy (IR) |
|---|---|---|---|
| Rounds allowed | |||
| Round payload | Codeword | Repeat | Fresh parity fragment |
| Effective rate after rounds | of a rate- mother code | ||
| Combining at receiver | None | Sum LLRs (coherent) | Joint ML across |
| Diversity per round at fixed rate | (strictly) | (achieves ARQ-DMT) | |
| Buffer at transmitter | None | None (resend) | Mother-code circular buffer |
| Use case | URLLC one-shot | Legacy / low-complexity | LTE, 5G NR eMBB, Wi-Fi |
| Computational complexity | Baseline | Low (LLR sum) | Higher (full decoder on each round) |
Theorem: Chase Combining Achieves at Most Diversity
Consider an -round CC-HARQ protocol on an i.i.d. Rayleigh MIMO channel with fixed per-round rate for all rounds. The -round error probability satisfies In words: CC-HARQ can achieve at most times the static DMT evaluated at the original rate . The bound is attained only when the decoder is ML and the round channels are independent.
Comparison to IR. For the same and the same effective long-term rate , IR achieves diversity , which is strictly greater than for all because is strictly decreasing on .
CC resends the same codeword times. Each round adds an independent fading realisation, so the effective SNR after rounds is — i.e., the combined -round channel is like a single round at SNR . But the rate has stayed at , not : CC trades the per-round rate for SNR, it does not use the extra channel uses to lower the rate slope. In DMT language, CC operates the original curve at a shifted SNR — which gives diversity , not . Since with equality only at , IR is strictly better.
Stack the rounds: the CC-HARQ decoder sees a virtual channel with block-diagonal structure all multiplying the same codeword .
The outage event is . Note the rate budget is , not : each round targets the same as a single-shot code.
The outage exponent is a sum of i.i.d. static-DMT exponents evaluated at : .
Stack the rounds
In CC-HARQ, round transmits over channel . Write the stacked observation as with of size .
Outage event
The mutual information of the stacked channel is Decoding at target rate per round succeeds iff (the code occupies channel uses per round, so the normalising denominator is , not — the rate is per-round, not per-block).
High-SNR exponent
For independent rounds, . By the same Wishart large-deviations argument as Ch. 12, the outage exponent of this sum is a convolution of i.i.d. copies of the single-round outage exponent. At rate , the cumulative outage exponent satisfies This is because each round independently pays in exponent, and the outage event requires all rounds to jointly fail — which gives an exponent of .
Converse inequality
Any code operating on the stacked channel must satisfy , so holds for every CC-HARQ scheme. Equality is attained by a Gaussian random code of rate and ML decoding across the rounds.
Comparison to IR
Fix a long-term effective rate ; CC sets per-round rate and achieves exponent . IR sets per-round rate (mother-code rate stretched across rounds) and achieves exponent — wait, this looks the same, but the rate per round is different. The proper comparison: at long-term effective rate , IR operates the static DMT at per round (because the mother code rate is and the rounds carry fragments), giving exponent . This is the ARQ-DMT advantage of §2.
ARQ Diversity Contribution vs Number of Rounds
Sweep the number of ARQ rounds and plot the ARQ-DMT diversity at several effective rates for an channel. The curves are strictly increasing in for — each round earns positive diversity — and the marginal gain per round is largest at high (where is steep) and smallest at low (where is flat). This is the operational illustration of the ARQ-DMT product rule.
Parameters
Example: MIMO with : CC vs IR at
Compare the diversity exponents of CC-HARQ and IR-HARQ on a i.i.d. Rayleigh channel with ARQ rounds at long-term effective rate . Use the static DMT at integer corners with piecewise-linear interpolation between.
Static DMT values
On : , , ; interpolation linear between. We will need and (the linear segment from to has slope ).
CC-HARQ exponent at $r = 1$, $L = 3$
CC operates the static DMT at per-round rate and sums exponents across independent rounds:
IR-HARQ exponent at $r = 1$, $L = 3$
IR uses a mother-code rate per round (a lower per-round rate than CC) and pays diversity per round:
Interpretation
IR triples the diversity exponent of CC ( vs ) at the same long-term rate. Operationally, at high SNR the IR-HARQ scheme delivers a 9-slope BLER-vs-SNR curve, compared to CC's 3-slope. For a target BLER of , IR needs roughly decades of SNR of coding-gain margin vs CC's decades at slope — i.e., IR outperforms CC by decades of SNR dB at the target BLER. This is the ARQ-DMT payoff.
Why IR wins
The key asymmetry: CC fixes the per-round rate at and buys only fading realisations at the same point on the static DMT. IR lowers the per-round rate to — where is larger — and buys the same fading realisations at a better point on the static DMT. IR is strictly better whenever is strictly decreasing, i.e., for all .
Latency Cost of ARQ
ARQ buys diversity but it costs latency. Each retransmission adds one HARQ round-trip time , which in 5G NR depends on numerology:
| Numerology | Slot length | (typical) |
|---|---|---|
| (15 kHz) | ms | – ms |
| (30 kHz) | ms | – ms |
| (120 kHz, FR2) | ms | – ms |
| Mini-slot (URLLC) | symbols | – ms |
For eMBB traffic (mobile broadband, HARQ rounds), a worst-case 4-round episode on costs ms of latency — inside the typical web-application latency budget. For URLLC traffic with a ms end-to-end latency target, a single HARQ retransmission is already on the edge of the budget, and no-HARQ one-shot transmission (with a -repetition transmit-side diversity instead) is sometimes preferred. This is one of the reasons URLLC designs often skip HARQ entirely; the diversity gain of the ARQ-DMT is valuable only when the delay cost is tolerable.
- •
HARQ RTT in 5G NR FR1 is – ms; FR2 is – ms.
- •
Each HARQ round adds one RTT of end-to-end latency.
- •
URLLC latency budget ( ms) allows at most one HARQ retransmission or no HARQ at all.
Historical Note: Wozencraft & Jacobs 1961: The Original ARQ
1961The idea of automatic-repeat-request (ARQ) predates digital wireless. John Wozencraft and Irwin Mark Jacobs, in their 1965 textbook Principles of Communication Engineering (based on MIT lecture notes dating to 1961), formalised the three canonical ARQ protocols — stop-and-wait, go-back-, and selective-repeat — together with a rigorous analysis of their throughput and reliability. At the time, the target was wireline and satellite data links, not wireless cellular, and the channels of interest were binary symmetric or AWGN, not block-fading MIMO.
The Wozencraft-Jacobs formulation is rate-flat: every retransmission re-sends the same codeword. Diversity on a fading channel was not yet a concept in 1961 — the term "diversity order" emerged from wireless in the 1980s (Jakes, Turin). It was not until Caire-Tuninetti 2001 and especially El Gamal-Caire-Damen 2006 that the question "does ARQ earn diversity on a fading channel?" was answered with a closed-form tradeoff curve. The 45-year gap reflects the evolution of the relevant channel models: ARQ over AWGN is about throughput; ARQ over fading is about reliability-diversity-delay tradeoff.
Quick Check
For Chase-combining HARQ at per-round rate with independent rounds, the long-term effective rate delivered to the application is
, because the same bits are retransmitted times
, because each round can in principle decode
, because the decoder sees independent looks
, as with IR-HARQ
CC repeats the same codeword times, so the bits of are now spread across channel uses, giving effective rate . This is a substantial rate cost compared to IR's asymptotic-cost at the same protocol length.
Quick Check
For the ARQ-DMT diversity to be delivered, the channel realisations must be
Approximately independent — i.e., the HARQ RTT must exceed the channel coherence time
Identical — CC relies on the channel being the same across rounds
Correlated with a specific correlation matrix chosen by the standard
Block-diagonal in some eigenbasis of the covariance matrix
Exactly. Independence is what makes each round a fresh draw from the fading distribution, so that the failure probabilities multiply. See ⚠ARQ Diversity Requires Independent Retransmissions.
Hybrid ARQ (HARQ)
An ARQ protocol combined with forward-error correction: the receiver stores soft observations (LLRs) from failed rounds and combines them with fresh observations in subsequent rounds. The two canonical HARQ flavours are Chase combining (CC) — retransmit the same codeword, sum LLRs — and incremental redundancy (IR) — transmit fresh parity bits, run the decoder on the accumulated observations.
Related: Chase Combining (CC-HARQ), Incremental Redundancy (IR-HARQ), ARQ Diversity-Multiplexing-Delay Tradeoff , Redundancy Version (RV)
Chase Combining (CC-HARQ)
A HARQ flavour in which every round retransmits the same codeword . The receiver combines the rounds at the log-likelihood-ratio level, giving an effective SNR of . CC achieves at most diversity at per-round rate (Chase 1985, Caire-Tuninetti 2001).
Related: 5G NR HARQ Process, Incremental Redundancy (IR-HARQ), ARQ Diversity-Multiplexing-Delay Tradeoff
Incremental Redundancy (IR-HARQ)
A HARQ flavour in which round transmits fresh parity obtained by puncturing a common mother code. The effective rate decreases with each round. IR achieves the ARQ-DMT — strictly better than CC at every (El Gamal-Caire-Damen 2006).
Related: 5G NR HARQ Process, Chase Combining (CC-HARQ), ARQ Diversity-Multiplexing-Delay Tradeoff , Redundancy Version (RV)