Practical HARQ: Chase Combining vs Incremental Redundancy

From IR-LAST to LDPC-HARQ: What Real Systems Actually Do

Section 3 gave us an information-theoretic reference construction β€” IR-LAST β€” that achieves the ARQ-DMT at every (r,L)(r, L). But IR-LAST is not what LTE or 5G NR actually deploys. Real systems use LDPC codes (NR) or Turbo codes (LTE) with a circular-buffer rate matcher that generates a small number of predefined redundancy versions. The question of this section is: how close does the practical scheme get to the ARQ-DMT, and where does it fall short?

The short answer is that a well-designed LDPC-based IR-HARQ scheme achieves the ARQ-DMT asymptotically β€” the NVD-type property of a capacity-achieving LDPC code, combined with puncturing-compatible rate matching, gives the same Lβ‹…dβˆ—(r/L)L \cdot d^{*}(r/L) exponent at high SNR. The gap from IR-LAST is a matter of coding gain and finite-length effects, not asymptotic exponent.

This section develops the LTE/NR HARQ mechanism in three pieces. First, we define the redundancy versions and the circular-buffer rate matcher. Second, we state the LDPC-IR achieves-the-ARQ-DMT theorem. Third, we examine the practical tradeoffs between CC-HARQ and IR-HARQ in terms of BLER and latency.

,

Definition:

Redundancy Version (RV)

A redundancy version (RV) is an integer index RV∈{0,1,2,3}\mathrm{RV} \in \{0, 1, 2, 3\} that specifies which consecutive fragment of the mother code's circular buffer is transmitted in a given HARQ round.

Formally: the mother LDPC encoder produces a long codeword c∈{0,1}Nc\mathbf{c} \in \{0, 1\}^{N_c} of rate Rm=K/NcR_m = K / N_c where KK is the number of information bits. The circular-buffer rate matcher lays out c\mathbf{c} along a ring of NcN_c positions and defines four starting offsets k0(RV)k_0^{(\mathrm{RV})} for RV∈{0,1,2,3}\mathrm{RV} \in \{0, 1, 2, 3\}: k0(0)=0,k0(1)=⌈Nc/4βŒ‰,k0(2)=⌈Nc/2βŒ‰,k0(3)=⌈3Nc/4βŒ‰.k_0^{(0)} = 0,\qquad k_0^{(1)} = \lceil N_c / 4 \rceil,\qquad k_0^{(2)} = \lceil N_c / 2 \rceil,\qquad k_0^{(3)} = \lceil 3 N_c / 4\rceil. Round β„“\ell transmits a fragment of length E=K/Rβ„“E = K / R_\ell starting from position k0(RVβ„“)k_0^{(\mathrm{RV}_\ell)} and wrapping around the ring. Rβ„“R_\ell is the effective per-round code rate (chosen by the MCS β€” modulation and coding scheme β€” and the resource allocation).

The RV_0 fragment is chosen so that the systematic bits (the original information bits before encoding) are always at the start; hence RV_0 contains mainly systematic bits and RV_2 contains mainly parity bits from the far side of the ring. The typical RV sequence across retransmissions is 0, 2, 3, 1 β€” chosen to maximise the coverage of the circular buffer across four rounds.

The circular-buffer + RV mechanism elegantly realises incremental redundancy: each RV transmits a different portion of the mother code's output, and after all four RVs have been transmitted the receiver has seen (most of) the full mother code of rate RmR_m. The ARQ-DMT effective rate drops as Rm→0R_m \to 0 in the limit of many retransmissions.

Not every HARQ round bumps the RV: if the transport block (TB) is retransmitted verbatim (Chase combining mode), the RV stays at 0 and the scheme degenerates to CC-HARQ. In NR, the scheduler chooses the RV per retransmission based on the estimated channel quality β€” aggressive IR at low SNR (bigger coverage gain from new parity) vs CC-style at high SNR (simpler LLR combining).

,

Theorem: LDPC-IR Asymptotically Achieves the ARQ-DMT

Consider an IR-HARQ protocol on an ntΓ—nrn_t \times n_r i.i.d. Rayleigh block-fading MIMO channel with BICM signalling (LDPC mother code + circular-buffer rate matching + higher-order modulation). As the LDPC block length Kβ†’βˆžK \to \infty with effective rate Rmβ†’r/LR_m \to r / L, the LL-round BLER satisfies PeLDPC-IR(SNR,L)β€…β€Šβ‰β€…β€ŠSNRβˆ’Lβ‹…dβˆ—(r/L)β€…β€Š=β€…β€ŠSNRβˆ’dARQ(r,L).P_e^\mathrm{LDPC\text{-}IR}(\text{SNR}, L) \;\doteq\; \text{SNR}^{-L \cdot d^{*}(r/L)} \;=\; \text{SNR}^{-d_\mathrm{ARQ}(r, L)}. In words: LDPC-based IR-HARQ achieves the ARQ-DMT in the limit of long block length.

The proof uses the BICM capacity lifting of Ch. 5–7 together with the ARQ-DMT of Β§2: BICM is DMT-optimal under uniform input and Gray labelling, and the LDPC+IR system realises BICM plus the IR mechanism.

A capacity-achieving LDPC code at rate RmR_m achieves outage- limited performance on any memoryless channel with BICM capacity β‰₯Rm\ge R_m. On the LL-round ARQ channel, the combined observation after β„“\ell rounds is equivalent to a BICM channel with capacity βˆ‘k≀ℓIBICM(Hk)\sum_{k \le \ell} I_{\rm BICM}(\mathbf{H}_{k}); outage happens iff this sum is less than LRm=rlog⁑2SNRL R_m = r \log_2 \text{SNR}. The outage exponent of this event is (by the same argument as Β§2) Lβ‹…dβˆ—(r/L)L \cdot d^{*}(r/L). A long enough LDPC code gets arbitrarily close to this outage bound β€” hence the ARQ-DMT is asymptotically achieved.

,

BLER vs Number of HARQ Rounds: CC vs IR

Block error rate after β„“βˆˆ{1,2,3,4}\ell \in \{1, 2, 3, 4\} HARQ rounds for CC-HARQ and IR-HARQ at fixed SNR. The IR curve decays with a slope proportional to dβˆ—(r/L)d^{*}(r/L) per round, while CC decays with the (smaller) slope of dβˆ—(r)d^{*}(r) per round. At moderate SNR the gap between the two is ∼2\sim 2–44 dB per round; at high SNR the gap widens as the ARQ-DMT asymptote takes over.

Parameters
5

Example: IR-HARQ Effective Throughput: r=1r = 1 bit/ch.use on 2Γ—22\times 2

For a 2Γ—22 \times 2 MIMO channel with IR-HARQ, long-term rate r=1r = 1 bit/channel use, and delay budget L=4L = 4, compute the effective throughput Ξ·eff\eta_\mathrm{eff} as a function of SNR. What is the SNR needed for Ξ·effβ‰₯0.95\eta_\mathrm{eff} \ge 0.95?

,

Common Mistake: Budget LL by End-to-End Latency, Not by DMT Gain

Mistake:

Choosing the ARQ round budget LL by maximising the DMT gain Lβ‹…dβˆ—(r/L)L \cdot d^{*}(r/L), which grows unboundedly in LL. This leads to L=8L = 8 or higher β€” far beyond any practical latency budget.

Correction:

The ARQ round budget is fundamentally latency-bound, not diversity-bound. The end-to-end latency of a transmission that uses LL HARQ rounds is approximately Lβ‹…TrttL \cdot T_\mathrm{rtt}, and this must fit inside the application-level latency budget.

  • eMBB (mobile broadband, ~100 ms budget): L≀4L \le 4 is typical. The marginal DMT gain past L=4L = 4 is theoretically huge (each round adds ∼dβˆ—(0)=ntnr\sim d^{*}(0) = n_t n_r to the exponent), but the scheme spends most of its time in the cold-start latency regime before the gain materialises.
  • URLLC (ultra-reliable low latency, ~1 ms budget): L≀1L \le 1–22. Sometimes the preferred choice is no HARQ with aggressive one-shot transmission at very low rate β€” this simply cannot accommodate even one retransmission within the budget.
  • Satellite / NTN (non-terrestrial, RTT = 20–600 ms): L≀2L \le 2 is typical despite the long latency budget, because ACK/NACK takes hundreds of milliseconds round-trip.

The ARQ-DMT formula tells you the reliability per LL; the latency budget tells you the feasible LL. A system design picks the smallest LL that meets a target reliability at the given SNR β€” not the largest.

πŸ”§Engineering Note

HARQ Soft Buffer Sizing

A receiver performing IR-HARQ must store the soft LLRs of previously-received rounds so that they can be combined with fresh rounds. The required storage is roughly proportional to the number of coded bits transmitted across all rounds β€” a KK-bit transport block, rate-matched to a circular buffer of Ncβ‰ˆK/RmN_c \approx K / R_m positions, needs up to NcN_c LLR slots of typically 6–8 bits each.

For a typical 5G NR transport block of K=8448K = 8448 information bits and mother-code rate Rm=1/3R_m = 1/3, the soft buffer is β‰ˆ25,000\approx 25{,}000 LLRs. A UE supporting NHARQ=16N_{\rm HARQ} = 16 parallel HARQ processes (see Β§5) thus needs β‰ˆ400,000\approx 400{,}000 LLRs Γ—\times 8 bits =3.2= 3.2 Mbit of RAM just for HARQ.

NR's limited-buffer rate matching (LBRM) caps the soft-buffer size for low-tier UEs by limiting the effective mother-code rate they can use. A Category 4 UE (the mid-range class) is allowed up to 25% of the full mother code; a Category 20 UE (flagship) gets the full mother code. This is a real-world example where the ARQ-DMT prediction of "arbitrarily many IR rounds" is capped by silicon, not by information theory.

Practical Constraints
  • β€’

    IR-HARQ soft buffer ∝NcΓ—LLRΒ bit-width\propto N_c \times \text{LLR bit-width} per transport block per process.

  • β€’

    5G NR LBRM limits the fraction of the mother code that can be stored.

  • β€’

    Buffer size directly caps the achievable LL-round effective rate.

πŸ“‹ Ref: 3GPP TS 38.214 Β§5.4 (limited-buffer rate matching)

Quick Check

In 5G NR HARQ, RV_0 is designed to contain

The systematic bits of the LDPC codeword

Only parity bits from the far side of the circular buffer

A random subset of the codeword bits

The same bits as RV_1, shifted by half the buffer length

Quick Check

In the high-SNR limit, the BLER of an IR-HARQ scheme with L=3L = 3 rounds on 2Γ—22 \times 2 MIMO at long-term rate r=1r = 1 decays as

SNRβˆ’9\text{SNR}^{-9} β€” a slope of 99 on a log⁑\log-log⁑\log BLER plot

SNRβˆ’3\text{SNR}^{-3} β€” the number of rounds LL

SNRβˆ’1\text{SNR}^{-1} β€” the static DMT

SNRβˆ’4\text{SNR}^{-4} β€” the maximum single-round diversity

Why This Matters: Forward Link: BICM-OFDM in Wireless Standards

The LDPC-IR-HARQ architecture of this section is one piece of a larger wireless-standards ecosystem. In Chapter 21 we will see how it composes with OFDM (for frequency-selective channelisation) and space-time coding (Alamouti / V-BLAST) to form the full physical-layer pipeline of LTE and 5G NR. The flow of operations is: information bits β†’\to LDPC encoder β†’\to rate matcher (RV selection) β†’\to BICM interleaver β†’\to QAM mapper β†’\to layer mapping (spatial streams) β†’\to OFDM modulator β†’\to antenna ports. The ARQ-DMT of this chapter characterises the information-theoretic ceiling of the whole pipeline; Chapter 21 examines where real standards fall short of the ceiling and why.