HARQ in 5G NR: Redundancy Versions and Processes

5G NR: HARQ in the Real Radio Access Network

We arrive at the practical instantiation. 5G New Radio (NR) Release 15 adopted a HARQ design that directly reflects the ARQ-DMT principle of this chapter: stop-and-wait per-process operation, with up to 16 HARQ processes running in parallel, LDPC mother codes with circular-buffer rate matching driven by RV_0 through RV_3, and a flexible scheduling framework that picks the RV and the MCS per retransmission.

The fine details β€” the exact LDPC base graphs, the numerology- dependent HARQ RTT, the PUCCH/PDCCH feedback timing, the URLLC mini-slot HARQ β€” are all in the 3GPP specs (TS 38.212 for coding and TS 38.214 for procedures). We pull out the three design aspects that are information-theoretic consequences of the ARQ-DMT: (1) the choice of LL via the latency budget; (2) the choice of per-round MCS via the rate-rr/diversity-dd tradeoff; (3) the frequency-hopping across rounds to enforce independence.

,

Definition:

5G NR HARQ Process

A HARQ process in 5G NR is a state machine that tracks the current transmission status of a single transport block (TB). Each UE maintains up to NHARQ=16N_{\rm HARQ} = 16 parallel HARQ processes (NR supports up to 16 on the physical downlink shared channel PDSCH; 32 on NR-U unlicensed). A process carries:

  • HARQ process ID β€” an integer ∈{0,1,…,NHARQβˆ’1}\in \{0, 1, \ldots, N_{\rm HARQ} - 1\} uniquely identifying the slot assignment of this TB.
  • New-data indicator (NDI) β€” a toggle bit signalling whether the current transmission is a new TB or a retransmission of the prior TB in this process.
  • Redundancy version (RV) β€” the RV∈{0,1,2,3}\mathrm{RV} \in \{0, 1, 2, 3\} index specifying the circular-buffer fragment for this retransmission.
  • Transport block size (TBS) β€” number of information bits; constant across retransmissions of the same TB.
  • Soft buffer β€” LLR accumulator storing the decoder's combined observations from the previous rounds.

Stop-and-wait means: a process cannot send a new TB until the prior TB is acknowledged or abandoned. Parallel processes allow multiple TBs to be in flight simultaneously β€” the gNB schedules TB kk on process jj in slot nn, and the UE sends the ACK/NACK in slot n+K1n + K_1 (typically K1=2K_1 = 2 slots at ΞΌ=0\mu = 0). While waiting for ACK/NACK on process jj, other processes jβ€²β‰ jj' \ne j can transmit fresh TBs β€” filling the pipeline.

The NHARQ=16N_{\rm HARQ} = 16 choice reflects a specific tradeoff: with HARQ RTT ∼4\sim 4 slots and one TB per slot, 4 processes would be enough to keep the pipeline full without considering the variation in transmission times. The extra 1212 processes absorb jitter from scheduling, interference, and handover. Going beyond 16 yields diminishing returns; 16 is a round-number compromise between memory overhead (soft buffers!) and pipeline efficiency.

5G NR HARQ-IR Throughput Envelope

Effective throughput Ξ·eff\eta_\mathrm{eff} of a 5G NR IR-HARQ link as a function of the max round budget. Models LDPC mother code of rate 1/31/3, NR-style RV selection (RV_0 first, then RV_2, RV_3, RV_1), and i.i.d. Rayleigh 2Γ—22 \times 2 MIMO across rounds. The envelope grows with LL, saturating at the no-outage ceiling RR at high SNR. At moderate SNR, the incremental benefit per round decreases quickly beyond L=4L = 4 β€” the motivation for the 3GPP default of L=4L = 4 retransmissions.

Parameters
4

Definition:

Per-Round Frequency Hopping in 5G NR

To enforce the independence-across-rounds assumption of the ARQ-DMT, 5G NR supports per-round frequency hopping: the physical resource block (PRB) allocation for retransmission β„“\ell can be different from that of round β„“βˆ’1\ell - 1. Hopping is signalled via the 1-bit "frequency hopping flag" in the DCI (downlink control information) and can take one of two patterns:

  • Inter-slot hopping. Round β„“\ell uses PRBs {p0,…,p0+NPRBβˆ’1}\{p_0, \ldots, p_0 + N_\mathrm{PRB} - 1\}; round β„“+1\ell + 1 uses {p1,…,p1+NPRBβˆ’1}\{p_1, \ldots, p_1 + N_\mathrm{PRB} - 1\} with p1β‰ p0p_1 \ne p_0. Rounds decorrelate because the frequency offset exceeds the coherence bandwidth Bcoh∼1/(πτrms)B_\mathrm{coh} \sim 1 / (\pi \tau_\mathrm{rms}).
  • Intra-slot hopping. Within a single slot, the first and second halves of the transmission use different PRBs β€” providing decorrelation even within a single round for extra frequency-diversity protection.

Operationally, inter-slot hopping is the relevant mechanism for HARQ-round decorrelation: it ensures that H1,…,HL\mathbf{H}_{1}, \ldots, \mathbf{H}_{L} are approximately independent even when the user is stationary and the temporal coherence time is large.

,
🚨Critical Engineering Note

URLLC HARQ: One Retransmission Budget

Ultra-reliable low-latency communication (URLLC) has a target end-to-end latency of 11 ms at 99.999%99.999\% reliability (sometimes 99.9999%99.9999\%). The HARQ budget is correspondingly tight:

  • Mini-slot scheduling (2–7 symbols instead of the 14 of a normal slot) reduces the per-round transmission time to ∼0.125\sim 0.125–0.50.5 ms.
  • HARQ RTT target ≀0.5\le 0.5 ms in FR2 (millimetre-wave) numerology (ΞΌ=3\mu = 3, 0.1250.125 ms slots).
  • Max L=1L = 1–22 in practice: one retransmission if the budget allows, otherwise no HARQ.
  • PUCCH blind-decoding overhead (∼100\sim 100 ΞΌ\mus) eats into the budget.

When L=1L = 1 (no retransmission allowed), the ARQ-DMT reduces to the static DMT of Ch. 12. The URLLC reliability target is instead met by aggressive redundancy in a single shot β€” low-rate coding (mother code Rm=1/5R_m = 1/5 or lower), large MIMO arrays (nr=8n_r = 8 or more), and multiple replicas transmitted on different frequencies within the one allowed transmission. The diversity-multiplexing-delay tradeoff of Β§2 makes this quantitative: URLLC prefers to spend the "budget" on diversity (low rr, high dβˆ—(r)d^{*}(r)) rather than delay (high LL, high dARQd_\mathrm{ARQ}).

In practice, both mechanisms coexist: URLLC "PDCP duplication" sends the same packet on two independent physical paths (e.g., two base stations, or Wi-Fi + cellular). This is NOT HARQ β€” it is a separate reliability mechanism layered above HARQ β€” but it exploits the same underlying principle that independent observations multiply the reliability exponent.

Practical Constraints
  • β€’

    URLLC target: 1 ms end-to-end latency at 99.999% reliability.

  • β€’

    Max HARQ rounds in URLLC: L=1L = 1–22 (often zero).

  • β€’

    Mini-slot scheduling required to fit HARQ RTT inside latency budget.

  • β€’

    PDCP duplication provides path diversity above HARQ.

πŸ“‹ Ref: 3GPP TS 38.214 Β§5.3, Β§11 (URLLC scheduling)
πŸ”§Engineering Note

5G NR HARQ Process Count: Why 16?

NR specifies 16 HARQ processes per UE per cell on the downlink (and similarly on the uplink), up from LTE's 8. The rationale:

  • LTE (sub-6 GHz, ΞΌ=0\mu = 0, 1 ms slots, HARQ RTT ∼8\sim 8 slots): 8 processes fill the pipeline.
  • NR FR1 (sub-6 GHz, ΞΌ=0\mu = 0 or 11): same pipeline depth as LTE.
  • NR FR2 (millimetre-wave, ΞΌ=2\mu = 2 or 33, 0.250.25-0.1250.125 ms slots, HARQ RTT ∼16\sim 16 slots): requires 16 processes to keep the pipeline full at high throughput.

The tradeoff is that each HARQ process needs its own soft buffer storage (see Β§4). Sixteen processes at ∼400,000\sim 400{,}000 LLRs each Γ—8\times 8 bits =3.2= 3.2 Mbit of RAM per direction per cell. This is a non-trivial silicon cost for a low-end UE β€” hence the tiering of UE categories by HARQ-buffer size (LBRM).

A subtle point: HARQ process count does not affect the asymptotic ARQ-DMT; it affects throughput utilisation by preventing head-of-line blocking when multiple TBs are in flight. The diversity benefit comes from the number of rounds per TB (LL), not from the total number of processes.

Practical Constraints
  • β€’

    NR supports 16 HARQ processes per UE per direction per cell.

  • β€’

    Pipeline throughput requires NHARQβ‰₯ceil(Trtt/Tslot)N_\mathrm{HARQ} \ge \mathrm{ceil}(T_\mathrm{rtt}/T_\mathrm{slot}).

  • β€’

    Process count trades silicon area for pipeline efficiency.

πŸ“‹ Ref: 3GPP TS 38.214 Β§5.1 (HARQ processes)

Example: 5G NR HARQ Budget: ΞΌ=1\mu = 1, eMBB Service

A 5G NR eMBB service operates on numerology ΞΌ=1\mu = 1 (30 kHz subcarrier spacing, 0.5 ms slots). The PDSCH-to-PUCCH delay K1=2K_1 = 2 slots and the PUSCH-preparation time K2=2K_2 = 2 slots. What is the HARQ RTT, and how many HARQ processes are needed to keep the pipeline full at 100 Mbps?

Common Mistake: ACK/NACK Is Not Error-Free

Mistake:

Treating the ACK/NACK feedback in 5G NR as the noiseless one-bit channel assumed in the ARQ-DMT theorem. In reality, the PUCCH (physical uplink control channel) carries ACK/NACK at a target miss-detection rate of ∼10βˆ’3\sim 10^{-3} to 10βˆ’410^{-4}.

Correction:

The ARQ-DMT theorem assumes a zero-delay, zero-error ACK/NACK feedback link. In practice, PUCCH HARQ feedback has non-trivial error modes:

  • NACK-to-ACK (more serious): the UE NACKed but the gNB decoded it as an ACK. The gNB assumes success and does not retransmit; the TB is lost. This is the "silent failure" mode and is typically limited to ≀10βˆ’4\le 10^{-4}.
  • ACK-to-NACK: the UE ACKed but the gNB decoded it as a NACK. The gNB retransmits unnecessarily; no loss, just throughput penalty. Typically ≀10βˆ’3\le 10^{-3}.
  • DTX-to-NACK: no feedback received (UE didn't transmit anything); the gNB typically treats as NACK and retransmits.

Silent-failure rate sets a hard floor on the achievable BLER: even with infinite ARQ rounds, the end-to-end error rate is bounded below by the NACK-to-ACK rate. For URLLC's 10βˆ’510^{-5} target, the PUCCH reliability must exceed the data-channel reliability β€” hence NR's dedicated URLLC PUCCH format with Reed-Muller encoding and repetition.

Historical Note: From LTE HARQ to NR HARQ: The Flexibility Leap

2018

LTE's HARQ (Rel-8, 2009) was a direct realisation of Caire- Tuninetti 2001 and El Gamal-Caire-Damen 2006: LDPC mother codes (actually Turbo in LTE), circular-buffer rate matching, RVs 0–3, stop-and-wait with 8 parallel processes. The design was pragmatic and remained essentially unchanged through LTE-A.

NR (Rel-15, 2018) preserved the core structure but added flexibility along three axes:

  1. Numerology-dependent slots (ΞΌ=0\mu = 0 to 44, slots from 1 ms to 31 ΞΌ\mus): the same HARQ machinery works across a 200x dynamic range of HARQ RTT.
  2. Mini-slot scheduling (2–7 symbols) for URLLC: a single TB can fit in a fraction of a slot, enabling HARQ RTTs of ∼0.2\sim 0.2 ms.
  3. Code-block-group (CBG) re-transmission: instead of retransmitting the entire TB on NACK, NR can retransmit only the failed CBGs β€” saving air-time on partial failures.

The net effect is that NR HARQ realises the ARQ-DMT across a much wider range of (r,L,latency)(r, L, \mathrm{latency}) tuples than LTE could. The original El Gamal-Caire-Damen paper is cited in many 3GPP contributions on HARQ design β€” a concrete example of information theory shaping standards at Release-boundary depth.

Quick Check

In 5G NR HARQ, the "silent-failure" ACK/NACK error mode is

NACK-to-ACK: the UE NACKed but the gNB decoded as ACK, leading to no retransmission and an unrecoverable TB

ACK-to-NACK: unnecessary retransmission causes throughput penalty

DTX: the UE failed to transmit anything

The gNB scheduler picks the wrong process ID

Why This Matters: Forward Link: The Full BICM-OFDM-STBC Pipeline

This chapter sets the information-theoretic foundation for HARQ in cellular systems. Chapter 21 will put the HARQ mechanism in its full physical-layer context: BICM (Chs. 5–9), HARQ (this chapter), and OFDM-STBC (Ch. 22), composed into the transmit / receive chain of 5G NR. The ARQ-DMT tells us what the pipeline could achieve; Chapter 21 tells us what it actually does and where the gap comes from. Particular topics for Ch. 21: how rate matching interacts with MCS-adaptation at the scheduler level, how frequency-selective scheduling on an OFDM-subcarrier basis further decorrelates HARQ rounds, and how link adaptation (outer-loop link adaptation, OLLA) adjusts the MCS target based on the HARQ residual error rate.

HARQ Process

A state machine tracking the current transmission status of a single transport block in LTE/NR. Up to NHARQ=16N_{\rm HARQ} = 16 processes operate in parallel per UE per direction, each carrying its own soft buffer, NDI, RV, and TBS. Enables pipelined transmission without head-of-line blocking while preserving stop-and-wait semantics per process.

Related: 5G NR HARQ Process, Incremental Redundancy (IR-HARQ), Redundancy Version (RV), Transport Block

Redundancy Version (RV)

An integer index ∈{0,1,2,3}\in \{0, 1, 2, 3\} specifying the starting offset of the circular-buffer fragment transmitted in a given HARQ round (3GPP TS 38.212 §5.4.2). RV_0 includes the systematic bits of the LDPC codeword and is typically transmitted first; RV_2 is chosen from the far side of the buffer for maximum incremental-redundancy coverage on the second round.

Related: Incremental Redundancy (IR-HARQ), 5G NR HARQ Process, Circular Buffer, Rate Matching