HARQ in 5G NR: Redundancy Versions and Processes
5G NR: HARQ in the Real Radio Access Network
We arrive at the practical instantiation. 5G New Radio (NR) Release 15 adopted a HARQ design that directly reflects the ARQ-DMT principle of this chapter: stop-and-wait per-process operation, with up to 16 HARQ processes running in parallel, LDPC mother codes with circular-buffer rate matching driven by RV_0 through RV_3, and a flexible scheduling framework that picks the RV and the MCS per retransmission.
The fine details β the exact LDPC base graphs, the numerology- dependent HARQ RTT, the PUCCH/PDCCH feedback timing, the URLLC mini-slot HARQ β are all in the 3GPP specs (TS 38.212 for coding and TS 38.214 for procedures). We pull out the three design aspects that are information-theoretic consequences of the ARQ-DMT: (1) the choice of via the latency budget; (2) the choice of per-round MCS via the rate-/diversity- tradeoff; (3) the frequency-hopping across rounds to enforce independence.
Definition: 5G NR HARQ Process
5G NR HARQ Process
A HARQ process in 5G NR is a state machine that tracks the current transmission status of a single transport block (TB). Each UE maintains up to parallel HARQ processes (NR supports up to 16 on the physical downlink shared channel PDSCH; 32 on NR-U unlicensed). A process carries:
- HARQ process ID β an integer uniquely identifying the slot assignment of this TB.
- New-data indicator (NDI) β a toggle bit signalling whether the current transmission is a new TB or a retransmission of the prior TB in this process.
- Redundancy version (RV) β the index specifying the circular-buffer fragment for this retransmission.
- Transport block size (TBS) β number of information bits; constant across retransmissions of the same TB.
- Soft buffer β LLR accumulator storing the decoder's combined observations from the previous rounds.
Stop-and-wait means: a process cannot send a new TB until the prior TB is acknowledged or abandoned. Parallel processes allow multiple TBs to be in flight simultaneously β the gNB schedules TB on process in slot , and the UE sends the ACK/NACK in slot (typically slots at ). While waiting for ACK/NACK on process , other processes can transmit fresh TBs β filling the pipeline.
The choice reflects a specific tradeoff: with HARQ RTT slots and one TB per slot, 4 processes would be enough to keep the pipeline full without considering the variation in transmission times. The extra processes absorb jitter from scheduling, interference, and handover. Going beyond 16 yields diminishing returns; 16 is a round-number compromise between memory overhead (soft buffers!) and pipeline efficiency.
5G NR HARQ-IR Throughput Envelope
Effective throughput of a 5G NR IR-HARQ link as a function of the max round budget. Models LDPC mother code of rate , NR-style RV selection (RV_0 first, then RV_2, RV_3, RV_1), and i.i.d. Rayleigh MIMO across rounds. The envelope grows with , saturating at the no-outage ceiling at high SNR. At moderate SNR, the incremental benefit per round decreases quickly beyond β the motivation for the 3GPP default of retransmissions.
Parameters
Definition: Per-Round Frequency Hopping in 5G NR
Per-Round Frequency Hopping in 5G NR
To enforce the independence-across-rounds assumption of the ARQ-DMT, 5G NR supports per-round frequency hopping: the physical resource block (PRB) allocation for retransmission can be different from that of round . Hopping is signalled via the 1-bit "frequency hopping flag" in the DCI (downlink control information) and can take one of two patterns:
- Inter-slot hopping. Round uses PRBs ; round uses with . Rounds decorrelate because the frequency offset exceeds the coherence bandwidth .
- Intra-slot hopping. Within a single slot, the first and second halves of the transmission use different PRBs β providing decorrelation even within a single round for extra frequency-diversity protection.
Operationally, inter-slot hopping is the relevant mechanism for HARQ-round decorrelation: it ensures that are approximately independent even when the user is stationary and the temporal coherence time is large.
URLLC HARQ: One Retransmission Budget
Ultra-reliable low-latency communication (URLLC) has a target end-to-end latency of ms at reliability (sometimes ). The HARQ budget is correspondingly tight:
- Mini-slot scheduling (2β7 symbols instead of the 14 of a normal slot) reduces the per-round transmission time to β ms.
- HARQ RTT target ms in FR2 (millimetre-wave) numerology (, ms slots).
- Max β in practice: one retransmission if the budget allows, otherwise no HARQ.
- PUCCH blind-decoding overhead ( s) eats into the budget.
When (no retransmission allowed), the ARQ-DMT reduces to the static DMT of Ch. 12. The URLLC reliability target is instead met by aggressive redundancy in a single shot β low-rate coding (mother code or lower), large MIMO arrays ( or more), and multiple replicas transmitted on different frequencies within the one allowed transmission. The diversity-multiplexing-delay tradeoff of Β§2 makes this quantitative: URLLC prefers to spend the "budget" on diversity (low , high ) rather than delay (high , high ).
In practice, both mechanisms coexist: URLLC "PDCP duplication" sends the same packet on two independent physical paths (e.g., two base stations, or Wi-Fi + cellular). This is NOT HARQ β it is a separate reliability mechanism layered above HARQ β but it exploits the same underlying principle that independent observations multiply the reliability exponent.
- β’
URLLC target: 1 ms end-to-end latency at 99.999% reliability.
- β’
Max HARQ rounds in URLLC: β (often zero).
- β’
Mini-slot scheduling required to fit HARQ RTT inside latency budget.
- β’
PDCP duplication provides path diversity above HARQ.
5G NR HARQ Process Count: Why 16?
NR specifies 16 HARQ processes per UE per cell on the downlink (and similarly on the uplink), up from LTE's 8. The rationale:
- LTE (sub-6 GHz, , 1 ms slots, HARQ RTT slots): 8 processes fill the pipeline.
- NR FR1 (sub-6 GHz, or ): same pipeline depth as LTE.
- NR FR2 (millimetre-wave, or , - ms slots, HARQ RTT slots): requires 16 processes to keep the pipeline full at high throughput.
The tradeoff is that each HARQ process needs its own soft buffer storage (see Β§4). Sixteen processes at LLRs each bits Mbit of RAM per direction per cell. This is a non-trivial silicon cost for a low-end UE β hence the tiering of UE categories by HARQ-buffer size (LBRM).
A subtle point: HARQ process count does not affect the asymptotic ARQ-DMT; it affects throughput utilisation by preventing head-of-line blocking when multiple TBs are in flight. The diversity benefit comes from the number of rounds per TB (), not from the total number of processes.
- β’
NR supports 16 HARQ processes per UE per direction per cell.
- β’
Pipeline throughput requires .
- β’
Process count trades silicon area for pipeline efficiency.
Example: 5G NR HARQ Budget: , eMBB Service
A 5G NR eMBB service operates on numerology (30 kHz subcarrier spacing, 0.5 ms slots). The PDSCH-to-PUCCH delay slots and the PUSCH-preparation time slots. What is the HARQ RTT, and how many HARQ processes are needed to keep the pipeline full at 100 Mbps?
HARQ RTT
ms (plus sub-ms processing overhead). Within the 2 ms RTT, slots pass.
Processes needed for pipeline
To transmit a new TB every slot without stalling, we need processes. NR's maximum of 16 comfortably accommodates this with margin for handover, error cases, and jitter.
Latency for $L = 4$ retransmissions
In the worst case of 4 HARQ rounds, latency ms β well inside the typical - ms eMBB application-layer budget. The system can comfortably exploit the ARQ-DMT gain of .
ARQ-DMT gain at $r = 2$
At long-term effective rate bits/channel use on : ; . Compared to (one- shot at β literally no reliability at all), this is a qualitative jump: from "barely works" to "works with reliability ".
Common Mistake: ACK/NACK Is Not Error-Free
Mistake:
Treating the ACK/NACK feedback in 5G NR as the noiseless one-bit channel assumed in the ARQ-DMT theorem. In reality, the PUCCH (physical uplink control channel) carries ACK/NACK at a target miss-detection rate of to .
Correction:
The ARQ-DMT theorem assumes a zero-delay, zero-error ACK/NACK feedback link. In practice, PUCCH HARQ feedback has non-trivial error modes:
- NACK-to-ACK (more serious): the UE NACKed but the gNB decoded it as an ACK. The gNB assumes success and does not retransmit; the TB is lost. This is the "silent failure" mode and is typically limited to .
- ACK-to-NACK: the UE ACKed but the gNB decoded it as a NACK. The gNB retransmits unnecessarily; no loss, just throughput penalty. Typically .
- DTX-to-NACK: no feedback received (UE didn't transmit anything); the gNB typically treats as NACK and retransmits.
Silent-failure rate sets a hard floor on the achievable BLER: even with infinite ARQ rounds, the end-to-end error rate is bounded below by the NACK-to-ACK rate. For URLLC's target, the PUCCH reliability must exceed the data-channel reliability β hence NR's dedicated URLLC PUCCH format with Reed-Muller encoding and repetition.
Historical Note: From LTE HARQ to NR HARQ: The Flexibility Leap
2018LTE's HARQ (Rel-8, 2009) was a direct realisation of Caire- Tuninetti 2001 and El Gamal-Caire-Damen 2006: LDPC mother codes (actually Turbo in LTE), circular-buffer rate matching, RVs 0β3, stop-and-wait with 8 parallel processes. The design was pragmatic and remained essentially unchanged through LTE-A.
NR (Rel-15, 2018) preserved the core structure but added flexibility along three axes:
- Numerology-dependent slots ( to , slots from 1 ms to 31 s): the same HARQ machinery works across a 200x dynamic range of HARQ RTT.
- Mini-slot scheduling (2β7 symbols) for URLLC: a single TB can fit in a fraction of a slot, enabling HARQ RTTs of ms.
- Code-block-group (CBG) re-transmission: instead of retransmitting the entire TB on NACK, NR can retransmit only the failed CBGs β saving air-time on partial failures.
The net effect is that NR HARQ realises the ARQ-DMT across a much wider range of tuples than LTE could. The original El Gamal-Caire-Damen paper is cited in many 3GPP contributions on HARQ design β a concrete example of information theory shaping standards at Release-boundary depth.
Quick Check
In 5G NR HARQ, the "silent-failure" ACK/NACK error mode is
NACK-to-ACK: the UE NACKed but the gNB decoded as ACK, leading to no retransmission and an unrecoverable TB
ACK-to-NACK: unnecessary retransmission causes throughput penalty
DTX: the UE failed to transmit anything
The gNB scheduler picks the wrong process ID
Yes. NACK-to-ACK is the critical error mode because the gNB discards the TB assuming success. NR caps this rate at and uses DTX-aware feedback formats to manage it.
Why This Matters: Forward Link: The Full BICM-OFDM-STBC Pipeline
This chapter sets the information-theoretic foundation for HARQ in cellular systems. Chapter 21 will put the HARQ mechanism in its full physical-layer context: BICM (Chs. 5β9), HARQ (this chapter), and OFDM-STBC (Ch. 22), composed into the transmit / receive chain of 5G NR. The ARQ-DMT tells us what the pipeline could achieve; Chapter 21 tells us what it actually does and where the gap comes from. Particular topics for Ch. 21: how rate matching interacts with MCS-adaptation at the scheduler level, how frequency-selective scheduling on an OFDM-subcarrier basis further decorrelates HARQ rounds, and how link adaptation (outer-loop link adaptation, OLLA) adjusts the MCS target based on the HARQ residual error rate.
HARQ Process
A state machine tracking the current transmission status of a single transport block in LTE/NR. Up to processes operate in parallel per UE per direction, each carrying its own soft buffer, NDI, RV, and TBS. Enables pipelined transmission without head-of-line blocking while preserving stop-and-wait semantics per process.
Related: 5G NR HARQ Process, Incremental Redundancy (IR-HARQ), Redundancy Version (RV), Transport Block
Redundancy Version (RV)
An integer index specifying the starting offset of the circular-buffer fragment transmitted in a given HARQ round (3GPP TS 38.212 Β§5.4.2). RV_0 includes the systematic bits of the LDPC codeword and is typically transmitted first; RV_2 is chosen from the far side of the buffer for maximum incremental-redundancy coverage on the second round.
Related: Incremental Redundancy (IR-HARQ), 5G NR HARQ Process, Circular Buffer, Rate Matching