Type I and Type II Codebooks

Why 5G NR Has Two Codebooks

In an FDD cell, the downlink channel cannot be reconstructed from uplink measurements β€” the UE must feed back a description of it. The feedback is necessarily a compressed version of the true channel because bit budgets are tight: a few tens to a few hundred bits per reporting instance. The design problem is what to quantize: the full channel vector? a beam direction? a set of DFT basis coefficients?

5G NR answers this differently for two operating points. For simple single-user deployments and cell-edge coverage, Type I codebooks quantize a single DFT beam plus a phase term β€” cheap, compatible with older hardware, and sufficient for SU-MIMO. For multi-user spatial multiplexing with 4-16 users sharing the same resource block, Type II codebooks quantize a linear combination of L∈{2,3,4}L \in \{2,3,4\} DFT beams, giving much higher precoder resolution at the cost of 5-10x more feedback bits. Rel-17 eType II and Rel-18 ML-based codebooks push the resolution-versus-overhead trade-off further.

Definition:

Type I Single-Panel Codebook

A Type I single-panel codebook quantizes the precoder as a product of two beam indices i1=(i1,1,i1,2)i_1 = (i_{1,1}, i_{1,2}) and one co-phasing index i2i_2: v(i1,i2)=12[vi1,1,i1,2Ο•i2 vi1,1,i1,2],\mathbf{v}(i_1, i_2) = \frac{1}{\sqrt{2}} \begin{bmatrix} \mathbf{v}_{i_{1,1}, i_{1,2}} \\ \phi_{i_2}\,\mathbf{v}_{i_{1,1}, i_{1,2}} \end{bmatrix}, where vi1,1,i1,2\mathbf{v}_{i_{1,1}, i_{1,2}} is a 2D DFT steering vector indexed over an oversampled (O1N1)Γ—(O2N2)(O_1 N_1) \times (O_2 N_2) grid of azimuth/elevation beams, and Ο•i2∈{1,j,βˆ’1,βˆ’j}\phi_{i_2} \in \{1, j, -1, -j\} is a QPSK-valued cross-polarization phase correction. The total feedback payload is BTypeΒ I=⌈log⁑2(O1N1)βŒ‰+⌈log⁑2(O2N2)βŒ‰+log⁑24+O(1)B_{\text{Type I}} = \lceil \log_2(O_1 N_1) \rceil + \lceil \log_2(O_2 N_2) \rceil + \log_2 4 + O(1) which for a 32-port CSI-RS with N1=4N_1 = 4, N2=4N_2 = 4, oversampling O1=O2=4O_1 = O_2 = 4 is about 1212-1414 bits per precoder report.

The factor of 12\frac{1}{\sqrt 2} and the block-structured form reflect the fact that NR CSI-RS ports are typically organized in pairs of cross-polarized elements β€” each "port pair" is a dual-polarized antenna. The precoder decomposition is into a beam direction plus a polarization co-phasing.

,

Definition:

Type II Linear-Combination Codebook

A Type II codebook represents the precoder as a linear combination of LL orthogonal DFT beams: vTypeΒ II=βˆ‘β„“=1Lcℓ vi1,β„“,\mathbf{v}^{\text{Type II}} = \sum_{\ell=1}^{L} c_\ell\,\mathbf{v}_{i_{1,\ell}}, where the beam indices {i1,β„“}β„“=1L\{i_{1,\ell}\}_{\ell=1}^L are reported once per wideband report, and the complex combining coefficients {cβ„“}\{c_\ell\} (magnitude 3 bits, phase 8-PSK or 16-PSK) are reported per subband. The parameter L∈{2,3,4}L \in \{2, 3, 4\} controls the precoder resolution.

Because LL beams can reproduce any angle within the beam-grid span via their convex combinations, Type II approaches Grassmannian optimality as LL grows β€” at the cost of feedback overhead that scales as BTypeΒ IIβ‰ˆLlog⁑2(O1N1O2N2)+Lβ‹…(bmag+bph)β‹…NSB,B_{\text{Type II}} \approx L \log_2(O_1 N_1 O_2 N_2) + L \cdot (b_{\text{mag}} + b_{\text{ph}}) \cdot N_{\text{SB}}, where NSBN_{\text{SB}} is the number of subbands (typically 10-30) and bmag+bph=3+4=7b_{\text{mag}} + b_{\text{ph}} = 3 + 4 = 7 bits per coefficient. A typical Rel-15 Type II report is 300-800 bits.

,

Theorem: Type II Sum Rate Gap to Ideal Precoding

Consider an MU-MIMO cell with an NtN_t-element array serving KK users via ZF precoding. If each user feeds back a Type II precoder with LL beams, the per-user rate gap to ideal (unquantized) ZF precoding satisfies Rkidealβˆ’RkTypeΒ II=log⁑2 ⁣(1+SNRΟƒ2 E[sin⁑2ΞΈL])+O(SNRβˆ’1),R_k^{\text{ideal}} - R_k^{\text{Type II}} = \log_2\!\left(1 + \frac{\text{SNR}}{\sigma^2}\, \mathbb{E}\left[\sin^2 \theta_L\right]\right) + O(\text{SNR}^{-1}), where ΞΈL\theta_L is the angle between the true channel direction and the best LL-beam linear combination in the codebook. For random channels under the one-ring model, E[sin⁑2ΞΈL]∝Lβˆ’2\mathbb{E}[\sin^2 \theta_L] \propto L^{-2}, so the rate gap shrinks quadratically with LL.

A Type II codebook approximates the true channel by its projection onto the subspace spanned by the LL chosen DFT beams. The residual (what the projection misses) is what causes inter-user interference after ZF precoding. As LL grows, the subspace becomes richer and the residual shrinks β€” but the quadratic decay in sin⁑2ΞΈL\sin^2 \theta_L hits diminishing returns beyond L=4L = 4 for typical channels.

,

Type I vs Type II: Sum Rate and Feedback Payload

Sum rate achieved by Type I and Type II codebooks versus the user count KK and the Type II beam parameter LL. The feedback payload is shown on a secondary axis; Type II rates approach ideal-CSI at a feedback cost 5-10x that of Type I.

Parameters
4
4
15
32

Example: Feedback Payload for Rel-15 Type II

A 32-port CSI-RS cell uses Rel-15 Type II codebook with L=4L = 4 beams, NSB=13N_{\text{SB}} = 13 subbands (for a 20 MHz allocation at ΞΌ=1\mu = 1), and wideband amplitude (3 bits) plus 8-PSK phase (3 bits) per coefficient. Compute the CSI report payload in bits.

Definition:

eType II (Rel-17 Enhanced Type II)

Rel-17 introduces eType II, which applies a DCT-like compression along the frequency dimension to the Type II coefficient matrix. The original LΓ—NSBL \times N_{\text{SB}} coefficient grid is written as C∈CLΓ—NSB\mathbf{C} \in \mathbb{C}^{L \times N_{\text{SB}}} and approximated by its MM-term principal components: Cβ‰ˆUL D VH,\mathbf{C} \approx \mathbf{U}_L\,\mathbf{D}\,\mathbf{V}^H, where UL\mathbf{U}_L holds the spatial beams (as in Type II) and V∈CNSBΓ—M\mathbf{V} \in \mathbb{C}^{N_{\text{SB}} \times M} holds M≀NSB/2M \leq N_{\text{SB}}/2 frequency-domain DFT basis vectors. Only the coefficients of the MM principal components are reported, yielding a feedback compression factor of NSB/Mβ‰ˆ2N_{\text{SB}}/M \approx 2-4Γ—4\times.

A Rel-17 eType II payload is about 40-60% of Rel-15 Type II for the same rate performance, and the per-subband coefficient reporting is replaced by a frequency-basis reporting scheme.

5G NR Codebook Families

FeatureType IType II (Rel-15)eType II (Rel-17)ML-based (Rel-18+)
Precoder formSingle DFT beam + QPSK co-phasingLinear combination of LL DFT beamsCompressed Type II via frequency DCTAutoencoder-decoder CsiNet style
Typical LL1 (implicit)2, 3, 42, 3, 4, 6N/A (latent dim)
Feedback payload10-14 bits300-800 bits150-400 bits50-200 bits (learned)
Target use caseSU-MIMO, cell edgeMU-MIMO up to 8 usersMU-MIMO up to 12 usersResearch; AI-driven UEs
Rate vs ideal ZF gap3-5 dB1-2 dB1-1.5 dB<< 1 dB claimed
First releaseRel-15Rel-15Rel-17Rel-18 study, Rel-19 spec
UE complexityLowMediumMedium-highHigh (NN inference)
πŸŽ“CommIT Contribution(2013)

JSDM as an Alternative to High-Overhead FDD Codebooks

A. Adhikary, J. Nam, J.-Y. Ahn, G. Caire β€” IEEE Transactions on Information Theory, vol. 59, no. 10, pp. 6441-6463

The CommIT group's 2013 JSDM paper (treated in Chapters 7 and 8 of this book) pre-dates the 5G NR codebook design by five years, but offers a structurally different approach to FDD massive MIMO. Rather than quantize the channel in a fixed DFT codebook, JSDM exploits the long-term statistics of the channel to pre-beamform the CSI-RS into a reduced-dimension effective channel, and then applies a small feedback on the reduced channel. The result is FDD with feedback overhead that scales as O(K)O(K) rather than O(Nt)O(N_t).

Commercial NR Type II codebooks reach a similar operating point by a different route β€” a fixed DFT basis with linear combination β€” but the spirit is the same: compress the channel via structure rather than via raw quantization. eType II's frequency compression explicitly uses statistical structure (the frequency correlation function) that JSDM used in the spatial dimension. The NR community is effectively revisiting the JSDM philosophy with each new release.

jsdmfddcsi-feedbackcodebookView Paper β†’

Common Mistake: Type II Beams Are Not Data Streams

Mistake:

A common error is to interpret the Type II parameter LL as the maximum number of data streams the BS can transmit to the user.

Correction:

LL is the number of DFT basis vectors used to represent the precoder as a linear combination β€” not the number of data streams. The number of streams is the rank indicator (RI), which is separately fed back and takes values in {1,2,3,4}\{1, 2, 3, 4\} (Rel-15 Type II only supports rank 1 and 2, extended to rank 4 in Rel-16). Confusing these two concepts leads to massively overestimated feedback payloads and a misreading of the codebook design.

Historical Note: Type II: The Codebook Debate of 2017

2017-2018

During Rel-15 standardization in 2017, the introduction of Type II was contentious. One camp argued that Type I was sufficient for MU- MIMO at the overhead budget available, and that Type II's 20x payload was wasted on marginal SINR gains. The other camp ran field simulations showing that at K=8K = 8-1616 with 32-port CSI-RS, Type I left 3-5 dB of SINR on the table compared to ideal ZF β€” a gap too large to ignore at the beginning of the 5G deployment era. After heated debate at the August 2017 RAN1 meeting in Berlin, both codebooks were standardized as optional UE capabilities, with Type II explicitly targeted at multi-user deployments.

PMI (Precoding Matrix Indicator)

The index in the UE's CSI report that identifies the recommended downlink precoder from the configured codebook. For Type I it is a tuple of beam-grid indices; for Type II it is a compressed description of the beam combination and coefficient matrix.

Related: Channel Quality Indicator (CQI), Ri, CSI-RS Overhead Scaling with Ports and Periodicity

CSI Subband

A contiguous set of resource blocks over which the UE reports one CSI measurement. For a 20 MHz carrier at ΞΌ=1\mu = 1, the typical subband size is 8 RBs, giving β‰ˆ13\approx 13 subbands per component carrier. Wideband reports average across all subbands.

Related: CSI-RS Overhead Scaling with Ports and Periodicity, Beamforming Codebook and PMI

Quick Check

Which is the primary reason Type II codebook reports are 5-10x larger than Type I reports?

Type II uses more beam indices

Type II reports linear-combination coefficients per subband

Type II is for FDD only

Type II requires more CSI-RS ports