CSIT Acquisition and the Pilot Overhead Penalty

The Pilot Cost Is Quadratic in Antennas

On a block-fading channel, the transmitter does not know hk\mathbf{h}_k a priori. To steer beams it must estimate them β€” and estimation costs channel uses. The standard protocol: each coherence block begins with a pilot phase of Ο„\tau channel uses where users send known training symbols, followed by a data phase of Tcβˆ’Ο„T_c - \tau channel uses where the transmitter uses the estimated channel to precode.

The catch: to estimate KK user channels each of dimension LL, one typically needs Ο„β‰₯L\tau \geq L pilots. As LL grows β€” say in a massive MIMO regime β€” the pilot phase consumes a growing fraction of the coherence block. At Ο„=Lβ‰ˆTc\tau = L \approx T_c, there is essentially no time left for data. This is the pilot wall: effective spatial DoF degrades as L(1βˆ’L/Tc)+L(1 - L/T_c)_+, peaking at L=Tc/2L = T_c/2.

Coded caching doesn't face this problem. Cache contents are pre-placed; the delivery phase uses a mix of cached bits and XOR messages that don't require real-time CSIT. The effective caching gain tt is pilot-free.

Theorem: Effective DoF with Pilot Overhead

For the cache-aided fading BC with coherence block TcT_c, antennas LL, and pilot allocation Ο„=L\tau = L (the minimum for full CSIT), the effective DoF per coherence block is DoFeff(Tc)β€…β€Š=β€…β€Št+L(1βˆ’LTc)+.\mathrm{DoF}_{\text{eff}}(T_c) \;=\; t + L\left(1 - \frac{L}{T_c}\right)_+ . This DoF is maximized over LL by choosing Lβˆ—=Tc/2L^* = T_c/2, yielding spatial DoF Tc/4T_c/4.

Fraction Ο„/Tc=L/Tc\tau/T_c = L/T_c of each coherence block is consumed by pilots; the remaining fraction (1βˆ’L/Tc)(1 - L/T_c) carries data. The spatial multiplexing gain LL is effective only during data. Coded caching uses all TcT_c channel uses (placement is off-coherence-block; cached bits are always usable).

,

Effective DoF vs Coherence Block Length

Plot the effective DoF as a function of the coherence block length TcT_c, for fixed KK, LL, and memory ratio. Three curves: (1) blue cache + MIMO with pilot cost; (2) red dashed pure MIMO (t=0t = 0); (3) green dotted pure caching (CSIT-free). At small TcT_c (high mobility, mmWave), the cache-aided curve approaches the green (CSIT-free) floor; at large TcT_c (quasi-static), it approaches the full t+Lt + L.

Parameters
20
4
0.2

Example: mmWave vs Sub-6 GHz DoF

Compare the effective DoF for two 5G-NR-like scenarios: (a) Sub-6 GHz, Tc=1000T_c = 1000, L=8L = 8, K=50K = 50, ΞΌ=0.1\mu = 0.1. (b) mmWave, Tc=100T_c = 100, L=32L = 32, K=50K = 50, ΞΌ=0.1\mu = 0.1. Both have the same caching gain t=KM/N=5t = KM/N = 5.

Implication for Massive MIMO

The pilot-overhead analysis bears on the "more antennas = more gain" narrative of massive MIMO. For a fixed coherence block TcT_c, adding antennas beyond Lβˆ—=Tc/2L^* = T_c/2 hurts spatial DoF. This is a hard limit of TDD operation with pilot-based estimation.

But the picture changes when coded caching is added. Each extra user KK (with cache) contributes to the aggregate cache KM/NKM/N and hence to the caching gain tt. This gain is not subject to pilot overhead. If the deployment is cache-rich, adding users can compensate for the pilot wall on the spatial side. This is a subtle design point: caching lets us decouple antenna count from CSIT overhead.

Common Mistake: Do Not Confuse Coherence Block with Coherence Time

Mistake:

Using TcT_c in "channel uses" interchangeably with TcohT_{\text{coh}} in "seconds".

Correction:

TcT_c (coherence block length) is measured in channel uses and equals Tcohβ‹…WsT_{\text{coh}} \cdot W_s where TcohT_{\text{coh}} is coherence time (seconds) and WsW_s is the symbol rate (symbols per second). Similarly, TcT_c can be interpreted as Bc/Ξ”fB_c / \Delta f for wideband systems. The formulas of this chapter use TcT_c in channel uses.

A 10 ms coherence time at 100 kBaud is Tc=103T_c = 10^3; the same 10 ms at 10 MBaud is Tc=105T_c = 10^5. These are very different regimes for pilot-overhead analysis.

⚠️Engineering Note

Pilot Design in 5G NR

In 5G NR, pilot design is a nuanced tradeoff:

  1. DMRS (Demodulation Reference Signal). User-specific pilots for coherent demodulation. Overhead: 1-2 OFDM symbols per slot.
  2. SRS (Sounding Reference Signal). Uplink pilots for CSIT acquisition (TDD reciprocity). Periodic; 10-160 ms intervals.
  3. CSI-RS. Downlink CSI measurement in FDD; feedback to BS via PUCCH.
  4. Massive MIMO constraints. Pilot contamination (Marzetta 2010) when nearby cells reuse pilots; bounds per-user rate.

For cache-aided systems, pilot design must balance the usual MU-MIMO tradeoffs with the caching gain. A common design: reserve a smaller pilot allocation than the fully SU-optimal choice, trading a small spatial DoF loss for reduced overhead. The cache-aided Lampiris- Caire scheme tolerates this well because the caching component tt is pilot-insensitive.

Production 5G gNBs handle 4-8 DMRS ports per slot; mmWave mMIMO systems (e.g., 64+ ports) use hybrid beamforming to reduce effective pilot dimensionality.

Practical Constraints
  • β€’

    5G NR DMRS: 1-2 OFDM symbols per 14-symbol slot (7-14% overhead)

  • β€’

    Type II CSI feedback: up to 64 bits per reporting instance

  • β€’

    SRS periodicity: 5-160 ms (vs coherence time of 1-10 ms at 100 km/h, 2 GHz)

  • β€’

    Pilot contamination limits per-cell effective L to ~30 even with 100+ physical antennas