The Overhead-Accuracy Tradeoff

Net Throughput = Rate × (1 − Overhead) × CSI-Gain

We have seen how to estimate G\mathbf{G} with different overheads. The next question, genuinely practical: given a fixed coherence budget TT, what is the optimal τp\tau_p? Too few pilots leaves G\mathbf{G} poorly known, eroding the coherent beamforming gain. Too many pilots leaves little time for data. The optimum balances estimation error against pilot cost, and its solution — a function of SNR, NN, and TT — determines the realizable throughput of any RIS deployment.

Theorem: SNR Loss from Imperfect CSI

Suppose the DFT-codebook estimator produces G^\hat{\mathbf{G}} with normalized error ϵCSI2=G~F2/GF2\epsilon_{\text{CSI}}^2 = \|\tilde{\mathbf{G}}\|_F^2 / \|\mathbf{G}\|_F^2. The beamformer Φ\boldsymbol{\Phi}^\star designed from G^\hat{\mathbf{G}} (matched filter on the estimate) achieves expected SNR

E[SNR]  =  SNRideal(1ϵCSI2),\mathbb{E}[\text{SNR}] \;=\; \text{SNR}^{\text{ideal}} \cdot (1 - \epsilon_{\text{CSI}}^2),

where SNRideal=PtN2/σ2\text{SNR}^{\text{ideal}} = P_t N^2 / \sigma^2 is the perfect-CSI coherent SNR. For small CSI error, the coherent-gain penalty is approximately ϵCSI2N2\epsilon_{\text{CSI}}^2 \cdot N^2 — the N2N^2 gain is eroded by the CSI quality.

If we use the channel estimate G^\hat{\mathbf{G}} to design the RIS phases, the actual received signal sees the true G\mathbf{G}, not the estimate. The mismatch is quantified by the CSI error G~=GG^\tilde{\mathbf{G}} = \mathbf{G} - \hat{\mathbf{G}}. The mean-coherent beamforming gain scales as (Nerror penalty)2(N - \text{error penalty})^2. For small error, the penalty is linear in error, and the effective NN becomes Neff=N(1ϵCSI2)N_{\text{eff}} = N (1 - \epsilon_{\text{CSI}}^2).

Theorem: Optimal Pilot Length Under Pilot-Power Budget

Under the DFT-codebook protocol with per-slot pilot energy PtTsP_t T_s and coherence block TT, the effective throughput is

Reff(τp)=(1τpT)log2 ⁣(1+SNRideal(1Ntσ2τpPt)).R_{\text{eff}}(\tau_p) = \left(1 - \frac{\tau_p}{T}\right) \log_2\!\left(1 + \text{SNR}^{\text{ideal}} \left(1 - \frac{N_t \sigma^2}{\tau_p P_t}\right)\right).

The optimal τp\tau_p^\star satisfies (approximately, at high SNR)

τpNtσ2TPtln2,\tau_p^\star \approx \sqrt{\frac{N_t\, \sigma^2\, T}{P_t\, \ln 2}},

i.e., grows as T/SNR\sqrt{T / \text{SNR}}, not linearly in NN — good news for large-NN deployments where CSI error is the first-order concern rather than pilot count.

Given a fixed total energy (pilot power times pilot time == constant), increasing τp\tau_p reduces per-element MSE at a 1/τp1/\tau_p rate but reduces data time at a 1τp/T1 - \tau_p/T rate. Differentiating the product reveals an interior optimum.

Effective Throughput vs. Pilot Length

Plot Reff(τp)R_{\text{eff}}(\tau_p) for DFT codebook and compare with ideal-CSI throughput. The interior optimum shows the pilot-vs-data tradeoff explicitly. Increase TT to see the optimum τp\tau_p grow; increase SNR and the optimum decreases.

Parameters
256
8
10
500

Example: Optimal Pilot Length for a Mid-Band Deployment

Deploy N=256N = 256 RIS at 3.5 GHz with Tc=500T_c = 500 symbols (roughly 5 ms5\text{ ms} at 10μs10\,\mu\text{s} symbol duration, appropriate for pedestrian mobility). BS antennas Nt=8N_t = 8, pilot SNR = 10 dB10\text{ dB}. Find the optimal τp\tau_p and the resulting fraction of coherent gain retained.

Key Takeaway

Optimal pilot length scales as T/SNR\sqrt{T/\text{SNR}}, not linearly with NN. For reasonable operating conditions (SNR0 dB\text{SNR} \geq 0\text{ dB}, T500T \geq 500), the optimal pilot fraction is <10%< 10\% even for N=256N = 256. The O(N)\mathcal{O}(N) pilot overhead of the naive DFT-codebook protocol is a worst-case figure; the real-world optimum is much better once pilots are not required to hit the CRB exactly.

Common Mistake: Don't Evaluate RIS Gains With Perfect CSI Only

Mistake:

A paper reports a 40 dB40\text{ dB} SNR improvement from N=1024N = 1024 coherent RIS beamforming — under perfect CSI.

Correction:

Under perfect CSI, the N2N^2 gain is real. Under imperfect CSI with realistic pilot overhead and finite pilot SNR, the effective gain is N2(1ϵCSI2)(1τp/T)N^2 (1 - \epsilon_{\text{CSI}}^2)(1 - \tau_p/T), often 101015 dB15\text{ dB} smaller than the perfect-CSI figure. Any RIS paper claiming dB gains without specifying the CSI assumption should be read with deep suspicion. Always report effective throughput ReffR_{\text{eff}} under a realistic pilot protocol.

Quick Check

For N=256N = 256-element RIS with pilot SNR =10 dB= 10\text{ dB}, Nt=8N_t = 8 BS antennas, and T=500T = 500 symbol coherence block, the optimal pilot length τp\tau_p^\star under DFT codebook is approximately:

256\sim 256 (= N)

60\sim 60-8080

20\sim 20-2525

5\sim 5

Cascaded Channel

The two-hop composite channel G=diag(h2)H1CN×Nt\mathbf{G} = \text{diag}(\mathbf{h}_2^*)\mathbf{H}_1 \in \mathbb{C}^{N \times N_t} between the BS and the RIS elements, which includes the effect of the RIS-UE channel h2\mathbf{h}_2. This is the estimable object; H1\mathbf{H}_1 and h2\mathbf{h}_2 are not separately identifiable from passive-RIS pilot observations.

Related: Cascaded Channel Ch1, Identifiability

DFT Codebook

A pilot-design scheme in which the RIS applies phase shifts corresponding to the columns of the N×NN \times N DFT matrix, cycling through NN orthogonal configurations across NN pilot slots. Achieves the minimum MSE for a given total pilot energy and is the practical default for RIS channel estimation.

Related: Pilot Design, The ON/OFF Subset-Switching Protocol

🚨Critical Engineering Note

CSI Budget in Practice

Practical CSI guidelines for real RIS deployments:

  1. Start from the coherence time. Mobile pedestrians at 1m/s1\,\text{m/s} at 3.5 GHz have coherence time 30ms\sim 30\,\text{ms}T3000T \sim 3000 symbols at 10μs10\,\mu\text{s} symbols. Fixed users have T100msT \sim 100\,\text{ms} or more.
  2. Scale pilot overhead by T\sqrt{T}. Optimal τp\tau_p grows with the square root of the coherence budget — large NN deployments are most profitable for slowly-varying users.
  3. Use compressed sensing for mobile scenarios. For T<500T < 500, CS brings τp\tau_p below the T\sqrt{T} square-root scaling by exploiting angular sparsity.
  4. Reserve a retraining budget. RIS channel statistics drift slowly (temperature, hardware aging); plan periodic long-pilot sessions (every few seconds) to track the slow component, interleaved with fast CS-based updates on the fast component.
Practical Constraints
  • Fixed-wireless access (FWA): T105T \sim 10^5 symbols, τp30\tau_p \sim 30 symbols (0.03% overhead).

  • Pedestrian mobility: T3000T \sim 3000, τp20\tau_p \sim 20 (0.7% overhead).

  • Vehicular mobility: T200T \sim 200, τp10\tau_p \geq 10 with CS; overhead >5%> 5\% likely.