The Passive-RIS Estimation Challenge

The Optimizer Needs to Know the Channel

Every optimization result in Chapters 5–15 β€” the N2N^2 coherent gain, the SDR solution, alternating-optimization convergence β€” is stated conditional on known channel state information (CSI). The controller chooses Ξ¦\boldsymbol{\Phi} based on H1\mathbf{H}_1 and H2\mathbf{H}_2; if the channels are unknown, the only defensible choice is random phases, which gives O(N)\mathcal{O}(N) scaling instead of O(N2)\mathcal{O}(N^2) β€” a factor of NN loss. The entire value proposition of RIS hinges on CSI.

But the RIS is passive. It cannot transmit pilots, it cannot correlate received waveforms, it cannot even know whether it is currently being illuminated. The controller must estimate H1\mathbf{H}_1 and h2\mathbf{h}_2 using only signals observed at the BS or the UE, with the RIS itself a passive multiplier. This chapter develops the three workhorse protocols (ON/OFF switching, DFT codebooks, compressed sensing) and derives their overhead-accuracy tradeoffs.

Definition:

The Observable Cascaded Channel

Consider an uplink pilot scheme where the UE transmits pilots xtx_t (t=1,…,Ο„pt = 1, \ldots, \tau_p) and the BS receives

yt=(hd+H1HΦ(t)h2)xt+wt,\mathbf{y}_t = \big(\mathbf{h}_d + \mathbf{H}_1^H \boldsymbol{\Phi}^{(t)} \mathbf{h}_2\big) x_t + \mathbf{w}_t,

where Ξ¦(t)\boldsymbol{\Phi}^{(t)} is the RIS configuration during pilot slot tt. Expanding the RIS term with the diagonal identity,

H1HΞ¦(t)h2=(diag(h2βˆ—)H1)HΟ•(t)=GHΟ•(t),\mathbf{H}_1^H \boldsymbol{\Phi}^{(t)} \mathbf{h}_2 = \big(\text{diag}(\mathbf{h}_2^*)\mathbf{H}_1\big)^H \boldsymbol{\phi}^{(t)} = \mathbf{G}^H \boldsymbol{\phi}^{(t)},

where G=diag(h2βˆ—)H1∈CNΓ—Nt\mathbf{G} = \text{diag}(\mathbf{h}_2^*)\mathbf{H}_1 \in \mathbb{C}^{N \times N_t} is the cascaded channel matrix. The received signal becomes

yt=(hd+GHΟ•(t))xt+wt.\mathbf{y}_t = \big(\mathbf{h}_d + \mathbf{G}^H \boldsymbol{\phi}^{(t)}\big) x_t + \mathbf{w}_t.

The cascaded matrix G\mathbf{G} is what the BS can estimate, not H1\mathbf{H}_1 and h2\mathbf{h}_2 separately. This is the central observation: the RIS introduces an inherent rank-1 ambiguity β€” scaling h2\mathbf{h}_2 by Ξ±\alpha and H1\mathbf{H}_1 by 1/Ξ±1/\alpha leaves G\mathbf{G} unchanged. Fortunately, the BS optimization only needs G\mathbf{G}, so the ambiguity does not matter for beamforming.

Key Takeaway

The estimable object is G=diag(h2βˆ—)H1\mathbf{G} = \text{diag}(\mathbf{h}_2^*)\mathbf{H}_1 β€” not H1\mathbf{H}_1 and h2\mathbf{h}_2 separately. The RIS optimization depends only on G\mathbf{G}; the fact that we cannot separate the two hops is harmless. G\mathbf{G} has NNtN N_t unknowns β€” the scale of the estimation problem.

Theorem: Minimum Pilot Length for Cascaded-Channel Estimation

Let Y∈CNtΓ—Ο„p\mathbf{Y} \in \mathbb{C}^{N_t \times \tau_p} collect Ο„p\tau_p pilot-slot observations. Assuming hd\mathbf{h}_d is either known or separately estimated, the cascaded channel G∈CNΓ—Nt\mathbf{G} \in \mathbb{C}^{N \times N_t} can be uniquely recovered from Y\mathbf{Y} by least squares if and only if the matrix of stacked RIS configurations Ξ¦stack=[Ο•(1),…,Ο•(Ο„p)]∈CNΓ—Ο„p\boldsymbol{\Phi}^{\text{stack}} = [\boldsymbol{\phi}^{(1)}, \ldots, \boldsymbol{\phi}^{(\tau_p)}] \in \mathbb{C}^{N \times \tau_p} has row rank NN. In particular, Ο„pβ‰₯N\tau_p \geq N is necessary, and equality is achievable with an orthogonal configuration set.

Estimating G\mathbf{G} with NNtN N_t complex unknowns requires at least that many linearly independent measurements. If the BS has NtN_t antennas and each pilot slot generates NtN_t scalar observations (one per receive antenna), then we need at least NN pilot slots β€” one per RIS configuration.

The Central Tension

The Ο„pβ‰₯N\tau_p \geq N lower bound is the source of the most important practical concern in RIS: channel-estimation overhead scales linearly with the number of RIS elements. But the coherent SNR gain scales as N2N^2. So, per additional RIS element, we pay one pilot slot and gain (2N+1)/N2β†’0(2N+1)/N^2 \to 0 of the coherent gain. The return on investment in element count is diminishing once we account for estimation.

Two escape routes are developed below:

  1. Structured sparsity (compressed sensing, Section 4.4): if the channel has only Lβ‰ͺNL \ll N dominant paths, we need only O(Llog⁑N)\mathcal{O}(L \log N) pilots.
  2. Multi-user pilot reuse (Chapter 7): when multiple UEs share a BS-RIS link, the cost is amortized.

Common Mistake: Don't Try to Separate H1\mathbf{H}_1 and h2\mathbf{h}_2

Mistake:

A newcomer sets up the problem with NNt+NN N_t + N unknowns (H1\mathbf{H}_1's NNtN N_t entries plus h2\mathbf{h}_2's NN entries) and concludes Ο„pβ‰₯Nt+1\tau_p \geq N_t + 1 pilots suffice.

Correction:

The separation is impossible from the observable data alone: the product h2(n)βˆ—(H1)n,:\mathbf{h}_2^{(n)*} (\mathbf{H}_1)_{n,:} is identifiable, but multiplying h2\mathbf{h}_2 by Ξ±n\alpha_n and dividing the nn-th row of H1\mathbf{H}_1 by Ξ±nβˆ—\alpha_n^* leaves all observations unchanged. Only G\mathbf{G} is identifiable from passive-RIS pilots; the BS beamforming optimization needs only G\mathbf{G}, so the ambiguity is not an obstacle. Do not waste pilot resources trying to recover a separation that doesn't exist.

Pilot Timeline for a Single Coherence Block

Pilot Timeline for a Single Coherence Block
Within one coherence block of length TT, Ο„p\tau_p slots are spent on pilot transmission (with varying RIS configurations Ξ¦(1),…,Ξ¦(Ο„p)\boldsymbol{\Phi}^{(1)}, \ldots, \boldsymbol{\Phi}^{(\tau_p)}); the remaining Tβˆ’Ο„pT - \tau_p slots are used for data with the optimized Φ⋆\boldsymbol{\Phi}^\star. The effective data rate is scaled by (1βˆ’Ο„p/T)(1 - \tau_p/T).

Pilot Overhead Ο„p/T\tau_p / T vs. NN

Show how the pilot fraction grows with RIS size for three strategies: naive element-by-element (Ο„p=2N\tau_p = 2N), ON/OFF or DFT codebook (Ο„p=N\tau_p = N), and compressed sensing (Ο„p=O(Llog⁑N)\tau_p = \mathcal{O}(L \log N)). Change the sparsity LL to see the CS overhead curve shift.

Parameters
500
512
6