Channel Estimation for RIS

The Fundamental Challenge: Estimating Channels Through Passive Elements

In conventional MIMO systems, channel estimation is performed by transmitting known pilot symbols and measuring the received signal. An RIS, however, is purely passive β€” it has no RF chains, no ADCs, and no baseband processing. It can neither transmit pilots nor receive and process them. This creates a fundamental asymmetry: the BS-RIS channel G\mathbf{G} and the RIS-user channel hr\mathbf{h}_r cannot be estimated separately through standard pilot-based methods.

What can be observed at the receiver is only the cascaded channel hrHΘG\mathbf{h}_r^H \boldsymbol{\Theta} \mathbf{G}, which is a function of the RIS configuration Θ\boldsymbol{\Theta}. By varying Θ\boldsymbol{\Theta} across multiple pilot slots and observing the corresponding received signals, the receiver can extract information about the individual channels. The key question is: how many pilot slots are needed, and how should Θ\boldsymbol{\Theta} be varied to enable efficient estimation?

Definition:

RIS Channel Estimation Problem

Consider TT pilot slots during which the BS transmits known pilots {xt}t=1T\{x_t\}_{t=1}^T and the RIS applies configurations {Θt}t=1T\{\boldsymbol{\Theta}_t\}_{t=1}^T. The received signal at pilot slot tt is:

yt=(hdH+hrHΘtG)wtxt+nt,t=1,…,Ty_t = (\mathbf{h}_d^H + \mathbf{h}_r^H \boldsymbol{\Theta}_t \mathbf{G}) \mathbf{w}_t x_t + n_t, \quad t = 1, \ldots, T

For a single BS antenna (M=1M = 1) and denoting the cascaded channel coefficients vn=hr,nβˆ—gnv_n = h_{r,n}^* g_n (where gng_n is the nn-th element of the BS-RIS channel vector), the model simplifies to:

yt=hdxt+Ο•tTv xt+nty_t = h_d x_t + \boldsymbol{\phi}_t^T \mathbf{v} \, x_t + n_t

where Ο•t=[Ο•t,1,…,Ο•t,N]T\boldsymbol{\phi}_t = [\phi_{t,1}, \ldots, \phi_{t,N}]^T is the RIS phase vector at slot tt and v=[v1,…,vN]T\mathbf{v} = [v_1, \ldots, v_N]^T.

Stacking all TT observations (with xt=1x_t = 1 for simplicity):

y=Ξ¦[hdv]+n\mathbf{y} = \boldsymbol{\Phi} \begin{bmatrix} h_d \\ \mathbf{v} \end{bmatrix} + \mathbf{n}

where Φ∈CTΓ—(N+1)\boldsymbol{\Phi} \in \mathbb{C}^{T \times (N+1)} has rows [1,Ο•tT][1, \boldsymbol{\phi}_t^T]. The channel estimation problem is to recover the (N+1)(N+1)-dimensional vector [hd;v][h_d; \mathbf{v}] from TT noisy observations.

Identifiability requirement: The matrix Ξ¦\boldsymbol{\Phi} must have rank N+1N + 1, which requires Tβ‰₯N+1T \geq N + 1 pilot slots. This is the fundamental training overhead of RIS channel estimation.

For MIMO BS with MM antennas, the cascaded channel has NMNM unknowns (plus MM for the direct channel), requiring Tβ‰₯NM+MT \geq NM + M pilot slots in the worst case. This overhead can be prohibitive for large NN and motivates the structured estimation approaches described below.

,

Theorem: Training Overhead Lower Bound for RIS Channel Estimation

For an RIS-assisted system with NN reflecting elements, MM BS antennas, and a single-antenna user, any unbiased estimator of the cascaded channel vn=hr,nβˆ—gn∈CM\mathbf{v}_n = h_{r,n}^* \mathbf{g}_n \in \mathbb{C}^M for all n=1,…,Nn = 1, \ldots, N requires at least

Tβ‰₯N+1T \geq N + 1

pilot training slots (for M=1M = 1), or more generally

Tβ‰₯⌈NM+MMβŒ‰=N+1T \geq \left\lceil \frac{NM + M}{M} \right\rceil = N + 1

pilot time slots when the BS can transmit MM orthogonal pilots per slot. The minimum number of total pilot symbols is NM+M=M(N+1)NM + M = M(N + 1).

Furthermore, the Cram'{e}r-Rao lower bound (CRLB) for the mean squared error of the cascaded channel estimate is:

MSEβ‰₯(N+1)Οƒ2TP\mathrm{MSE} \geq \frac{(N+1)\sigma^2}{TP}

when Ξ¦\boldsymbol{\Phi} is a (N+1)Γ—(N+1)(N+1) \times (N+1) unitary matrix (e.g., a DFT matrix) and PP is the pilot transmit power.

The cascaded channel has NN unknown complex coefficients (one per RIS element), plus the direct channel. Each distinct RIS configuration provides one independent linear measurement of these unknowns. Therefore, at least N+1N + 1 distinct configurations are needed. The DFT-based design achieves the CRLB because the DFT matrix is unitary, providing maximally spread measurements.

,

Grouped Element Channel Estimation

Input: Number of elements NN, group size GG, pilot power PP, noise variance Οƒ2\sigma^2
Output: Estimated cascaded channel v^\hat{\mathbf{v}}
1. Set NΛ‰=⌈N/GβŒ‰\bar{N} = \lceil N/G \rceil (number of groups)
2. Design phase configurations:
- Construct (Nˉ+1)×(Nˉ+1)(\bar{N}+1) \times (\bar{N}+1) DFT matrix F\mathbf{F}
- For slot t=1,…,NΛ‰+1t = 1, \ldots, \bar{N}+1:
- Set group kk's phase: Ο•Λ‰t,k=Ft,k\bar{\phi}_{t,k} = F_{t,k} for k=1,…,NΛ‰k = 1, \ldots, \bar{N}
- All elements in group kk use phase Ο•Λ‰t,k\bar{\phi}_{t,k}:
Ο•t,n=Ο•Λ‰t,⌈n/GβŒ‰\phi_{t,n} = \bar{\phi}_{t,\lceil n/G \rceil} for n=1,…,Nn = 1, \ldots, N
3. Collect observations: transmit pilots with each configuration
yt=hd+βˆ‘k=1NΛ‰Ο•Λ‰t,kvΛ‰k+nty_t = h_d + \sum_{k=1}^{\bar{N}} \bar{\phi}_{t,k} \bar{v}_k + n_t
where vΛ‰k=βˆ‘n∈Gkvn\bar{v}_k = \sum_{n \in \mathcal{G}_k} v_n is the grouped channel
4. Estimate grouped channel:
Ξ·Λ‰^=(Ξ¦Λ‰HΞ¦Λ‰)βˆ’1Ξ¦Λ‰Hy\hat{\bar{\boldsymbol{\eta}}} = (\bar{\boldsymbol{\Phi}}^H \bar{\boldsymbol{\Phi}})^{-1} \bar{\boldsymbol{\Phi}}^H \mathbf{y}
(least squares estimate of [hd,vΛ‰1,…,vΛ‰NΛ‰]T[h_d, \bar{v}_1, \ldots, \bar{v}_{\bar{N}}]^T)
5. Reconstruct element-level channel: Assign v^n=vΛ‰^⌈n/GβŒ‰/G\hat{v}_n = \hat{\bar{v}}_{\lceil n/G \rceil} / G
for n=1,…,Nn = 1, \ldots, N (uniform allocation within each group)
6. Return v^=[v^1,…,v^N]T\hat{\mathbf{v}} = [\hat{v}_1, \ldots, \hat{v}_N]^T
Complexity: O(Nˉ2)=O(N2/G2)O(\bar{N}^2) = O(N^2/G^2) for least squares.
Training overhead: NΛ‰+1=⌈N/GβŒ‰+1\bar{N} + 1 = \lceil N/G \rceil + 1 pilot slots.

Advanced Channel Estimation Strategies

Beyond the basic ON/OFF and grouped estimation protocols, several advanced strategies exploit channel structure:

1. Codebook-based estimation with DFT patterns. The RIS cycles through a codebook of TT phase configurations, typically drawn from a DFT matrix. If T=N+1T = N + 1, the full cascaded channel can be recovered. For T<N+1T < N + 1, the system operates in a compressed regime.

2. Compressed sensing / sparse recovery. In mmWave/sub-THz bands, both the BS-RIS and RIS-user channels exhibit angular sparsity: only a few dominant paths exist. If the cascaded channel v\mathbf{v} is SS-sparse in the angular domain (i.e., v=Adicts\mathbf{v} = \mathbf{A}_{\mathrm{dict}} \mathbf{s} where s\mathbf{s} has only Sβ‰ͺNS \ll N nonzero entries), then only T=O(Slog⁑(N/S))T = O(S \log(N/S)) pilot slots suffice. Standard algorithms (OMP, LASSO, AMP) can recover s\mathbf{s}.

3. ON/OFF protocol (Mishra and Johansson 2019). In round tt, only element tt is turned ON (reflecting) while all others are OFF (absorbing). This provides yt=hd+vt+nty_t = h_d + v_t + n_t, directly revealing vtv_t (after subtracting the known hdh_d). Simple but requires NN rounds and wastes the potential array gain during estimation.

4. Two-timescale estimation. The BS-RIS channel G\mathbf{G} is quasi-static (both nodes are fixed), while the RIS-user channel hr\mathbf{h}_r varies with user mobility. Estimate G\mathbf{G} infrequently (slow timescale) and track hr\mathbf{h}_r at each coherence interval (fast timescale), reducing per-interval overhead from O(NM)O(NM) to O(N)O(N).

Quick Check

An RIS has N=256N = 256 elements and the BS has M=1M = 1 antenna. Using a DFT-based estimation protocol with no sparsity exploitation, how many pilot slots are required for full cascaded channel estimation?

N=256N = 256 pilot slots

N+1=257N + 1 = 257 pilot slots

2N=5122N = 512 pilot slots (real and imaginary parts)

Nlog⁑NN \log N pilot slots for a DFT-based scheme