The Use-and-Then-Forget Bound

From Channel Estimates to Achievable Rates

In Chapter 3 we developed MMSE channel estimation and showed that the base station obtains an estimate H^k\hat{\mathbf{H}}_k of each user's channel. The natural next question is: how much data rate can the system deliver to each user, given imperfect channel knowledge?

Computing the exact ergodic capacity with imperfect CSI is, in general, an open problem. The channel estimate and the true channel are correlated, and the receiver must account for this correlation in its decoding. What we need is a tractable lower bound — tight enough to be useful for system design, yet simple enough to yield closed-form expressions.

The use-and-then-forget (UatF) bound provides exactly this. The idea is beautifully simple: treat the channel estimate as if it were perfect, and absorb all estimation error into an effective noise term. This "forgets" the statistical relationship between the estimate and the error, which can only reduce the achievable rate — hence the result is a lower bound. The bound becomes tight in the massive MIMO regime precisely because channel hardening makes the effective channel nearly deterministic.

Uplink System Model Recap

Consider a single-cell massive MIMO system with NtN_t base station antennas and KK single-antenna users. In the uplink data transmission phase, the received signal at the base station is

y=k=1KPtkHkxk+w,\mathbf{y} = \sum_{k=1}^{K} \sqrt{{P_t}_{k}} \, \mathbf{H}_{k} \, x_k + \mathbf{w},

where xkx_k is the unit-power data symbol from user kk, Ptk{P_t}_{k} is the transmit power, and wCN(0,σ2INt)\mathbf{w} \sim \mathcal{CN}(\mathbf{0}, \sigma^2 \mathbf{I}_{N_t}) is the additive noise.

From Chapter 3, the MMSE channel estimate satisfies Hk=H^k+H~k\mathbf{H}_{k} = \hat{\mathbf{H}}_k + \tilde{\mathbf{H}}_k, where H^k\hat{\mathbf{H}}_k and H~k\tilde{\mathbf{H}}_k are independent (a property of MMSE estimation for Gaussian vectors). We define γkE[H^k2]/Nt\gamma_k \triangleq \mathbb{E}[\|\hat{\mathbf{H}}_k\|^2] / N_t as the normalized estimation quality.

Definition:

UatF Effective SINR

Let vkCNt\mathbf{v}_{k} \in \mathbb{C}^{N_t} be the combining vector for user kk, chosen as a function of the channel estimates {H^1,,H^K}\{\hat{\mathbf{H}}_1, \ldots, \hat{\mathbf{H}}_{K}\}. After combining, the signal for user kk is

x^k=vkHy=PtkvkHH^kdesired signal (known part)xk+PtkvkHH~kxk+jkPtjvkHHjxj+vkHweffective noise (uncorrelated with desired).\hat{x}_k = \mathbf{v}_{k}^{H} \mathbf{y} = \underbrace{\sqrt{{P_t}_{k}} \, \mathbf{v}_{k}^{H} \hat{\mathbf{H}}_k}_{\text{desired signal (known part)}} \, x_k + \underbrace{\sqrt{{P_t}_{k}} \, \mathbf{v}_{k}^{H} \tilde{\mathbf{H}}_k \, x_k + \sum_{j \neq k} \sqrt{{P_t}_{j}} \, \mathbf{v}_{k}^{H} \mathbf{H}_{j} \, x_j + \mathbf{v}_{k}^{H} \mathbf{w}}_{\text{effective noise (uncorrelated with desired)}}.

The UatF effective SINR is defined as

SINRkUatF=PtkE ⁣[vkHHk]2j=1KPtjE ⁣[vkHHj2]PtkE ⁣[vkHHk]2+σ2E ⁣[vk2].\text{SINR}_k^{\text{UatF}} = \frac{{P_t}_{k} \left| \mathbb{E}\!\left[\mathbf{v}_{k}^{H} \mathbf{H}_{k}\right] \right|^2}{\sum_{j=1}^{K} {P_t}_{j} \, \mathbb{E}\!\left[\left|\mathbf{v}_{k}^{H} \mathbf{H}_{j}\right|^2\right] - {P_t}_{k} \left|\mathbb{E}\!\left[\mathbf{v}_{k}^{H} \mathbf{H}_{k}\right]\right|^2 + \sigma^2 \, \mathbb{E}\!\left[\|\mathbf{v}_{k}\|^2\right]}.

,

Theorem: Use-and-Then-Forget Bound

Under the system model above, an achievable ergodic rate for user kk is

RkUatF=log2 ⁣(1+SINRkUatF)[bits/s/Hz],R_k^{\text{UatF}} = \log_2\!\left(1 + \text{SINR}_k^{\text{UatF}}\right) \quad \text{[bits/s/Hz]},

where SINRkUatF\text{SINR}_k^{\text{UatF}} is the UatF effective SINR from Definition DUatF Effective SINR.

The key insight is that E[vkHHk]\mathbb{E}[\mathbf{v}_{k}^{H} \mathbf{H}_{k}] is a deterministic scalar — it does not depend on the instantaneous channel realization. The UatF bound treats this deterministic quantity as the "channel gain" and lumps everything else (estimation error, interference, noise) into an uncorrelated effective noise. Since the effective noise is uncorrelated with the desired signal, the worst-case distribution is Gaussian (by the maximum entropy property), which yields the log formula.

In the massive MIMO regime (NtN_t \to \infty), channel hardening means vkHHk/E[vkHHk]1\mathbf{v}_{k}^{H} \mathbf{H}_{k} / \mathbb{E}[\mathbf{v}_{k}^{H} \mathbf{H}_{k}] \to 1, so the gap between the bound and the true capacity vanishes.

,

When Is the UatF Bound Tight?

The UatF bound is tight when:

  1. Channel hardening holds: vkHHkE[vkHHk]\mathbf{v}_{k}^{H} \mathbf{H}_{k} \approx \mathbb{E}[\mathbf{v}_{k}^{H} \mathbf{H}_{k}] with high probability, which occurs as NtN_t \to \infty for i.i.d. Rayleigh channels.

  2. The effective noise is approximately Gaussian: by the central limit theorem, the sum of many interference terms converges to Gaussian as KK grows.

For finite NtN_t, the bound is loose. The gap can be reduced by using the instantaneous effective SINR vkHHk2/()|\mathbf{v}_{k}^{H} \mathbf{H}_{k}|^2 / (\cdots) and taking Rk=E[log2(1+SINRkinst)]R_k = \mathbb{E}[\log_2(1 + \text{SINR}_k^{\text{inst}})], but this no longer yields a closed-form expression.

Historical Note: Origin of the UatF Bounding Technique

2000-2016

The use-and-then-forget terminology was coined by Marzetta, Larsson, Yang, and Ngo in their 2016 textbook, but the underlying technique is older. The idea of treating the channel estimate as the true channel and absorbing estimation error into effective noise dates back to Medard (2000), who studied the capacity of channels with imperfect CSI. Hassibi and Hochwald (2003) applied similar ideas to MIMO training design. The key contribution of the massive MIMO literature was recognizing that channel hardening makes this bound asymptotically tight, turning an approximation tool into a principled design framework.

,

Use-and-Then-Forget (UatF) Bound

A lower bound on the achievable ergodic rate obtained by treating the channel estimate as if it were the true channel and absorbing the estimation error into an effective noise term whose distribution is then pessimized to Gaussian. Tight under channel hardening.

Related: Channel Hardening, Effective SINR

Channel Hardening

The phenomenon whereby vkHHk/E[vkHHk]1\mathbf{v}_{k}^{H} \mathbf{H}_{k} / \mathbb{E}[\mathbf{v}_{k}^{H} \mathbf{H}_{k}] \to 1 almost surely as NtN_t \to \infty. The effective scalar channel after combining becomes nearly deterministic, eliminating small-scale fading from the user's perspective.

Related: Use-and-Then-Forget (UatF) Bound

Effective SINR

The signal-to-interference-plus-noise ratio computed after linear combining, using the UatF decomposition. For user kk with combining vector vk\mathbf{v}_{k}, the effective SINR determines the achievable rate via Rk=log2(1+SINRk)R_k = \log_2(1 + \text{SINR}_k).

Related: Use-and-Then-Forget (UatF) Bound

Example: UatF Bound for a Single User with MRC

Consider a single-user system (K=1K = 1) with i.i.d. Rayleigh fading: H1CN(0,β1INt)\mathbf{H}_{1} \sim \mathcal{CN}(\mathbf{0}, \beta_{1} \mathbf{I}_{N_t}). After MMSE estimation with τp\tau_p pilot symbols at power Ptp{P_t}_{p}, the estimate satisfies H^1CN(0,γ1NtI)\hat{\mathbf{H}}_1 \sim \mathcal{CN}(\mathbf{0}, \gamma_1 N_t \mathbf{I}) where γ1=β12τpPtp/(β1τpPtp+σ2)\gamma_1 = \beta_{1}^{2} \tau_p {P_t}_{p} / (\beta_{1} \tau_p {P_t}_{p} + \sigma^2). With MRC combining v1=H^1\mathbf{v}_{1} = \hat{\mathbf{H}}_1, compute SINR1UatF\text{SINR}_1^{\text{UatF}}.

Common Mistake: UatF Does Not Account for Correlation Structure

Mistake:

Applying the UatF bound with the same formula to spatially correlated channels without modifying the expectations. With correlated channels (HkCN(0,Rk)\mathbf{H}_{k} \sim \mathcal{CN}(\mathbf{0}, \mathbf{R}_k)), the expectations E[vkHHk]\mathbb{E}[\mathbf{v}_{k}^{H} \mathbf{H}_{k}] and E[vkHHj2]\mathbb{E}[|\mathbf{v}_{k}^{H} \mathbf{H}_{j}|^2] have different structure than in the i.i.d. case.

Correction:

The UatF bounding technique is valid for any channel distribution. However, the closed-form expressions derived under i.i.d. Rayleigh must be rederived for correlated channels. With spatial correlation Rk\mathbf{R}_k, the MMSE estimate becomes H^kCN(0,γkRk)\hat{\mathbf{H}}_k \sim \mathcal{CN}(\mathbf{0}, \gamma_k \mathbf{R}_k) where γk\gamma_k now depends on the correlation structure and pilot contamination pattern. Always check which channel model underlies a given rate expression before applying it.

Quick Check

Why is the UatF bound a lower bound on the true achievable rate?

Because it overestimates the noise power

Because it pessimizes the effective noise distribution to Gaussian

Because it ignores the estimation error entirely

Because it uses Jensen's inequality on the log function

Quick Check

As NtN_t \to \infty with i.i.d. Rayleigh fading, what happens to vkHHk/E[vkHHk]\mathbf{v}_{k}^{H} \mathbf{H}_{k} / \mathbb{E}[\mathbf{v}_{k}^{H} \mathbf{H}_{k}] when vk=H^k\mathbf{v}_{k} = \hat{\mathbf{H}}_k (MRC)?

It diverges to infinity

It converges to 1 almost surely

It oscillates randomly around 1

It converges to 0

Why This Matters: UatF Bound and 5G NR System Design

The UatF bound is not merely a theoretical tool — it is the standard method for evaluating massive MIMO performance in 3GPP studies. 5G NR system-level simulations compute user rates using the UatF SINR formula with MMSE channel estimation. The bound's closed-form nature allows rapid evaluation of scheduling, power control, and pilot assignment algorithms without resorting to Monte Carlo simulation of the full mutual information. This is why the massive MIMO literature almost universally reports UatF rates rather than true ergodic rates.

Key Takeaway

The UatF bound transforms the intractable problem of computing capacity with imperfect CSI into a simple SINR formula. It works by treating the estimated channel as deterministic and absorbing estimation error into effective noise. The bound is tight under channel hardening (Nt1N_t \gg 1), making it the workhorse of massive MIMO rate analysis.