Achievable Rate Expressions

Why Precise Rate Expressions Matter

Chapters 11-14 built the cell-free massive MIMO architecture from the ground up: distributed APs, user-centric clustering, cooperation levels, and fronthaul constraints. This chapter brings the story full circle by deriving precise, computable spectral efficiency expressions that account for all these practical realities. Without closed-form SE bounds, system design would require Monte Carlo simulation for every parameter choice β€” a luxury that 6G standardization timelines do not permit. The achievable rate expressions developed here are the engine that drives every comparison, optimization, and deployment decision in the rest of the chapter.

Definition:

Cell-Free Massive MIMO System Model

Consider a cell-free massive MIMO system with LL access points (APs), each equipped with NN antennas, jointly serving KK single-antenna users over the same time-frequency resource. The system operates in TDD mode with a coherence interval of Ο„c\tau_c samples, of which Ο„p\tau_p are used for uplink pilot transmission.

The channel from AP ll to user kk is modeled as

Hlk=Ξ²lk H~lk,H~lk∼CN(0,Rlk)\mathbf{H}_{lk} = \sqrt{\beta_{lk}} \, \tilde{\mathbf{H}}_{lk}, \quad \tilde{\mathbf{H}}_{lk} \sim \mathcal{CN}(\mathbf{0}, \mathbf{R}_{lk})

where Ξ²lk\beta_{lk} is the large-scale fading coefficient and Rlk∈CNΓ—N\mathbf{R}_{lk} \in \mathbb{C}^{N \times N} is the spatial correlation matrix with tr(Rlk)=N\text{tr}(\mathbf{R}_{lk}) = N. The uplink received signal at AP ll during the data phase is

yl=βˆ‘k=1KHlkxk+wl\mathbf{y}_l = \sum_{k=1}^{K} \mathbf{H}_{lk} x_k + \mathbf{w}_{l}

where xkx_k is user kk's transmitted symbol with power Ptk{P_t}_{k} and wl∼CN(0,Οƒ2IN)\mathbf{w}_{l} \sim \mathcal{CN}(\mathbf{0}, \sigma^2 \mathbf{I}_N).

The model subsumes uncorrelated Rayleigh fading (Rlk=IN\mathbf{R}_{lk} = \mathbf{I}_N) as a special case. The spatial correlation structure becomes important when APs have multiple antennas and when exploiting covariance information for MMSE estimation.

Definition:

MMSE Channel Estimate

After receiving Ο„p\tau_p pilot symbols, AP ll forms the MMSE estimate of Hlk\mathbf{H}_{lk}:

H^lk=PtkΟ„p RlkΞ¨lkβˆ’1ylpilot\hat{\mathbf{H}}_{lk} = \sqrt{{P_t}_{k} \tau_p} \, \mathbf{R}_{lk} \boldsymbol{\Psi}_{lk}^{-1} \mathbf{y}_l^{\text{pilot}}

where Ξ¨lk=βˆ‘j:Ο•j=Ο•kPtjΟ„pRlj+Οƒ2IN\boldsymbol{\Psi}_{lk} = \sum_{j: \phi_j = \phi_k} {P_t}_{j} \tau_p \mathbf{R}_{lj} + \sigma^2 \mathbf{I}_N is the covariance of the received pilot signal and Ο•k\phi_k denotes the pilot index assigned to user kk. The estimation quality is captured by

Ξ³lkβ‰œ1NE ⁣[βˆ₯H^lkβˆ₯2]=1Ntr ⁣(PtkΟ„pRlkΞ¨lkβˆ’1Rlk)\gamma_{lk} \triangleq \frac{1}{N} \mathbb{E}\!\left[\|\hat{\mathbf{H}}_{lk}\|^2\right] = \frac{1}{N} \text{tr}\!\left({P_t}_{k} \tau_p \mathbf{R}_{lk} \boldsymbol{\Psi}_{lk}^{-1} \mathbf{R}_{lk}\right)

The estimation error H~lk=Hlkβˆ’H^lk\tilde{\mathbf{H}}_{lk} = \mathbf{H}_{lk} - \hat{\mathbf{H}}_{lk} is independent of H^lk\hat{\mathbf{H}}_{lk} (by the orthogonality principle) with covariance Clk=Rlkβˆ’PtkΟ„pRlkΞ¨lkβˆ’1Rlk\mathbf{C}_{lk} = \mathbf{R}_{lk} - {P_t}_{k} \tau_p \mathbf{R}_{lk} \boldsymbol{\Psi}_{lk}^{-1} \mathbf{R}_{lk}.

Use-and-Then-Forget (UatF) Bound

A capacity lower bound obtained by treating the channel estimate as the true channel in the beamforming/combining operation and accounting for the estimation error as additional uncorrelated noise. The bound is tight when channel hardening holds and avoids requiring instantaneous effective channel knowledge at the receiver.

Related: Hardening-Based Bound, Ergodic Capacity

Definition:

Hardening-Based Bound

When channel hardening holds (i.e., vlkHHlkβ‰ˆE[vlkHHlk]\mathbf{v}_{lk}^{H} \mathbf{H}_{lk} \approx \mathbb{E}[\mathbf{v}_{lk}^{H} \mathbf{H}_{lk}] with small variance), the effective channel gain is nearly deterministic. The hardening-based bound exploits this by using the deterministic effective channel gain in the rate expression:

SEkhard=Ο„cβˆ’Ο„pΟ„cE ⁣[log⁑2 ⁣(1+SINRkhard)]\text{SE}_k^{\text{hard}} = \frac{\tau_c - \tau_p}{\tau_c} \mathbb{E}\!\left[\log_2\!\left(1 + \text{SINR}_k^{\text{hard}}\right)\right]

where the expectation is over the small-scale fading and the SINR is computed using the instantaneous effective channel gains rather than their expectations. This bound is tighter than UatF when channel hardening is strong but requires knowledge of the effective channel statistics at the receiver.

For co-located massive MIMO with many antennas, hardening is strong and both bounds are tight. For cell-free systems with single-antenna APs, hardening is weaker (averaging over independent large-scale fading coefficients instead of many co-located antenna elements), making the gap between the two bounds more significant.

Hardening-Based Bound

An achievable rate bound that exploits channel hardening to treat the effective channel gain as approximately deterministic. Tighter than UatF when hardening holds, but requires statistical CSI at the receiver.

Related: Use-and-Then-Forget (UatF) Bound, Channel Hardening

Example: SE Under Uncorrelated Rayleigh Fading

Compute the closed-form UatF SE for a cell-free system with LL single-antenna APs (N=1N = 1), uncorrelated Rayleigh fading (Rlk=1\mathbf{R}_{lk} = 1), and MRC combining, assuming no pilot contamination (orthogonal pilots for all users).

Impact of Pilot Contamination on SE

When users kk and jj share the same pilot (Ο•k=Ο•j\phi_k = \phi_j), the MMSE estimate H^lk\hat{\mathbf{H}}_{lk} becomes correlated with Hlj\mathbf{H}_{lj}. This introduces a coherent interference term that does not vanish as Lβ†’βˆžL \to \infty:

E ⁣[H^lkHHlj]=PtjΟ„p tr ⁣(RljΞ¨lkβˆ’1Rlk)β‰ 0\mathbb{E}\!\left[\hat{\mathbf{H}}_{lk}^H \mathbf{H}_{lj}\right] = {P_t}_{j} \tau_p \, \text{tr}\!\left(\mathbf{R}_{lj} \boldsymbol{\Psi}_{lk}^{-1} \mathbf{R}_{lk}\right) \neq 0

This is the cell-free version of pilot contamination. Unlike co-located massive MIMO where pilot contamination from users in other cells creates an asymptotic rate ceiling, in cell-free systems the impact is mitigated by (i) careful pilot assignment exploiting the geographic separation of users, and (ii) MMSE combining that suppresses contaminating users based on their spatial signatures.

Common Mistake: Confusing Coherent and Non-Coherent Combining Gains

Mistake:

Assuming that the combining gain from LL APs is always LL (linear). This confuses non-coherent combining (power addition) with coherent combining (amplitude addition).

Correction:

With coherent combining under perfect CSI, the signal power grows as (βˆ‘lΞ²lk)2(\sum_l \sqrt{\beta_{lk}})^2, which is proportional to L2L^2 when all APs have equal path loss. Non-coherent combining gives only βˆ‘lΞ²lk∝L\sum_l \beta_{lk} \propto L. The UatF bound achieves coherent combining for the desired signal (the numerator squares the sum), while interference combines non-coherently. This L2L^2 vs LL scaling is the fundamental reason cell-free outperforms alternatives.

Theorem: SE with Centralized MMSE (Level 4)

Under centralized MMSE combining (Level 4 cooperation), where the CPU has access to all received signals and channel estimates, the uplink SE of user kk is

SEkL4=Ο„cβˆ’Ο„pΟ„clog⁑2 ⁣(1+PtkH^kH(βˆ‘jβ‰ kPtjH^jH^jH+βˆ‘j=1KPtjCj+Οƒ2ILN)βˆ’1H^k)\text{SE}_k^{\text{L4}} = \frac{\tau_c - \tau_p}{\tau_c} \log_2\!\left(1 + {P_t}_{k} \hat{\mathbf{H}}_k^H \left(\sum_{j \neq k} {P_t}_{j} \hat{\mathbf{H}}_j \hat{\mathbf{H}}_j^H + \sum_{j=1}^{K} {P_t}_{j} \mathbf{C}_j + \sigma^2 \mathbf{I}_{LN}\right)^{-1} \hat{\mathbf{H}}_k\right)

where H^k=[H^1kT,…,H^LkT]T∈CLN\hat{\mathbf{H}}_k = [\hat{\mathbf{H}}_{1k}^T, \ldots, \hat{\mathbf{H}}_{Lk}^T]^T \in \mathbb{C}^{LN} is the stacked channel estimate and Cj=blkdiag(C1j,…,CLj)\mathbf{C}_j = \text{blkdiag}(\mathbf{C}_{1j}, \ldots, \mathbf{C}_{Lj}). This is an ergodic achievable rate (not just a bound) when the effective SINR is treated as a random variable and the expectation is taken over the channel realizations.

Level 4 gives the CPU access to the full LNLN-dimensional received signal. The MMSE combining vector jointly suppresses inter-user interference using spatial signatures from all APs. This is the best achievable performance for linear processing but requires sending all raw baseband samples over the fronthaul β€” a cost of LNLN complex scalars per sample.

Spectral Efficiency vs Number of APs

Explore how the per-user spectral efficiency scales with the number of distributed APs under different cooperation levels (MRC, local MMSE, centralized MMSE) and with/without pilot contamination.

Parameters
10
100
10

Quick Check

In the UatF bound, the beamforming gain uncertainty (the random fluctuation of vlkHHlk\mathbf{v}_{lk}^{H} \mathbf{H}_{lk} around its mean) is treated as:

Part of the desired signal, increasing the SINR

Worst-case uncorrelated Gaussian noise, decreasing the SINR

Ignored entirely, as it averages to zero

Quantified exactly using instantaneous CSI

Example: SE Loss from Pilot Contamination

In a cell-free system with L=64L = 64 single-antenna APs and K=20K = 20 users, only Ο„p=10\tau_p = 10 orthogonal pilots are available. Two users kk and jj share the same pilot and are located such that Ξ²lk=Ξ²lj\beta_{lk} = \beta_{lj} for all ll (worst case: identical large-scale fading). Compare the SE of user kk with and without pilot contamination from user jj.

Key Takeaway

The spectral efficiency of cell-free massive MIMO is governed by the coherent combining gain (βˆ‘lΞ³lk)2(\sum_l \gamma_{lk})^2 in the SINR numerator, which grows quadratically in the number of contributing APs. This macro-diversity gain β€” absent in co-located architectures β€” is the fundamental performance advantage of cell-free systems.

Theorem: SE Under Fronthaul Capacity Constraints

When each AP-to-CPU fronthaul link has capacity CfhC_{\text{fh}} bits per sample and AP ll uses scalar quantization with blb_l bits per real dimension, the effective uplink SE becomes

SEkfh=Ο„cβˆ’Ο„pΟ„clog⁑2 ⁣(1+SINRkul1+Ξ±qβ‹…SINRkul)\text{SE}_k^{\text{fh}} = \frac{\tau_c - \tau_p}{\tau_c} \log_2\!\left(1 + \frac{\text{SINR}_k^{\text{ul}}}{1 + \alpha_q \cdot \text{SINR}_k^{\text{ul}}}\right)

where Ξ±q=Ο€32β‹…2βˆ’2bl\alpha_q = \frac{\pi \sqrt{3}}{2} \cdot 2^{-2b_l} is the quantization distortion factor (from the Bussgang decomposition) and bl≀Cfh/(2N)b_l \leq C_{\text{fh}} / (2N) is the per-dimension bit budget. As blβ†’βˆžb_l \to \infty, Ξ±qβ†’0\alpha_q \to 0 and the full SE is recovered.

Fronthaul quantization introduces an effective noise floor that cannot be overcome by adding more APs. With coarse quantization (blb_l small), Ξ±q\alpha_q is large and the SE saturates regardless of SNR β€” the fronthaul becomes the bottleneck. This motivates the fronthaul-aware designs of Chapter 14.

,
⚠️Engineering Note

Practical Fronthaul Bit-Width Selection

In eCPRI-based fronthaul (O-RAN 7.2x split), the typical quantization resolution is b=8b = 8--1212 bits per I/Q sample. At b=8b = 8, Ξ±qβ‰ˆ2.4Γ—10βˆ’5\alpha_q \approx 2.4 \times 10^{-5}, meaning the SE loss from quantization is negligible for SINR below 46 dB. At b=4b = 4 (aggressive compression), Ξ±qβ‰ˆ0.01\alpha_q \approx 0.01 and the SE ceiling appears at SINR β‰ˆ20\approx 20 dB. For cell-free systems with many APs and moderate per-AP SINR, b=6b = 6--88 bits suffices for less than 0.5 dB SE loss.

Practical Constraints
  • β€’

    eCPRI fronthaul rate = 2 * N * b * f_s bits/s per AP, where f_s is the sampling rate

  • β€’

    For N = 4 antennas, b = 8, f_s = 30.72 MHz: fronthaul = 1.97 Gbps per AP

  • β€’

    25 Gbps Ethernet can support ~12 multi-antenna APs with b = 8

Spectral Efficiency (SE)

The achievable rate per unit bandwidth, measured in bits/s/Hz. In cell-free massive MIMO, the per-user SE accounts for pilot overhead via the pre-log factor (Ο„cβˆ’Ο„p)/Ο„c(\tau_c - \tau_p)/\tau_c and for imperfect CSI via the UatF or hardening-based bounds.

Related: Use-and-Then-Forget (UatF) Bound, Achievable Rate

Large-Scale Fading Decoding (LSFD)

A technique where the CPU applies AP-specific weights Ξ·lk\eta_{lk} to the locally combined signals before aggregation. The weights depend only on large-scale fading coefficients (which change slowly), avoiding the need for instantaneous CSI at the CPU. Optimizing Ξ·lk\eta_{lk} maximizes the effective SINR.

Related: Levels of AP Cooperation, Local Combining

Historical Note: Origins of the Use-and-Then-Forget Bound

2000s-2010s

The UatF bounding technique traces back to Medard (2000), who showed that treating the channel estimate as the true channel yields a valid achievable rate in fading channels. The technique was popularized in the massive MIMO context by Marzetta (2010) and Ngo et al. (2013), who used it to derive closed-form rate expressions that revealed the power scaling laws. The name "use-and-then-forget" was coined by Marzetta, Larsson, Yang, and Ngo in their 2016 textbook, emphasizing that the bound does not require the receiver to track the instantaneous effective channel β€” it "uses" the channel estimate for beamforming and then "forgets" it for detection.

Why This Matters: SE Bounds in 5G NR System Design

The UatF bound is not just an academic tool β€” it directly informs 5G NR system design. The 3GPP system-level evaluation methodology uses similar capacity bounds (treating CSI errors as noise) to compare MIMO configurations. The cell-free SE expressions derived here predict the performance gains that O-RAN distributed MIMO deployments achieve over conventional macro-cell architectures, guiding operators in their network densification strategies.

See full treatment in Chapter 22

Quick Check

In the fronthaul-limited SE expression, what happens as the number of APs LL increases while the per-AP fronthaul capacity CfhC_{\text{fh}} remains fixed?

SE grows without bound because more APs always help

SE saturates at a ceiling determined by the quantization bit-width blb_l

SE decreases because more fronthaul links add more quantization noise

SE is independent of LL and depends only on fronthaul capacity

Common Mistake: Using Perfect CSI SE as a Design Target

Mistake:

Designing cell-free systems using SE expressions that assume perfect CSI and ignoring the pilot overhead pre-log factor (Ο„cβˆ’Ο„p)/Ο„c(\tau_c - \tau_p) / \tau_c.

Correction:

With K=20K = 20 users and orthogonal pilots (Ο„p=20\tau_p = 20) in a coherence interval of Ο„c=200\tau_c = 200 samples, the pre-log factor is 180/200=0.9180/200 = 0.9 β€” a 10% SE loss just from pilot overhead. With pilot reuse, Ο„p\tau_p is smaller but pilot contamination reduces the effective SINR. The net SE under imperfect CSI can be 30-50% lower than the perfect-CSI prediction, especially for cell-edge users where estimation quality is poor.