Achievable Rate Expressions
Why Precise Rate Expressions Matter
Chapters 11-14 built the cell-free massive MIMO architecture from the ground up: distributed APs, user-centric clustering, cooperation levels, and fronthaul constraints. This chapter brings the story full circle by deriving precise, computable spectral efficiency expressions that account for all these practical realities. Without closed-form SE bounds, system design would require Monte Carlo simulation for every parameter choice β a luxury that 6G standardization timelines do not permit. The achievable rate expressions developed here are the engine that drives every comparison, optimization, and deployment decision in the rest of the chapter.
Definition: Cell-Free Massive MIMO System Model
Cell-Free Massive MIMO System Model
Consider a cell-free massive MIMO system with access points (APs), each equipped with antennas, jointly serving single-antenna users over the same time-frequency resource. The system operates in TDD mode with a coherence interval of samples, of which are used for uplink pilot transmission.
The channel from AP to user is modeled as
where is the large-scale fading coefficient and is the spatial correlation matrix with . The uplink received signal at AP during the data phase is
where is user 's transmitted symbol with power and .
The model subsumes uncorrelated Rayleigh fading () as a special case. The spatial correlation structure becomes important when APs have multiple antennas and when exploiting covariance information for MMSE estimation.
Definition: MMSE Channel Estimate
MMSE Channel Estimate
After receiving pilot symbols, AP forms the MMSE estimate of :
where is the covariance of the received pilot signal and denotes the pilot index assigned to user . The estimation quality is captured by
The estimation error is independent of (by the orthogonality principle) with covariance .
Use-and-Then-Forget (UatF) Bound
A capacity lower bound obtained by treating the channel estimate as the true channel in the beamforming/combining operation and accounting for the estimation error as additional uncorrelated noise. The bound is tight when channel hardening holds and avoids requiring instantaneous effective channel knowledge at the receiver.
Related: Hardening-Based Bound, Ergodic Capacity
Theorem: UatF Bound for Uplink Spectral Efficiency
Consider a cell-free massive MIMO system where AP uses combining vector (local MRC). The uplink spectral efficiency of user under the use-and-then-forget bound is
where
Here are the LSFD (large-scale fading decoding) weights applied at the CPU.
The numerator contains the coherent combining gain , which grows quadratically in the number of APs with good channels to user . This macro-diversity gain is the fundamental advantage of cell-free over co-located architectures: the sum extends over geographically distributed APs, each providing an independent fading path.
Signal decomposition
After local combining at AP , the CPU receives . The CPU forms the global estimate .
Desired signal extraction
Decompose the desired signal term: . With MRC, .
UatF bounding technique
Treat as the desired signal and everything else (beamforming uncertainty, inter-user interference, noise) as worst-case uncorrelated Gaussian noise. By the capacity of a Gaussian channel with known deterministic gain, the ergodic achievable rate follows. The pre-log factor accounts for pilot overhead.
SINR expression
Computing the signal and interference-plus-noise powers and applying the standard Gaussian lower bound yields the stated SINR expression.
Definition: Hardening-Based Bound
Hardening-Based Bound
When channel hardening holds (i.e., with small variance), the effective channel gain is nearly deterministic. The hardening-based bound exploits this by using the deterministic effective channel gain in the rate expression:
where the expectation is over the small-scale fading and the SINR is computed using the instantaneous effective channel gains rather than their expectations. This bound is tighter than UatF when channel hardening is strong but requires knowledge of the effective channel statistics at the receiver.
For co-located massive MIMO with many antennas, hardening is strong and both bounds are tight. For cell-free systems with single-antenna APs, hardening is weaker (averaging over independent large-scale fading coefficients instead of many co-located antenna elements), making the gap between the two bounds more significant.
Hardening-Based Bound
An achievable rate bound that exploits channel hardening to treat the effective channel gain as approximately deterministic. Tighter than UatF when hardening holds, but requires statistical CSI at the receiver.
Related: Use-and-Then-Forget (UatF) Bound, Channel Hardening
Theorem: Downlink SE with Conjugate Beamforming
Under conjugate beamforming (CB) at each AP, where AP uses precoding vector for user , the downlink SE under the UatF bound is
with
subject to the per-AP power constraint for all .
The structure mirrors the uplink: the numerator is the squared coherent combining gain, and the denominator contains beamforming gain uncertainty, inter-user interference, and noise. The per-AP power constraint couples the power control coefficients across users, making the optimization problem more constrained than in co-located systems.
Received signal
User receives . Decompose the desired signal: .
Apply UatF
By the UatF technique, treat the deterministic part as the signal gain. By TDD reciprocity and the MMSE estimation properties, . The SINR follows from computing the signal and interference-plus-noise powers.
Impact of Pilot Contamination on SE
When users and share the same pilot (), the MMSE estimate becomes correlated with . This introduces a coherent interference term that does not vanish as :
This is the cell-free version of pilot contamination. Unlike co-located massive MIMO where pilot contamination from users in other cells creates an asymptotic rate ceiling, in cell-free systems the impact is mitigated by (i) careful pilot assignment exploiting the geographic separation of users, and (ii) MMSE combining that suppresses contaminating users based on their spatial signatures.
Common Mistake: Confusing Coherent and Non-Coherent Combining Gains
Mistake:
Assuming that the combining gain from APs is always (linear). This confuses non-coherent combining (power addition) with coherent combining (amplitude addition).
Correction:
With coherent combining under perfect CSI, the signal power grows as , which is proportional to when all APs have equal path loss. Non-coherent combining gives only . The UatF bound achieves coherent combining for the desired signal (the numerator squares the sum), while interference combines non-coherently. This vs scaling is the fundamental reason cell-free outperforms alternatives.
Theorem: SE with Centralized MMSE (Level 4)
Under centralized MMSE combining (Level 4 cooperation), where the CPU has access to all received signals and channel estimates, the uplink SE of user is
where is the stacked channel estimate and . This is an ergodic achievable rate (not just a bound) when the effective SINR is treated as a random variable and the expectation is taken over the channel realizations.
Level 4 gives the CPU access to the full -dimensional received signal. The MMSE combining vector jointly suppresses inter-user interference using spatial signatures from all APs. This is the best achievable performance for linear processing but requires sending all raw baseband samples over the fronthaul β a cost of complex scalars per sample.
Stacked system model
The stacked received signal is where is the stacked estimation error.
MMSE combining
The MMSE combining vector for user is . The resulting SINR follows from the matrix inversion lemma.
Rate expression
The standard capacity of a Gaussian channel with MMSE combining yields the stated expression. The expectation over fading gives the ergodic rate.
Spectral Efficiency vs Number of APs
Explore how the per-user spectral efficiency scales with the number of distributed APs under different cooperation levels (MRC, local MMSE, centralized MMSE) and with/without pilot contamination.
Parameters
Quick Check
In the UatF bound, the beamforming gain uncertainty (the random fluctuation of around its mean) is treated as:
Part of the desired signal, increasing the SINR
Worst-case uncorrelated Gaussian noise, decreasing the SINR
Ignored entirely, as it averages to zero
Quantified exactly using instantaneous CSI
The UatF bound treats all random components β including beamforming gain uncertainty β as worst-case Gaussian noise uncorrelated with the desired signal. This gives a clean, computable lower bound.
Example: SE Loss from Pilot Contamination
In a cell-free system with single-antenna APs and users, only orthogonal pilots are available. Two users and share the same pilot and are located such that for all (worst case: identical large-scale fading). Compare the SE of user with and without pilot contamination from user .
Without contamination
With orthogonal pilots, . The SINR numerator is , which grows as .
With contamination from user $j$
The contaminated estimate has , so . With equal powers, .
Coherent interference
The interference from user contains a coherent component that adds constructively across APs. The interference term scales as , same as the signal. As , . With equal powers and , the asymptotic SINR ceiling is , or about 6 dB β a significant loss.
Key Takeaway
The spectral efficiency of cell-free massive MIMO is governed by the coherent combining gain in the SINR numerator, which grows quadratically in the number of contributing APs. This macro-diversity gain β absent in co-located architectures β is the fundamental performance advantage of cell-free systems.
Theorem: SE Under Fronthaul Capacity Constraints
When each AP-to-CPU fronthaul link has capacity bits per sample and AP uses scalar quantization with bits per real dimension, the effective uplink SE becomes
where is the quantization distortion factor (from the Bussgang decomposition) and is the per-dimension bit budget. As , and the full SE is recovered.
Fronthaul quantization introduces an effective noise floor that cannot be overcome by adding more APs. With coarse quantization ( small), is large and the SE saturates regardless of SNR β the fronthaul becomes the bottleneck. This motivates the fronthaul-aware designs of Chapter 14.
Bussgang decomposition
The quantized signal is decomposed as where is the quantization distortion uncorrelated with , with variance .
Effective SINR
The quantization distortion adds to the noise floor. The effective SINR after combining at the CPU becomes , a monotonically increasing but saturating function of the original SINR.
Practical Fronthaul Bit-Width Selection
In eCPRI-based fronthaul (O-RAN 7.2x split), the typical quantization resolution is -- bits per I/Q sample. At , , meaning the SE loss from quantization is negligible for SINR below 46 dB. At (aggressive compression), and the SE ceiling appears at SINR dB. For cell-free systems with many APs and moderate per-AP SINR, -- bits suffices for less than 0.5 dB SE loss.
- β’
eCPRI fronthaul rate = 2 * N * b * f_s bits/s per AP, where f_s is the sampling rate
- β’
For N = 4 antennas, b = 8, f_s = 30.72 MHz: fronthaul = 1.97 Gbps per AP
- β’
25 Gbps Ethernet can support ~12 multi-antenna APs with b = 8
Spectral Efficiency (SE)
The achievable rate per unit bandwidth, measured in bits/s/Hz. In cell-free massive MIMO, the per-user SE accounts for pilot overhead via the pre-log factor and for imperfect CSI via the UatF or hardening-based bounds.
Large-Scale Fading Decoding (LSFD)
A technique where the CPU applies AP-specific weights to the locally combined signals before aggregation. The weights depend only on large-scale fading coefficients (which change slowly), avoiding the need for instantaneous CSI at the CPU. Optimizing maximizes the effective SINR.
Related: Levels of AP Cooperation, Local Combining
Historical Note: Origins of the Use-and-Then-Forget Bound
2000s-2010sThe UatF bounding technique traces back to Medard (2000), who showed that treating the channel estimate as the true channel yields a valid achievable rate in fading channels. The technique was popularized in the massive MIMO context by Marzetta (2010) and Ngo et al. (2013), who used it to derive closed-form rate expressions that revealed the power scaling laws. The name "use-and-then-forget" was coined by Marzetta, Larsson, Yang, and Ngo in their 2016 textbook, emphasizing that the bound does not require the receiver to track the instantaneous effective channel β it "uses" the channel estimate for beamforming and then "forgets" it for detection.
Why This Matters: SE Bounds in 5G NR System Design
The UatF bound is not just an academic tool β it directly informs 5G NR system design. The 3GPP system-level evaluation methodology uses similar capacity bounds (treating CSI errors as noise) to compare MIMO configurations. The cell-free SE expressions derived here predict the performance gains that O-RAN distributed MIMO deployments achieve over conventional macro-cell architectures, guiding operators in their network densification strategies.
See full treatment in Chapter 22
Quick Check
In the fronthaul-limited SE expression, what happens as the number of APs increases while the per-AP fronthaul capacity remains fixed?
SE grows without bound because more APs always help
SE saturates at a ceiling determined by the quantization bit-width
SE decreases because more fronthaul links add more quantization noise
SE is independent of and depends only on fronthaul capacity
The quantization distortion factor creates an SINR ceiling of . Adding more APs increases the pre-quantization SINR but not the post-quantization SINR beyond this ceiling.
Common Mistake: Using Perfect CSI SE as a Design Target
Mistake:
Designing cell-free systems using SE expressions that assume perfect CSI and ignoring the pilot overhead pre-log factor .
Correction:
With users and orthogonal pilots () in a coherence interval of samples, the pre-log factor is β a 10% SE loss just from pilot overhead. With pilot reuse, is smaller but pilot contamination reduces the effective SINR. The net SE under imperfect CSI can be 30-50% lower than the perfect-CSI prediction, especially for cell-edge users where estimation quality is poor.