Achievable Rates with ZF and MMSE
Beyond MRC: Interference Suppression
MRC treats each user independently, ignoring multi-user interference. When is not negligible compared to , this interference dominates and the MRC rate saturates. Two alternatives exploit the excess spatial degrees of freedom to suppress interference:
-
Zero Forcing (ZF): Projects each user's signal onto the subspace orthogonal to all other users' estimated channels. This eliminates interference completely (given perfect estimates) but amplifies noise because the projection reduces the effective array dimension.
-
MMSE (Regularized ZF): Adds a regularization term that balances interference suppression against noise amplification. This is optimal among linear receivers in the MSE sense.
Both require inverting a matrix, but the payoff is dramatic: the per-user rate with ZF/MMSE can be substantially higher than MRC, especially when approaches .
Definition: Zero-Forcing Combining Vector
Zero-Forcing Combining Vector
Let be the matrix of channel estimates. The ZF combining vector for user is the -th column of
i.e., , where is the -th standard basis vector.
ZF requires for the pseudo-inverse to exist. In massive MIMO, is the typical operating regime, so this condition is easily satisfied.
Definition: MMSE (Regularized ZF) Combining Vector
MMSE (Regularized ZF) Combining Vector
The MMSE combining vector for user is
where accounts for the estimation error variance and noise.
When and the estimation is perfect (), the MMSE combiner reduces to ZF. The regularization term prevents noise amplification by keeping the matrix inversion well-conditioned.
Theorem: Closed-Form UatF Rate with ZF Combining
Under i.i.d. Rayleigh fading with MMSE estimation and no pilot contamination, the UatF achievable rate for user with ZF combining is
valid for .
Comparing with the MRC formula, two changes stand out:
-
The array gain is instead of . ZF "uses up" degrees of freedom to null interference, leaving for coherent combining. When , the loss is negligible.
-
The denominator contains (the estimation error variance) instead of (the full channel power). ZF eliminates the component of interference along the estimated channels β only the part due to estimation error remains. With perfect estimation (), the interference vanishes entirely and the denominator is just .
Step 1: ZF eliminates estimated-channel interference
By construction, for and . Therefore
Step 2: Compute the signal mean
{P_t}_{k}$.
Step 3: Compute the interference power
For : . Since and is independent of :
Step 4: Use the trace identity for ZF norm
For i.i.d. channels, the expected ZF combining norm is
which follows from the distribution of the inverse Wishart matrix.
Step 5: Assemble the SINR
$
Theorem: UatF Rate with MMSE Combining (Large-System Approximation)
Under i.i.d. Rayleigh fading in the large-system limit ( with ), the UatF achievable rate with MMSE combining converges to
where is the Stieltjes transform of the asymptotic eigenvalue distribution of the interference-plus-noise covariance matrix, evaluated at . For equal power and equal path loss, this simplifies to
where is the unique positive solution to .
The MMSE rate interpolates between MRC (when dominates) and ZF (when interference dominates). The random matrix theory machinery is needed because the MMSE combiner couples all users through the matrix inversion, making the per-user SINR depend on the joint statistics of all channels.
For practical purposes, when , the MMSE rate is very close to the ZF rate, and the simpler ZF formula suffices for system design.
Sketch: Random matrix theory approach
The MMSE SINR for user involves where . In the large-system limit, the resolvent concentrates around a deterministic equivalent characterized by the Stieltjes transform of the Marchenko-Pastur distribution. The rigorous proof uses the rank-1 perturbation formula and the matrix inversion lemma β see Hoydis, ten Brink, and Debbah (2013) for the full derivation.
Historical Note: Random Matrix Theory Enters Wireless Communications
1996-2013The application of random matrix theory (RMT) to wireless communications began with the landmark papers of Telatar (1999) and Foschini (1996), who showed that MIMO capacity scales linearly with the minimum of and . The Marchenko-Pastur law, originally developed in the context of nuclear physics (1967), became an essential tool for analyzing large MIMO systems.
Hoydis, ten Brink, and Debbah (2013) brought RMT-based analysis to massive MIMO, deriving deterministic equivalents for the SINR under MMSE processing. Their results showed that RMT predictions are accurate even for modest system sizes (, ), vindicating the large-system approach for practical 5G design.
Historical Note: From Interference Cancellation to Zero Forcing
1990s-2010sZero-forcing detectors have a long history in communications, dating back to the equalization of inter-symbol interference in single-antenna channels. The extension to multi-user MIMO was developed in the early 2000s as part of the BLAST architecture at Bell Labs. The key realization for massive MIMO was that ZF β previously considered impractical due to the matrix inversion β becomes nearly optimal and computationally feasible when , because the Gram matrix converges to a well-conditioned diagonal matrix.
MRC vs. ZF vs. MMSE Combining
| Property | MRC | ZF | MMSE |
|---|---|---|---|
| Combining vector | |||
| Array gain | (large system) | ||
| Interference handling | None (interference-limited) | Fully eliminated (estimated part) | Optimally balanced |
| Denominator noise | Implicitly via Stieltjes transform | ||
| Complexity per symbol | |||
| Best regime | All regimes |
ZF and MMSE SINR vs. Number of Antennas
Compare ZF and MMSE achievable rates as a function of . Notice how ZF loses degrees of freedom (the curve starts at ) while MMSE gracefully handles the transition.
Parameters
Example: When Does ZF Outperform MRC?
For equal power, equal path loss, and perfect estimation (), find the condition on , , and under which .
Simplify with perfect estimation
With for all users:
Find the crossover
iff
Simplifying:
So ZF outperforms MRC whenever , which is almost always satisfied in practice (e.g., , gives the threshold at dB).
Sum Rate Comparison: MRC vs. ZF vs. MMSE
Compare the sum spectral efficiency of all three combining schemes as a function of the number of users for a fixed . MRC saturates at high while ZF and MMSE continue to grow.
Parameters
Common Mistake: Forgetting the Degrees-of-Freedom Loss in ZF
Mistake:
Writing the ZF SINR with array gain instead of . This overestimates the ZF rate, especially when is not negligible compared to .
Correction:
ZF projects onto the -dimensional orthogonal complement of the other users' channels. The effective array gain is , not . Always use the correct formula:
Regularized Zero Forcing (RZF)
A linear combining/precoding scheme that adds a regularization (Tikhonov) term to the ZF pseudo-inverse: . The optimal regularization equals , recovering the MMSE combiner. Also called MMSE combining.
Related: Effective SINR
Inverse Wishart Distribution
If has i.i.d. entries with , then follows an inverse Wishart distribution. The diagonal entries have mean , which determines the ZF combining norm and hence the degrees-of-freedom loss.
Related: Regularized Zero Forcing (RZF)
Quick Check
With perfect channel estimation ( for all ), what does the ZF denominator reduce to?
only
Zero β ZF achieves infinite rate with perfect CSI
With , the estimation error variance for all users, so the denominator is just . ZF with perfect CSI is noise-limited, not interference-limited.
Key Takeaway
ZF combining eliminates inter-user interference at the cost of degrees of freedom, yielding an effective array gain of . MMSE combining optimally balances interference suppression and noise enhancement. Both achieve strictly higher rates than MRC whenever . The gap between ZF and MMSE is small when .