Low-Complexity Approximations

When Matrix Inversion Is Too Expensive

The MMSE receiver requires inverting the KΓ—KK \times K matrix HHH+Οƒ2PI\mathbf{H}^{H} \mathbf{H} + \frac{\sigma^2}{P}\mathbf{I} at a cost of O(K3)\mathcal{O}(K^{3}). For massive MIMO with K=16K = 16, this is fast. But in emerging scenarios β€” cell-free massive MIMO with hundreds of users, or millimeter-wave systems with hybrid beamforming β€” the matrix dimension can grow to the point where exact inversion becomes a bottleneck.

This section presents three strategies to approximate the MMSE filter without explicit matrix inversion: the Neumann series, local MMSE (for distributed systems), and iterative (conjugate gradient) methods.

Definition:

Neumann Series Approximation

For a matrix A\mathbf{A} with spectral radius less than 1, the Neumann series gives

Aβˆ’1=βˆ‘n=0∞(Iβˆ’A)n.\mathbf{A}^{-1} = \sum_{n=0}^{\infty} (\mathbf{I} - \mathbf{A})^n.

Applied to the regularized Gram matrix A=HHH+Οƒ2PI\mathbf{A} = \mathbf{H}^{H} \mathbf{H} + \frac{\sigma^2}{P}\mathbf{I}, we decompose A=D+E\mathbf{A} = \mathbf{D} + \mathbf{E} where D\mathbf{D} is the diagonal part and E\mathbf{E} is the off-diagonal. Then

Aβˆ’1=Dβˆ’1βˆ‘n=0∞(βˆ’Dβˆ’1E)nβ‰ˆDβˆ’1βˆ‘n=0Lβˆ’1(βˆ’Dβˆ’1E)n,\mathbf{A}^{-1} = \mathbf{D}^{-1} \sum_{n=0}^{\infty} (-\mathbf{D}^{-1}\mathbf{E})^n \approx \mathbf{D}^{-1} \sum_{n=0}^{L-1} (-\mathbf{D}^{-1}\mathbf{E})^n,

where LL is the truncation order. The LL-th order approximation has complexity O(LK2)\mathcal{O}(L K^{2}) β€” avoiding the cubic cost of exact inversion.

The convergence rate depends on the spectral radius of Dβˆ’1E\mathbf{D}^{-1}\mathbf{E}, which is small when the Gram matrix is diagonally dominant. In the massive MIMO regime with favorable propagation, diagonal dominance holds and even L=2L = 2 or L=3L = 3 gives near-optimal performance.

Theorem: Neumann Series Convergence for Massive MIMO

Let A=HHH+Ξ±I\mathbf{A} = \mathbf{H}^{H} \mathbf{H} + \alpha \mathbf{I} with Ξ±=Οƒ2/P>0\alpha = \sigma^2/P > 0 and i.i.d. Rayleigh fading. As Nt/Kβ†’βˆžN_t/K \to \infty, the spectral radius of Dβˆ’1E\mathbf{D}^{-1}\mathbf{E} satisfies

ρ(Dβˆ’1E)β†’0,\rho(\mathbf{D}^{-1}\mathbf{E}) \to 0,

and the LL-th order Neumann approximation satisfies

βˆ₯Aβˆ’1βˆ’A^Lβˆ’1βˆ₯=O((KNt)L).\|\mathbf{A}^{-1} - \hat{\mathbf{A}}^{-1}_{L}\| = \mathcal{O}\left(\left(\frac{K}{N_t}\right)^L\right).

Favorable propagation makes the Gram matrix approximately diagonal, so the off-diagonal perturbation E\mathbf{E} is small relative to D\mathbf{D}. More antennas relative to users means faster convergence.

Example: Second-Order Neumann Approximation

For Nt=128N_t = 128, K=8K = 8, SNR=10\text{SNR} = 10 dB with i.i.d. Rayleigh fading, compute the average SINR loss of the 2nd-order Neumann approximation relative to exact MMSE.

Neumann Series Convergence vs. Truncation Order

Explore how the Neumann series approximation converges to the exact MMSE SINR as the truncation order LL increases. Observe that convergence is faster for larger Nt/KN_t/K ratios.

Parameters
64
8
10
5

Local MMSE for Distributed Antenna Systems

In cell-free massive MIMO (MIMO Ch. 11–16), the BS antennas are distributed across many access points (APs), each with a local processor and limited fronthaul capacity. The local MMSE approach partitions the antenna array into groups and applies MMSE within each group independently:

x^k(g)=(gk(g))Hy(g),\hat{x}_k^{(g)} = (\mathbf{g}_k^{(g)})^H \mathbf{y}^{(g)},

where y(g)\mathbf{y}^{(g)} and gk(g)\mathbf{g}_k^{(g)} are the received signal and combining vector at AP group gg. The final estimate is a weighted combination:

x^k=βˆ‘g=1Gwk,gx^k(g),\hat{x}_k = \sum_{g=1}^{G} w_{k,g} \hat{x}_k^{(g)},

where wk,gw_{k,g} are large-scale fading decoding (LSFD) weights.

Local MMSE avoids centralizing all received signals and requires only local CSI at each AP. The performance gap to centralized MMSE depends on the spatial diversity of the distributed array.

Conjugate Gradient Method

The MMSE detection problem (HHH+Οƒ2PI)x^=HHy(\mathbf{H}^{H} \mathbf{H} + \frac{\sigma^2}{P}\mathbf{I})\hat{\mathbf{x}} = \mathbf{H}^{H} \mathbf{y} is a positive definite linear system that can be solved iteratively by the conjugate gradient (CG) method. CG converges in at most KK iterations (exact solution), but in practice 5–10 iterations suffice because the condition number of the regularized Gram matrix is small in massive MIMO.

Each CG iteration costs O(K2)\mathcal{O}(K^{2}) (one matrix-vector multiply with the Gram matrix), giving a total cost of O(LK2)\mathcal{O}(L K^{2}) for LL iterations β€” the same scaling as the Neumann series.

Common Mistake: Neumann Series Diverges When the Antenna-to-User Ratio Is Small

Mistake:

Applying the Neumann series with L=2L = 2 or L=3L = 3 when Nt/KN_t/K is close to 1 (e.g., Nt=16N_t = 16, K=12K = 12).

Correction:

The spectral radius ρ(Dβˆ’1E)\rho(\mathbf{D}^{-1}\mathbf{E}) can exceed 1 when Nt/KN_t/K is small, causing the Neumann series to diverge. In this regime, use exact matrix inversion or the conjugate gradient method instead. As a rule of thumb, the Neumann series is reliable for Nt/Kβ‰₯4N_t/K \geq 4.

Neumann Series

An iterative expansion Aβˆ’1=βˆ‘n=0∞(Iβˆ’A)n\mathbf{A}^{-1} = \sum_{n=0}^{\infty}(\mathbf{I} - \mathbf{A})^n that avoids explicit matrix inversion. Converges when the spectral radius of (Iβˆ’A)(\mathbf{I} - \mathbf{A}) is less than 1.

Related: Matrix Inversion, Conjugate Gradient Method

Local MMSE

An MMSE detection strategy for distributed antenna systems where each access point applies MMSE locally using only its own received signal and CSI. Results are combined centrally using LSFD weights.

Related: Cell-Free Massive MIMO, Level 3 β€” Large-Scale Fading Decoding (LSFD)