Linear Detectors: ZF and MMSE

The Linear Shortcut

If the constellation constraint is the source of the hardness, what happens if we simply ignore it? Treat x\mathbf{x} as a continuous vector, invert the channel, and then project back onto the alphabet. This gives a linear filter followed by symbol-wise slicing. The result is fast, parallelizable, and β€” as we will see β€” optimal in a well-defined continuous sense. It is also uniformly worse than ML. The tradeoff between these two extremes is the story of this chapter.

Definition:

Zero-Forcing (ZF) Detector

Assuming H\mathbf{H} has full column rank (nrβ‰₯ntn_r \geq n_t), the zero-forcing detector applies the Moore-Penrose pseudoinverse: GZF=(HHH)βˆ’1HH,x~ZF=GZFy=x+GZFw.\mathbf{G}_{\text{ZF}} = (\mathbf{H}^{H} \mathbf{H})^{-1} \mathbf{H}^{H}, \qquad \tilde{\mathbf{x}}_{\text{ZF}} = \mathbf{G}_{\text{ZF}} \mathbf{y} = \mathbf{x} + \mathbf{G}_{\text{ZF}} \mathbf{w}. The hard decision is x^ZF=QA(x~ZF)\hat{\mathbf{x}}_{\text{ZF}} = \mathcal{Q}_{\mathcal{A}}(\tilde{\mathbf{x}}_{\text{ZF}}), where QA\mathcal{Q}_{\mathcal{A}} slices each entry onto the nearest constellation point.

ZF "forces" inter-stream interference to zero at the cost of noise enhancement. If H\mathbf{H} is ill-conditioned, the noise gain on some streams can be catastrophic.

Definition:

MMSE Detector

The linear MMSE detector minimizes the mean-squared error between x\mathbf{x} and its linear estimate. Assuming x∼CN(0,I)\mathbf{x} \sim \mathcal{CN}(\mathbf{0}, \mathbf{I}) (unit-energy symbols, uncorrelated) and independent noise: GMMSE=(HHH+Οƒ2I)βˆ’1HH,x~MMSE=GMMSEy.\mathbf{G}_{\text{MMSE}} = (\mathbf{H}^{H} \mathbf{H} + \sigma^2 \mathbf{I})^{-1} \mathbf{H}^{H}, \qquad \tilde{\mathbf{x}}_{\text{MMSE}} = \mathbf{G}_{\text{MMSE}} \mathbf{y}. The hard decision is x^MMSE=QA(x~MMSE)\hat{\mathbf{x}}_{\text{MMSE}} = \mathcal{Q}_{\mathcal{A}}(\tilde{\mathbf{x}}_{\text{MMSE}}).

The regularizer Οƒ2I\sigma^2 \mathbf{I} shrinks the inverse of HHH\mathbf{H}^{H} \mathbf{H}, trading residual bias for reduced noise enhancement. As Οƒ2β†’0\sigma^2 \to 0, MMSE β†’\to ZF. As Οƒ2β†’βˆž\sigma^2 \to \infty, MMSE β†’\to matched filter HH/Οƒ2\mathbf{H}^{H} / \sigma^2.

Theorem: Post-Detection SINR of the ZF Detector

The kk-th stream of the ZF detector has post-detection SINR Ξ³kZF=1Οƒ2β‹…[(HHH)βˆ’1]kk.\gamma_k^{\text{ZF}} = \frac{1}{\sigma^2 \cdot [(\mathbf{H}^{H} \mathbf{H})^{-1}]_{kk}}.

The diagonal entry [(HHH)βˆ’1]kk[(\mathbf{H}^{H} \mathbf{H})^{-1}]_{kk} measures how much noise is amplified on the kk-th stream. For an ill-conditioned channel this entry is large, and the corresponding SINR collapses.

Theorem: MMSE Dominates ZF in Per-Stream SINR

For every channel H\mathbf{H} with nrβ‰₯ntn_r \geq n_t, every stream kk, and every noise variance Οƒ2>0\sigma^2 > 0, Ξ³kMMSEβ‰₯Ξ³kZF,\gamma_k^{\text{MMSE}} \geq \gamma_k^{\text{ZF}}, with equality only in the noise-free limit Οƒ2β†’0\sigma^2 \to 0. Explicitly, Ξ³kMMSE=1[(HHH+Οƒ2I)βˆ’1]kkβˆ’1.\gamma_k^{\text{MMSE}} = \frac{1}{[(\mathbf{H}^{H} \mathbf{H} + \sigma^2 \mathbf{I})^{-1}]_{kk}} - 1.

MMSE is the unconstrained MSE-optimal linear estimator. No other linear filter β€” including ZF β€” can produce a better SINR in the Wiener sense. This is a direct consequence of the orthogonality principle.

Key Takeaway

ZF eliminates interference at the cost of noise enhancement. MMSE balances residual interference against noise, and strictly dominates ZF in per-stream SINR. Neither matches ML at low-to-moderate SNR: both pay an SNR loss proportional to the "penalty" for treating the discrete alphabet as continuous.

A Convexity Flag

The MMSE detector is the unique solution to a strictly convex quadratic program β€” this is what guarantees the water-filling-like form (HHH+Οƒ2I)βˆ’1HH(\mathbf{H}^{H} \mathbf{H} + \sigma^2 \mathbf{I})^{-1} \mathbf{H}^{H}. The convexity is what makes the linear receiver design textbook-easy; it is also what makes it suboptimal against the combinatorial ML problem.

BER: ZF vs. MMSE vs. ML vs. MMSE-SIC

Bit error rate for different linear and nonlinear detectors on a Rayleigh i.i.d. MIMO channel. Notice how MMSE dominates ZF at all SNRs, and how MMSE-SIC closes most of the gap to ML.

Parameters
2

Post-Detection SINR Distribution (Rayleigh Channel)

Empirical CDF of the post-detection SINR for ZF and MMSE, aggregated across streams and channel realizations. The left tail (low-SINR events) is what dominates the BER.

Parameters
4
10

Example: ZF and MMSE on a 2Γ—22\times 2 Channel

Let H=[10.90.91]\mathbf{H} = \begin{bmatrix} 1 & 0.9 \\ 0.9 & 1 \end{bmatrix} (real, symmetric, highly correlated) with Οƒ2=0.1\sigma^2 = 0.1. Compute Ξ³kZF\gamma_k^{\text{ZF}} and Ξ³kMMSE\gamma_k^{\text{MMSE}} for each stream, and quantify the gap.

Historical Note: From Wiener Filters to MIMO Receivers

1949–1998

The MMSE receiver in the MIMO context is a direct descendant of the Wiener filter (1949) and Kailath's innovations approach to linear estimation (1968-1972). Its appearance as a MIMO detector in Foschini's 1998 V-BLAST paper was not a new algorithm but a recognition that the Wiener form, instantiated on the per-user-per-antenna structure, solves the linear front-end design optimally. The MMSE-SIC combination (next section) is the MIMO counterpart of decision-feedback equalization.

Common Mistake: Never Invert When You Can Solve

Mistake:

One computes GZF=(HHH)βˆ’1HH\mathbf{G}_{\text{ZF}} = (\mathbf{H}^{H} \mathbf{H})^{-1} \mathbf{H}^{H} by explicit matrix inversion.

Correction:

In practice, never form the inverse. Compute a QR factorization of H\mathbf{H}, then solve the triangular system. The inversion form has condition number ΞΊ(HHH)=ΞΊ(H)2\kappa(\mathbf{H}^{H}\mathbf{H}) = \kappa(\mathbf{H})^2, doubling the loss of significant digits. The QR-based solve has condition number only ΞΊ(H)\kappa(\mathbf{H}) and is numerically far more robust, especially when columns of H\mathbf{H} are near-collinear.

Zero-Forcing Detector

A linear detector that applies the Moore-Penrose pseudoinverse of the channel, eliminating inter-stream interference at the cost of noise enhancement inversely proportional to the channel's singular values.

Related: MMSE Detector, Pseudoinverse

MMSE Detector

The linear detector that minimizes the mean-squared error between the transmitted symbol vector and its filtered estimate. Equivalent to regularized least squares with regularization parameter Οƒ2\sigma^2.

Related: Zero-Forcing (ZF) Detector, LMMSE Detector SINR (Finite nrn_r), Wiener Filter

Quick Check

Which statement about linear MIMO detectors is TRUE for Οƒ2>0\sigma^2 > 0?

ZF achieves lower BER than MMSE at high SNR.

MMSE and ZF coincide as Οƒ2β†’0\sigma^2 \to 0.

MMSE eliminates all inter-stream interference.

ZF noise enhancement is independent of the channel.