Colored Gaussian Noise and Pre-Whitening
Real Noise Is Colored
The AWGN assumption is an idealisation. Amplifier noise is flat over receiver bandwidth but coloured by filter frequency responses; antenna arrays see co-channel interference with strong spatial structure; radar targets sit in clutter with long correlation times. Whenever the noise covariance is not a scaled identity, the correlator of Section 2.1 ceases to be optimal. A single linear change of coordinates --- pre-whitening --- restores the AWGN machinery.
Definition: Detection in Colored Gaussian Noise
Detection in Colored Gaussian Noise
Let be a positive-definite covariance matrix. Consider
with . This is the known-signal detection problem in colored Gaussian noise.
Definition: Whitening Transform
Whitening Transform
Let be the Cholesky factorisation. The linear map
is called the whitening transform. Applied to the colored-noise vector , it yields .
Any square factor with (e.g., from the eigendecomposition) whitens equally well. Cholesky is the canonical triangular choice.
Definition: Mahalanobis Norm
Mahalanobis Norm
For positive-definite , the Mahalanobis norm of a vector is
and the Mahalanobis inner product of is .
The Mahalanobis norm is the Euclidean norm in the whitened coordinate system. Geometrically, the locus is the ellipsoid with principal axes aligned with the eigenvectors of .
Theorem: LRT in Colored Gaussian Noise
For the detection problem in Definition DDetection in Colored Gaussian Noise, the log-likelihood ratio is
The Neyman--Pearson test is equivalent to thresholding the pre-whitened correlator
and its deflection coefficient is the squared Mahalanobis norm
The inverse covariance assigns low weight to directions where the noise is strong and high weight to directions where the noise is weak --- a coloured-noise generalisation of inverse-variance weighting.
Write the two multivariate Gaussian densities.
\boldsymbol{\mu}_0 = \mathbf{0}\boldsymbol{\mu}_1 = \mathbf{s}$.
Log-ratio.
The normalising constants cancel:
Expand and simplify.
$
Deflection coefficient.
Under , is Gaussian with mean and variance . Under , has the same variance but mean . Therefore
Theorem: Whitening Yields an Equivalent AWGN Problem
Let , where . Then the transformed problem
has , and the AWGN correlator on the whitened data coincides with the Mahalanobis correlator on the original data.
Distribution of $\widetilde{\mathbf{w}}$.
, so
AWGN statistic on whitened data.
The unit-noise AWGN correlator is .
Unwind the transform.
$
Conclusion.
Detection in colored noise and detection in white noise after a pre-whitening filter give numerically identical decisions. The pre-whitened matched filter is therefore the Neyman--Pearson test for the colored-noise case.
Example: AR(1) Noise: Deflection Loss
Two observations are corrupted by noise with covariance and the signal is . Compute the deflection coefficient of (i) the naive AWGN correlator that ignores correlation and (ii) the optimal Mahalanobis correlator, and compare as a function of .
Invert $\mathbf{C}_w$.
$
Optimal deflection.
$
Naive AWGN correlator deflection.
The mismatched statistic is . . Its variance under is . Hence
Observation.
For the two coincide --- . The signal happens to lie along the principal eigenvector of , so whitening changes only its magnitude, not its direction. The reader should verify that for the mismatched detector instead achieves deflection , an improvement over the case: the noise is anti-correlated with the difference signal, actively helping detection.
Example: The Mahalanobis Ellipsoid
Let . Describe the Mahalanobis unit sphere .
Compute $\mathbf{C}^{-1}$.
, so
Write the defining equation.
expands to
Diagonalise.
Eigenvalues of are , giving principal-axis lengths and . The Mahalanobis unit sphere is an ellipse with these semi-axis lengths.
Pre-Whitening a Colored Noise Process
Empirical PSD of an AR(1) noise process before and after pre-whitening by . The whitened process has an approximately flat PSD, matching the ideal white reference.
Parameters
Pre-Whitened Matched-Filter Detector
Complexity: Cholesky, per detection once is precomputed.In practice, is estimated from secondary data and is reused across many observations. When is Toeplitz (stationary noise), Levinson--Durbin recursion replaces Cholesky with cost.
Common Mistake: Do Not Whiten the Signal Twice
Mistake:
"To get the whitened correlator, I compute and and form ."
Correction:
Either (a) whiten both vectors and form the ordinary inner product , or (b) leave both vectors in the original coordinates and compute the Mahalanobis inner product . Do not mix the two, or you will end up with --- a different statistic with reduced deflection.
Common Mistake: Ill-Conditioned Covariance Matrices
Mistake:
"I inverted and the detection probability exploded --- my optimal detector is clearly broken."
Correction:
A near-singular indicates that some direction in observation space carries essentially no noise. The optimal detector exploits this (and, on paper, achieves arbitrary deflection) but is catastrophically sensitive to modelling error: tiny perturbations in move the null space around. Diagonal loading is the standard remedy; it shrinks the deflection but bounds the sensitivity.
Quick Check
The Mahalanobis squared distance equals:
Correct: the squared Euclidean norm of the whitened vector .
White vs. Colored Gaussian Detection
| Quantity | White () | Colored () |
|---|---|---|
| Test statistic | ||
| Deflection | ||
| Geometry | Sphere | Ellipsoid |
| Pre-processing | None | Cholesky |
| Complexity (per detection) | after setup |
Historical Note: Mahalanobis and the Whitening Transform
1930s--1950sPrasanta Chandra Mahalanobis introduced the distance that bears his name in a 1936 paper in the Proceedings of the National Institute of Sciences of India, motivated by the anthropometric classification of Indian tribal groups. His insight --- that one should measure distance relative to the covariance of the background population --- anticipated the engineering whitening transform by two decades. The connection to Gaussian detection was made rigorous in the statistical signal processing literature of the 1950s, notably by Kailath's work on innovations representations.
Space-Time Adaptive Processing (STAP)
Airborne radars observe ground clutter whose Doppler spectrum is a linear function of antenna angle --- clutter lives on a ridge in the angle-Doppler plane. A STAP processor estimates the clutter-plus-noise covariance from secondary range cells and then applies the pre-whitened matched filter for each Doppler--angle hypothesis. The approach can yield 30+ dB of clutter suppression relative to a per-channel matched filter. The price: if the covariance matrix has dimension (pulses antennas), the required secondary-data support scales as (Reed--Mallett--Brennan rule).
- •
Secondary-data homogeneity assumption
- •
Clutter-Doppler aliasing when PRF < clutter bandwidth
- •
Computational budget: per CPI
Why This Matters: Interference Whitening in MIMO Receivers
A 5G NR MIMO receiver sees a colored noise-plus-interference vector at its antennas. The LMMSE-IRC (interference rejection combining) receiver estimates from demodulation reference signals and applies exactly the Mahalanobis detector of Theorem TLRT in Colored Gaussian Noise to each resource element. The spatial covariance structure makes inter-cell interference predictable and defeats it with the same mathematics introduced in 1950s radar.
Whitening transform
A linear map that turns a coloured Gaussian vector with covariance into an uncorrelated Gaussian vector with identity covariance. Canonically where .
Related: Cholesky factorisation, Mahalanobis distance
Mahalanobis distance
The distance between two points measured in the metric induced by . Equal to the Euclidean distance between their whitened images.
Related: Whitening Transform, Covariance matrix