Spectral Factorization and Innovations
Why Spectral Factorization?
The non-causal Wiener filter was easy because convolution over all of turned into multiplication in the frequency domain. The causal Wiener filter is harder because the Wiener-Hopf equation now holds only for , and a half-axis convolution equation has no one-line Fourier solution.
The trick β due to Wiener and Hopf themselves, refined by Kolmogorov β is to whiten the observation first. If we can write as a causal LTI image of a white process , then causal estimation from becomes causal estimation from , and because is uncorrelated across time, causal estimation from is trivial: just project on each independently. Spectral factorization is the tool that performs this whitening.
Definition: Paley-Wiener Condition
Paley-Wiener Condition
A WSS process with PSD satisfies the Paley-Wiener condition if This is the integrability condition that must satisfy for the causal factor to exist. It fails, for example, if vanishes on an interval of positive measure.
Paley-Wiener is the condition that separates processes that can be generated as causal LTI outputs of a white noise driver (and hence causally predicted to positive accuracy) from those that cannot. Band-limited processes, for instance, are not Paley-Wiener.
Theorem: Spectral Factorization
Let be WSS with PSD satisfying the Paley-Wiener condition. Then there exist functions and such that where (i) is causal: its inverse DTFT is supported on ; (ii) is anti-causal: its inverse DTFT is supported on ; (iii) is also causal, and is also anti-causal. We call the minimum-phase (or causal, or spectral) factor.
Think of as the frequency response of a stable, causal, minimum-phase filter whose squared magnitude is . Because both the filter and its inverse are causal and stable, passing the observation through whitens it without losing any causal information: the filtering is invertible in real time.
Define the log-spectrum
Let . By Paley-Wiener, . Expand in Fourier series: , with . Since is real and positive, is real, so .
Split into causal and anti-causal halves
Write where (causal, strictly) and (anti-causal, strictly). Note by the conjugate symmetry of .
Exponentiate and identify the factors
Define and . Then .
Verify causality of $P_y^+$
is a sum of with only. Its exponential is a power series in β each term is a polynomial in (with non-negative exponents). The Fourier coefficients therefore vanish for , establishing causality. The same argument on shows is causal as well.
Example: Spectral Factorization of the AR(1)+Noise PSD
For the observation with an AR(1) process of coefficient and innovation variance , and white of variance , find the spectral factor explicitly.
Write the PSD as a ratio of trig polynomials
Expanding: . The numerator becomes with and .
Match a minimum-phase numerator
Write the numerator as and match: , . The latter gives ; substituting into the former yields a quadratic in : . The two roots are reciprocals; pick the one with .
Write the spectral factor
Both the zero at and the pole at lie strictly inside the unit circle, so and are both causal and stable.
Definition: Innovations Process
Innovations Process
Let be WSS with Paley-Wiener-positive PSD and let be its minimum-phase spectral factor. The innovations process is the output of passing through the whitening filter : The innovations satisfy (i) , (ii) (white, unit variance), and (iii) (causal information equivalence).
Because is causal and invertible, can be computed from in real time, and vice versa. The innovations are the "fresh news" contained in each new observation β the part that could not be predicted from the past. This is the discrete-time analog of the continuous-time innovations representation of Wold (1938).
Pole-Zero Map of the Spectral Factors
For AR(1)+noise, plot the poles and zeros of (inside the unit circle: minimum phase) and (outside: reciprocal partners). The reciprocal symmetry is the signature of spectral factorization. Vary and SNR to see the zero move.
Parameters
Spectral Factorization and the Causal Cone
Historical Note: Wiener at MIT During the War
1940-1945Norbert Wiener (1894-1964) derived what we now call the Wiener filter in 1942 as part of a classified report to the National Defense Research Committee, titled "Extrapolation, Interpolation, and Smoothing of Stationary Time Series." The motivating problem was anti-aircraft fire control: given a noisy radar track, predict where the aircraft would be when the shell arrived. Wiener's report circulated only among insiders β engineers called it "the yellow peril" because of its difficulty and its yellow binding β and was not openly published until 1949.
The filter was ahead of its time in several ways. It was one of the first systematic uses of second-order statistics in signal processing, and it introduced the spectral factorization machinery that would later become a central tool in control theory, filter design, and spectral estimation. Wiener, already famous for his work on Brownian motion and cybernetics, characteristically wrote the report in a dense style that required engineer Julian Bigelow to produce an accessible companion document for practitioners.
Historical Note: Kolmogorov's Independent Discovery
1941Independently of Wiener and a year earlier, Andrey Kolmogorov (1903-1987) published "Stationary Sequences in Hilbert Space" in the Bulletin of Moscow State University in 1941. Kolmogorov worked in the discrete-time setting (which is where we are in this chapter) and approached the problem from pure Hilbert-space geometry: the optimal linear predictor of from its past is the orthogonal projection onto the closed subspace generated by . He derived what we now call the Kolmogorov-Szego formula for the one-step prediction variance, , a result of remarkable elegance.
Kolmogorov's work reached the West only after the war. For a period in the 1950s there was a gentle priority dispute, but the dust settled on calling the continuous-time smoothing filter "Wiener" and the discrete-time predictor "Kolmogorov" or "Wiener-Kolmogorov." The fact that the same structure was discovered twice, on opposite sides of a world war, from a radar problem and from pure probability, is characteristic of deep mathematical ideas.
Common Mistake: Spectral Factorization Is Not Unique Without Causality
Mistake:
Writing for any and calling the spectral factor.
Correction:
There are infinitely many ways to factor as a squared magnitude: multiply any such by an all-pass filter with and you get another factor. The unique choice is the minimum-phase factor , which is the one whose inverse is also causal and stable. Causality pins down the factor up to a constant scalar.
Innovations
The white process obtained by passing an observation through the causal whitening filter . The innovation represents the new information in beyond what could be predicted from .
Minimum-Phase Spectral Factor
The function in the factorization whose inverse DTFT is supported on (causal) and whose reciprocal is also causal. For rational PSDs it corresponds to placing all poles and zeros inside the unit circle.
Related: Innovations, Paley-Wiener Condition
Paley-Wiener Condition
The requirement , ensuring that the PSD does not vanish on a set of positive measure. It is the necessary and sufficient condition for spectral factorization to exist and for the process to admit a causal white-noise representation.
Related: Minimum-Phase Spectral Factor
Whitening Filter
A causal filter whose output has unit-variance white autocorrelation when driven by a given colored input. For WSS the whitening filter has frequency response .
Related: Innovations