Discrete-Time PSD
The Discrete-Time Setting
In practice, we always work with sampled data. The discrete-time PSD is the workhorse of digital signal processing and communications: given a sequence of samples, how is the power distributed across the normalized frequency range ? The Wiener-Khinchin theorem carries over to discrete time β the PSD is the DTFT of the autocorrelation β and the periodogram provides a finite-data estimate that connects theory to measurement.
Definition: Power Spectral Density (Discrete-Time)
Power Spectral Density (Discrete-Time)
Let be a random process. Define the truncated DFT . The power spectral density is
The total average power is .
Theorem: Wiener-Khinchin Theorem (Discrete-Time)
Let be a WSS sequence with absolutely summable autocorrelation , i.e., . Then
The inverse relation is
The proof follows the same pattern as the CT case: expand the periodogram in terms of the autocorrelation, and the FejΓ©r-type window converges to as under absolute summability.
Expand the periodogram
\mathbb{E}[X_n X_m^*] = r_{xx}[n-m]\ell = n - m(2N+1 - |\ell|)(n,m) \in {-N,\ldots,N}^2n - m = \ell$.
Take the limit
As , for each fixed . By absolute summability of and dominated convergence:
Theorem: Properties of the Discrete-Time PSD
The DT PSD satisfies:
- Periodic: (period 1 in ).
- Non-negative: for all .
- Real: .
- Even for real processes: .
- Power: .
Periodicity comes from the DTFT. The remaining properties mirror the CT case and follow from the positive semi-definiteness and Hermitian symmetry of .
Periodicity
since . So .
Non-negativity and reality
Same argument as the CT case: is positive semi-definite, so its DTFT is non-negative and real.
Theorem: PSD of Non-WSS Processes (Discrete-Time)
For a DT process with autocorrelation (not necessarily WSS):
where is the time-averaged autocorrelation.
This is the DT version of Theorem 43. Even when the autocorrelation depends on both time indices, time-averaging extracts a function of the lag alone, to which the Wiener-Khinchin theorem applies.
Expand and average
Write using and change the summation variable to the lag . The coefficient of is . As this converges to .
Theorem: Corollary: PSD of WSC Processes (Discrete-Time)
If is wide-sense cyclostationary with period , then
The periodicity allows replacing the infinite CesΓ ro average with a finite average over one period.
Finite average suffices
Since is periodic in with period , the CesΓ ro limit reduces to the average over one period: .
Example: PSD of an AR(1) Process
Consider the AR(1) process where and is white noise with variance . Find .
Autocorrelation
We know .
DTFT
$
Verification
Alternatively, is the output of the filter driven by white noise . So , which matches.
Definition: The Periodogram
The Periodogram
Given samples of a WSS process, the periodogram is the estimator
The periodogram is a natural finite-data approximation to the PSD: it replaces the infinite sum and the expectation in the definition with a finite sum and a single realization.
The periodogram is an asymptotically unbiased estimator of , but it is not consistent: its variance does not decrease as . Averaging multiple periodograms (Bartlett's method) or windowing (Welch's method) is needed for consistent estimation.
Periodogram of a WSS Sequence
Generate an AR(1) process and compare the periodogram (single realization) with the true PSD. Observe how the periodogram is noisy and how averaging (Bartlett's method) reduces variance.
Parameters
DT PSD Autocorrelation
Explore the DTFT relationship between and for different process types.
Parameters
Example: PSD of an MA(1) Process
Let where is white noise with variance . Find .
Autocorrelation
, , for .
DTFT
$
Interpretation
This is also where is the MA filter. For the PSD peaks at (low-pass); for it peaks at (high-pass).
Quick Check
The periodogram computed from samples of a WSS process is:
Asymptotically unbiased but inconsistent
Consistent and unbiased
Biased and consistent
Neither unbiased nor consistent
as , but does not vanish. The variance stays approximately regardless of .
Common Mistake: Trusting a Single Periodogram
Mistake:
Using the raw periodogram from a single data record as if it were the true PSD.
Correction:
The periodogram has high variance (approximately equal to ) regardless of data length. To reduce variance, use averaging methods: Bartlett (segment-average), Welch (overlapping windowed segments), or multitaper methods. The bias-variance tradeoff is controlled by the number of segments.
Welch's Method in Practice
Welch's method (1967) splits the data into overlapping segments,
windows each segment, computes the periodogram of each, and averages.
With segments and 50% overlap, the variance is reduced by
approximately a factor of compared to the raw periodogram,
at the cost of frequency resolution. This is the de facto standard
PSD estimator in most signal processing libraries (e.g.,
scipy.signal.welch in Python, pwelch in MATLAB).
Historical Note: Schuster and the Periodogram
1898The periodogram was introduced by Arthur Schuster in 1898 for detecting hidden periodicities in meteorological and geophysical data. The term "periodogram" reflects its original purpose: finding periodic components. Its statistical properties as a PSD estimator were only understood much later, when it was recognized that the periodogram is an inconsistent estimator β a surprising and initially disappointing result that motivated the development of averaged and windowed spectral estimators.
DT vs. CT PSD Comparison
| Property | Continuous-Time | Discrete-Time |
|---|---|---|
| PSD formula | ||
| Frequency range | (periodic) | |
| Power | ||
| White noise PSD | (infinite power) | (finite power) |
| Periodogram |
Periodogram
The estimator . Asymptotically unbiased but inconsistent (variance does not decrease with ).
Related: {{Ref:Gloss Psd}}
Autocorrelation (WSS)
(DT) or (CT). Depends only on the lag for WSS processes. Forms a Fourier pair with the PSD.
Key Takeaway
The DT Wiener-Khinchin theorem mirrors the CT version: is the DTFT of , periodic in with period 1. The periodogram is the natural finite-data estimator but requires averaging (Bartlett, Welch) for reliable spectral estimation.