Discrete-Time PSD

The Discrete-Time Setting

In practice, we always work with sampled data. The discrete-time PSD is the workhorse of digital signal processing and communications: given a sequence of samples, how is the power distributed across the normalized frequency range [βˆ’12,12][-\frac{1}{2}, \frac{1}{2}]? The Wiener-Khinchin theorem carries over to discrete time β€” the PSD is the DTFT of the autocorrelation β€” and the periodogram provides a finite-data estimate that connects theory to measurement.

Definition:

Power Spectral Density (Discrete-Time)

Let XnX_n be a random process. Define the truncated DFT XΛ‡N(f)=βˆ‘n=βˆ’NNXn eβˆ’j2Ο€fn\check{X}_N(f) = \sum_{n=-N}^{N} X_n\,e^{-j2\pi fn}. The power spectral density is

Px(f)=lim⁑Nβ†’βˆž12N+1 E ⁣[∣XΛ‡N(f)∣2],f∈[βˆ’12,12].P_x(f) = \lim_{N \to \infty} \frac{1}{2N+1}\,\mathbb{E}\!\left[|\check{X}_N(f)|^2\right], \qquad f \in [-\tfrac{1}{2},\tfrac{1}{2}].

The total average power is Px=βˆ«βˆ’1/21/2Px(f) df\mathcal{P}_x = \int_{-1/2}^{1/2} P_x(f)\,df.

Theorem: Wiener-Khinchin Theorem (Discrete-Time)

Let XnX_n be a WSS sequence with absolutely summable autocorrelation rxx[m]r_{xx}[m], i.e., βˆ‘m=βˆ’βˆžβˆžβˆ£rxx[m]∣<∞\sum_{m=-\infty}^{\infty} |r_{xx}[m]| < \infty. Then

Px(f)=βˆ‘m=βˆ’βˆžβˆžrxx[m] eβˆ’j2Ο€fm,f∈[βˆ’12,12].\boxed{P_x(f) = \sum_{m=-\infty}^{\infty} r_{xx}[m]\,e^{-j2\pi fm}, \qquad f \in [-\tfrac{1}{2}, \tfrac{1}{2}].}

The inverse relation is

rxx[m]=βˆ«βˆ’1/21/2Px(f) ej2Ο€fm df.r_{xx}[m] = \int_{-1/2}^{1/2} P_x(f)\,e^{j2\pi fm}\,df.

The proof follows the same pattern as the CT case: expand the periodogram in terms of the autocorrelation, and the FejΓ©r-type window (1βˆ’βˆ£m∣/(2N+1))(1 - |m|/(2N+1)) converges to 11 as Nβ†’βˆžN \to \infty under absolute summability.

,

Theorem: Properties of the Discrete-Time PSD

The DT PSD satisfies:

  1. Periodic: Px(f+1)=Px(f)P_x(f + 1) = P_x(f) (period 1 in ff).
  2. Non-negative: Px(f)β‰₯0P_x(f) \geq 0 for all ff.
  3. Real: Px(f)∈RP_x(f) \in \mathbb{R}.
  4. Even for real processes: Px(f)=Px(βˆ’f)P_x(f) = P_x(-f).
  5. Power: rxx[0]=βˆ«βˆ’1/21/2Px(f) dfr_{xx}[0] = \int_{-1/2}^{1/2} P_x(f)\,df.

Periodicity comes from the DTFT. The remaining properties mirror the CT case and follow from the positive semi-definiteness and Hermitian symmetry of rxx[m]r_{xx}[m].

Theorem: PSD of Non-WSS Processes (Discrete-Time)

For a DT process with autocorrelation rxx[n,m]r_{xx}[n, m] (not necessarily WSS):

Px(f)=βˆ‘m=βˆ’βˆžβˆžrΛ‰xx[m] eβˆ’j2Ο€fm,P_x(f) = \sum_{m=-\infty}^{\infty} \bar{r}_{xx}[m]\,e^{-j2\pi fm},

where rΛ‰xx[m]=lim⁑Nβ†’βˆž12N+1βˆ‘n=βˆ’NNrxx[n,nβˆ’m]\bar{r}_{xx}[m] = \lim_{N \to \infty}\frac{1}{2N+1}\sum_{n=-N}^{N} r_{xx}[n, n-m] is the time-averaged autocorrelation.

This is the DT version of Theorem 43. Even when the autocorrelation depends on both time indices, time-averaging extracts a function of the lag alone, to which the Wiener-Khinchin theorem applies.

Theorem: Corollary: PSD of WSC Processes (Discrete-Time)

If XnX_n is wide-sense cyclostationary with period TT, then

rΛ‰xx[m]=1Tβˆ‘n=0Tβˆ’1rxx[n,nβˆ’m].\bar{r}_{xx}[m] = \frac{1}{T}\sum_{n=0}^{T-1} r_{xx}[n, n-m].

The periodicity allows replacing the infinite CesΓ ro average with a finite average over one period.

Example: PSD of an AR(1) Process

Consider the AR(1) process Xn=aXnβˆ’1+WnX_n = a X_{n-1} + W_n where ∣a∣<1|a| < 1 and WnW_n is white noise with variance Οƒ2\sigma^2. Find Px(f)P_x(f).

Definition:

The Periodogram

Given NN samples X0,X1,…,XNβˆ’1X_0, X_1, \ldots, X_{N-1} of a WSS process, the periodogram is the estimator

Px^(f)=1Nβˆ£βˆ‘n=0Nβˆ’1Xn eβˆ’j2Ο€fn∣2.\hat{P_x}(f) = \frac{1}{N}\left|\sum_{n=0}^{N-1} X_n\,e^{-j2\pi fn}\right|^2.

The periodogram is a natural finite-data approximation to the PSD: it replaces the infinite sum and the expectation in the definition with a finite sum and a single realization.

The periodogram is an asymptotically unbiased estimator of Px(f)P_x(f), but it is not consistent: its variance does not decrease as Nβ†’βˆžN \to \infty. Averaging multiple periodograms (Bartlett's method) or windowing (Welch's method) is needed for consistent estimation.

,

Periodogram of a WSS Sequence

Generate an AR(1) process and compare the periodogram (single realization) with the true PSD. Observe how the periodogram is noisy and how averaging (Bartlett's method) reduces variance.

Parameters
0.8
256
1
42

DT PSD ↔\leftrightarrow Autocorrelation

Explore the DTFT relationship between rxx[m]r_{xx}[m] and Px(f)P_x(f) for different process types.

Parameters
0.7

Example: PSD of an MA(1) Process

Let Xn=Wn+bWnβˆ’1X_n = W_n + b W_{n-1} where WnW_n is white noise with variance Οƒ2\sigma^2. Find Px(f)P_x(f).

Quick Check

The periodogram Px^(f)\hat{P_x}(f) computed from NN samples of a WSS process is:

Asymptotically unbiased but inconsistent

Consistent and unbiased

Biased and consistent

Neither unbiased nor consistent

Common Mistake: Trusting a Single Periodogram

Mistake:

Using the raw periodogram from a single data record as if it were the true PSD.

Correction:

The periodogram has high variance (approximately equal to Px(f)2P_x(f)^2) regardless of data length. To reduce variance, use averaging methods: Bartlett (segment-average), Welch (overlapping windowed segments), or multitaper methods. The bias-variance tradeoff is controlled by the number of segments.

πŸ”§Engineering Note

Welch's Method in Practice

Welch's method (1967) splits the data into overlapping segments, windows each segment, computes the periodogram of each, and averages. With KK segments and 50% overlap, the variance is reduced by approximately a factor of 9K/119K/11 compared to the raw periodogram, at the cost of frequency resolution. This is the de facto standard PSD estimator in most signal processing libraries (e.g., scipy.signal.welch in Python, pwelch in MATLAB).

Historical Note: Schuster and the Periodogram

1898

The periodogram was introduced by Arthur Schuster in 1898 for detecting hidden periodicities in meteorological and geophysical data. The term "periodogram" reflects its original purpose: finding periodic components. Its statistical properties as a PSD estimator were only understood much later, when it was recognized that the periodogram is an inconsistent estimator β€” a surprising and initially disappointing result that motivated the development of averaged and windowed spectral estimators.

DT vs. CT PSD Comparison

PropertyContinuous-TimeDiscrete-Time
PSD formulaPx(f)=∫rxx(Ο„) eβˆ’j2Ο€fτ dΟ„P_x(f) = \int r_{xx}(\tau)\,e^{-j2\pi f\tau}\,d\tauPx(f)=βˆ‘mrxx[m] eβˆ’j2Ο€fmP_x(f) = \sum_m r_{xx}[m]\,e^{-j2\pi fm}
Frequency rangef∈(βˆ’βˆž,∞)f \in (-\infty, \infty)f∈[βˆ’1/2,1/2]f \in [-1/2, 1/2] (periodic)
Powerβˆ«βˆ’βˆžβˆžPx(f) df\int_{-\infty}^{\infty} P_x(f)\,dfβˆ«βˆ’1/21/2Px(f) df\int_{-1/2}^{1/2} P_x(f)\,df
White noise PSDN0/2N_0/2 (infinite power)Οƒ2\sigma^2 (finite power)
Periodogram1T∣XΛ‡T(f)∣2\frac{1}{T}|\check{X}_T(f)|^21Nβˆ£βˆ‘nXneβˆ’j2Ο€fn∣2\frac{1}{N}|\sum_n X_n e^{-j2\pi fn}|^2

Periodogram

The estimator Px^(f)=1Nβˆ£βˆ‘nXneβˆ’j2Ο€fn∣2\hat{P_x}(f) = \frac{1}{N}|\sum_n X_n e^{-j2\pi fn}|^2. Asymptotically unbiased but inconsistent (variance does not decrease with NN).

Related: {{Ref:Gloss Psd}}

Autocorrelation (WSS)

rxx[m]=E[XnXnβˆ’mβˆ—]r_{xx}[m] = \mathbb{E}[X_n X_{n-m}^*] (DT) or rxx(Ο„)=E[X(t+Ο„)Xβˆ—(t)]r_{xx}(\tau) = \mathbb{E}[X(t+\tau)X^*(t)] (CT). Depends only on the lag for WSS processes. Forms a Fourier pair with the PSD.

Key Takeaway

The DT Wiener-Khinchin theorem mirrors the CT version: Px(f)P_x(f) is the DTFT of rxx[m]r_{xx}[m], periodic in ff with period 1. The periodogram is the natural finite-data estimator but requires averaging (Bartlett, Welch) for reliable spectral estimation.