Exercises
ex-ch09-01
Easy. Let $d[n]$ and $x[n]$ be jointly WSS with cross-correlation $r_{dx}[k]=\mathbb{E}\{d[n]x^*[n-k]\}$ and observation auto-correlation $r_x[k]=\mathbb{E}\{x[n]x^*[n-k]\}$. Starting from the orthogonality principle, derive the Wiener-Hopf equation satisfied by the non-causal filter $h[m]$: $\sum_{m=-\infty}^{\infty}h[m]\,r_x[k-m]=r_{dx}[k]$ for all $k\in\mathbb{Z}$.
Write the estimation error $e[n]=d[n]-\hat d[n]$ and set $\mathbb{E}\{e[n]\,x^*[n-k]\}=0$.
Use WSS to collapse absolute-time indices into lag differences.
Write the error and impose orthogonality
The LMMSE estimate is $\hat d[n]=\sum_{m=-\infty}^{\infty}h[m]\,x[n-m]$. Orthogonality requires $\mathbb{E}\{(d[n]-\hat d[n])\,x^*[n-k]\}=0$ for every $k\in\mathbb{Z}$.
Collapse using WSS
Expanding the expectation: $\mathbb{E}\{d[n]x^*[n-k]\}=r_{dx}[k]$ and $\mathbb{E}\{x[n-m]x^*[n-k]\}=r_x[k-m]$. The equation becomes $\sum_m h[m]\,r_x[k-m]=r_{dx}[k]$ for all $k\in\mathbb{Z}$.
Remark
This is a discrete convolution equation on the full integer line. Taking the DTFT gives $H(f)=S_{dx}(f)/S_x(f)$ directly.
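The Wiener-Hopf equation can be checked numerically on a DFT grid, where the lag-domain convolution becomes a product of spectra. A minimal sketch; the AR(1)-signal-plus-white-noise spectra and all parameter values are illustrative assumptions, not part of the exercise:

```python
import numpy as np

# Illustrative spectra on a DFT grid: AR(1) signal in unit-variance white noise.
N = 4096
f = np.fft.fftfreq(N)
a, sw2, sv2 = 0.8, 1.0, 1.0
Ss = sw2 / np.abs(1 - a * np.exp(-2j * np.pi * f)) ** 2
Sdx, Sx = Ss, Ss + sv2                     # S_dx = S_s, S_x = S_s + sigma_v^2

H = Sdx / Sx                               # frequency-domain Wiener solution
h = np.real(np.fft.ifft(H))                # filter taps h[m] (circularly indexed)
rx = np.real(np.fft.ifft(Sx))              # r_x[k]
rdx = np.real(np.fft.ifft(Sdx))            # r_dx[k]

# Wiener-Hopf check: sum_m h[m] r_x[k-m] = r_dx[k], as a circular convolution.
lhs = np.real(np.fft.ifft(np.fft.fft(h) * np.fft.fft(rx)))
```

On the grid the check holds to machine precision, because multiplying $H$ by $S_x$ reproduces $S_{dx}$ exactly.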
ex-ch09-02
Easy. Compute the non-causal Wiener filter and the resulting MMSE for the signal-plus-noise model $x[n]=s[n]+v[n]$, $d[n]=s[n]$, where $s$ and $v$ are independent, zero-mean, WSS with PSDs $S_s(f)$ and $S_v(f)$ respectively.
Independence plus zero means gives $r_{dx}[k]=r_s[k]$ and $r_x[k]=r_s[k]+r_v[k]$.
Cross and auto spectra
$r_{dx}[k]=\mathbb{E}\{s[n]\,(s[n-k]+v[n-k])^*\}=r_s[k]$, so $S_{dx}(f)=S_s(f)$. Similarly $S_x(f)=S_s(f)+S_v(f)$.
Filter and MMSE
The non-causal Wiener filter is $H(f)=\dfrac{S_s(f)}{S_s(f)+S_v(f)}$, and the MMSE is $\displaystyle\int_{-1/2}^{1/2}\frac{S_s(f)\,S_v(f)}{S_s(f)+S_v(f)}\,df$.
Operational reading
The filter is a frequency-selective attenuator: it passes frequencies where $S_s(f)\gg S_v(f)$ (there $H\approx 1$) and suppresses those with $S_s(f)\ll S_v(f)$ (there $H\approx 0$).
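The attenuator behavior is easy to see numerically. A sketch using an arbitrary raised-cosine signal PSD and white-noise level chosen only for illustration:

```python
import numpy as np

f = np.linspace(-0.5, 0.5, 2001)
Ss = 1.0 + np.cos(2 * np.pi * f)           # low-pass signal PSD (illustrative)
Sv = 0.5 * np.ones_like(f)                 # white observation noise PSD
H = Ss / (Ss + Sv)                         # non-causal Wiener filter
mmse = np.mean(Ss * Sv / (Ss + Sv))        # MMSE integral (uniform-grid average)
```

$H$ stays in $[0,1]$, vanishes at the band edges where $S_s=0$, and the MMSE is below both the signal power and the noise power.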
ex-ch09-03
Medium. Let $s[n]$ be an AR(1) process with $s[n]=a\,s[n-1]+w[n]$, $|a|<1$, where $w[n]$ is white with variance $\sigma_w^2$, observed in independent white noise $v[n]$ with variance $\sigma_v^2$, so $x[n]=s[n]+v[n]$. Derive closed-form expressions for $S_s(f)$, $S_x(f)$, and the non-causal Wiener filter $H(f)$.
$S_s(f)=\sigma_w^2/|1-a\,e^{-j2\pi f}|^2$.
PSD of the AR(1) signal
Taking the transfer function $1/(1-az^{-1})$ acting on white $w[n]$, $S_s(f)=\dfrac{\sigma_w^2}{|1-a\,e^{-j2\pi f}|^2}=\dfrac{\sigma_w^2}{1+a^2-2a\cos 2\pi f}$.
Observation PSD
$S_x(f)=S_s(f)+\sigma_v^2$.
Non-causal Wiener filter
$H(f)=\dfrac{S_s(f)}{S_s(f)+\sigma_v^2}=\dfrac{\sigma_w^2}{\sigma_w^2+\sigma_v^2\,(1+a^2-2a\cos 2\pi f)}$.
Operational reading
For $a$ close to 1 the signal PSD concentrates near DC, and the Wiener filter becomes a low-pass filter. For $a$ near 0 the signal is nearly white and the filter collapses to the scalar MMSE gain $\sigma_w^2/(\sigma_w^2+\sigma_v^2)$.
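A quick consistency check of the closed form against the definition $S_s/(S_s+\sigma_v^2)$, with illustrative parameter values:

```python
import numpy as np

a, sw2, sv2 = 0.9, 1.0, 0.5                # illustrative AR(1) and noise parameters
f = np.linspace(-0.5, 0.5, 1001)
D = 1 + a**2 - 2 * a * np.cos(2 * np.pi * f)
Ss = sw2 / D                               # AR(1) signal PSD
H_def = Ss / (Ss + sv2)                    # filter from the definition
H_closed = sw2 / (sw2 + sv2 * D)           # closed-form expression
```

The two expressions agree pointwise, and the response is low-pass: largest at $f=0$, smallest at the band edge.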
ex-ch09-04
Medium. Prove that the non-causal MMSE can be written as $\displaystyle\mathrm{MMSE}=\int_{-1/2}^{1/2}\left(S_d(f)-\frac{|S_{dx}(f)|^2}{S_x(f)}\right)df$. Interpret the integrand.
MMSE $=r_d[0]-\sum_m h[m]\,r_{dx}^*[m]$; apply Parseval.
Time-domain MMSE
From orthogonality, $\mathrm{MMSE}=\mathbb{E}\{e[n]\,d^*[n]\}=r_d[0]-\sum_m h[m]\,r_{dx}^*[m]$.
Parseval
$r_d[0]=\int_{-1/2}^{1/2}S_d(f)\,df$ and $\sum_m h[m]\,r_{dx}^*[m]=\int_{-1/2}^{1/2}H(f)\,S_{dx}^*(f)\,df=\int_{-1/2}^{1/2}\frac{|S_{dx}(f)|^2}{S_x(f)}\,df$. Substituting yields the claimed expression.
Interpretation
The integrand is the residual PSD after projecting the desired signal onto the observation spectrum at each frequency. When $|S_{dx}(f)|^2=S_d(f)\,S_x(f)$ (perfect coherence), the integrand vanishes and the estimate is exact.
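The time-domain and frequency-domain MMSE expressions can be compared on a DFT grid, where Parseval's relation holds exactly. The AR(1)-plus-noise spectra below are an illustrative assumption:

```python
import numpy as np

N = 4096
f = np.fft.fftfreq(N)
a, sw2, sv2 = 0.8, 1.0, 1.0
Sd = sw2 / np.abs(1 - a * np.exp(-2j * np.pi * f)) ** 2   # S_d = S_s
Sx = Sd + sv2                              # additive white noise
Sdx = Sd                                   # cross-spectrum for d = s

H = Sdx / Sx
h = np.real(np.fft.ifft(H))                # filter taps
rdx = np.real(np.fft.ifft(Sdx))            # cross-correlation r_dx[k]
mmse_time = np.mean(Sd) - np.sum(h * rdx)  # r_d[0] - sum_m h[m] r_dx[m]
mmse_freq = np.mean(Sd - np.abs(Sdx) ** 2 / Sx)           # frequency formula
```

The grid average stands in for the integral over $[-\tfrac12,\tfrac12)$; the two numbers agree to rounding error.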
ex-ch09-05
Medium. State the Paley-Wiener condition for a PSD $S_x(f)$ and explain why it is required for spectral factorization $S_x(z)=\sigma^2 L(z)L^*(1/z^*)$ with $L(z)$ causal and minimum-phase.
The condition is on the integrability of $\log S_x(f)$.
The condition
$\int_{-1/2}^{1/2}\log S_x(f)\,df>-\infty$ (equivalently, $\log S_x\in L^1[-\tfrac12,\tfrac12]$, since integrability of $S_x$ already controls the positive part of $\log S_x$).
Why it is needed
Spectral factorization constructs $L(z)=\exp\big(\sum_{k\ge 1}c_k z^{-k}\big)$ and $\sigma^2=e^{c_0}$, where $c_k$ are the Fourier coefficients of $\log S_x(f)$. These coefficients exist and decay only when $\log S_x\in L^1$. Without this, no causal $L(z)$ with a causal inverse exists.
Practical consequence
A PSD that vanishes on a set of positive measure (a spectral null) violates Paley-Wiener; the process is then deterministic in the forward direction (perfectly predictable from its past) and cannot be represented as the output of a causal stable filter driven by white noise.
ex-ch09-06
Medium. Perform the spectral factorization $S_x(z)=\sigma^2 L(z)L(z^{-1})$ of a rational PSD, concretely of $S_x(f)=\tfrac{5}{4}+\cos 2\pi f$. Identify the minimum-phase factor $L(z)$ and the innovations variance $\sigma^2$.
Express in $z$: $S_x(z)=\tfrac{5}{4}+\tfrac{1}{2}(z+z^{-1})$.
Find the roots of $\tfrac{1}{2}z^2+\tfrac{5}{4}z+\tfrac{1}{2}$; pair them inside/outside the unit circle.
Rewrite in $z$
$S_x(z)=\tfrac{5}{4}+\tfrac{1}{2}(z+z^{-1})=\tfrac{1}{2}z^{-1}\big(z^2+\tfrac{5}{2}z+1\big)$.
Factor in terms of unit-circle roots
The roots of $z^2+\tfrac{5}{2}z+1$ are $z=-\tfrac{1}{2}$ (inside the unit circle) and $z=-2$ (outside); they form a reciprocal pair. Equivalently, $S_x(z)=\big(1+\tfrac{1}{2}z^{-1}\big)\big(1+\tfrac{1}{2}z\big)$.
Extract minimum-phase factor
Take $L(z)=1+\tfrac{1}{2}z^{-1}$ (zero inside the unit disk at $z=-\tfrac{1}{2}$, causal and stable) and $\sigma^2=1$. Check: $\sigma^2 L(z)L(z^{-1})=\big(1+\tfrac{1}{2}z^{-1}\big)\big(1+\tfrac{1}{2}z\big)=\tfrac{5}{4}+\tfrac{1}{2}(z+z^{-1})$.
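A factorization of this type can be verified numerically on the unit circle. The PSD $\tfrac54+\cos 2\pi f$, the factor $1+\tfrac12 z^{-1}$, and $\sigma^2=1$ are the assumed concrete instance here:

```python
import numpy as np

f = np.linspace(-0.5, 0.5, 1001)
Sx = 1.25 + np.cos(2 * np.pi * f)          # candidate PSD on the unit circle
L = 1 + 0.5 * np.exp(-2j * np.pi * f)      # minimum-phase factor L(z) = 1 + 0.5 z^-1
sigma2 = 1.0                               # innovations variance
zero = np.roots([1.0, 0.5])[0]             # zero of L(z): should lie inside |z| < 1
```

The check $\sigma^2|L(e^{j2\pi f})|^2=S_x(f)$ holds pointwise, and the zero at $z=-\tfrac12$ confirms minimum phase.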
ex-ch09-07
Medium. Show that the innovations process $\varepsilon[n]$, obtained by passing $x[n]$ through the whitening filter $1/L(z)$, is white and has variance $\sigma^2$. Why are innovations useful for deriving the causal Wiener filter?
Compute $S_\varepsilon(f)$.
Whiten
Let $\varepsilon[n]$ be the output of $1/L(z)$ driven by $x[n]$. In frequency, $S_\varepsilon(f)=\dfrac{S_x(f)}{|L(e^{j2\pi f})|^2}=\sigma^2$, constant. So $\varepsilon[n]$ is white with variance $\sigma^2$.
Why it helps
Because $\varepsilon$ is white, projecting onto $\{\varepsilon[n-k]\}_{k\ge 0}$ is a sequence of independent scalar projections. The causal Wiener filter is then the causal part of the whitened cross-spectrum, normalized by $\sigma^2$: $H(z)=\dfrac{1}{\sigma^2 L(z)}\left[\dfrac{S_{dx}(z)}{L^*(1/z^*)}\right]_+$.
Remark
The bracketed operation $[\cdot]_+$ keeps only causal (non-negative index) Fourier coefficients; it is the formal projection onto the past.
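Whitening is simple to demonstrate by simulation. For an AR(1) process the minimum-phase factor is $L(z)=1/(1-az^{-1})$, so the whitening filter is the FIR operation $1-az^{-1}$; the parameter values and sample size are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)
a, N = 0.8, 100000
w = rng.standard_normal(N)                 # unit-variance driving noise
x = np.zeros(N)
for n in range(1, N):                      # AR(1): x[n] = a x[n-1] + w[n]
    x[n] = a * x[n-1] + w[n]

# Whitening filter 1/L(z) = 1 - a z^-1 applied to x recovers the innovations.
eps = x - a * np.concatenate(([0.0], x[:-1]))
acf1 = np.mean(eps[1:] * eps[:-1]) / np.var(eps)   # lag-1 sample autocorrelation
```

The output has unit variance and negligible lag-1 correlation, i.e. it is (sample-)white.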
ex-ch09-08
Hard. Derive the causal Wiener filter for the AR(1)-in-white-noise model of Exercise ex-ch09-03 in closed form. Compute the causal MMSE and compare with the non-causal MMSE.
Spectral-factor $S_x(z)=S_s(z)+\sigma_v^2$.
Find the unique $b$ with $|b|<1$ and gain $\sigma_\varepsilon^2$ such that $S_x(z)=\sigma_\varepsilon^2\,\dfrac{(1-bz^{-1})(1-bz)}{(1-az^{-1})(1-az)}$.
Combine and factor
$S_x(z)=\dfrac{\sigma_w^2}{(1-az^{-1})(1-az)}+\sigma_v^2=\dfrac{\sigma_w^2+\sigma_v^2(1-az^{-1})(1-az)}{(1-az^{-1})(1-az)}$. The numerator is a second-order symmetric polynomial; write it as $\sigma_\varepsilon^2(1-bz^{-1})(1-bz)$ with $|b|<1$ and $\sigma_\varepsilon^2>0$. Matching coefficients gives $\sigma_\varepsilon^2(1+b^2)=\sigma_w^2+\sigma_v^2(1+a^2)$ and $\sigma_\varepsilon^2\,b=\sigma_v^2\,a$.
Causal filter
With $L(z)=\dfrac{1-bz^{-1}}{1-az^{-1}}$, the whitened cross-spectrum is $\dfrac{S_s(z)}{L(z^{-1})}=\dfrac{\sigma_w^2}{(1-az^{-1})(1-bz)}$, whose causal part is $\dfrac{\sigma_w^2}{(1-ab)(1-az^{-1})}$. After cancelling the common causal pole, one obtains $H(z)=\dfrac{K}{1-bz^{-1}}$ with $K=\dfrac{\sigma_w^2}{\sigma_\varepsilon^2(1-ab)}$, where the single pole at $z=b$ arises from the $[\cdot]_+$ operation. This is a first-order IIR smoother.
Comparison
The causal MMSE is strictly larger than the non-causal one unless $a=0$ (i.e. the signal is white). The gap is quantified by the Kolmogorov-Szegő formula applied to the prediction-error spectrum.
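The coefficient matching can be solved and verified numerically. Dividing the two matching equations gives $(1+b^2)/b=c$ with $c=\big(\sigma_w^2+\sigma_v^2(1+a^2)\big)/(\sigma_v^2 a)$, a quadratic in $b$ whose stable root lies inside the unit circle; the parameter values are illustrative:

```python
import numpy as np

a, sw2, sv2 = 0.8, 1.0, 1.0                # illustrative model parameters
c = (sw2 + sv2 * (1 + a**2)) / (sv2 * a)   # (1 + b^2)/b must equal c
b = (c - np.sqrt(c**2 - 4)) / 2            # root inside the unit circle
se2 = sv2 * a / b                          # innovations variance sigma_eps^2
K = sw2 / (se2 * (1 - a * b))              # causal filter gain: H(z) = K/(1 - b z^-1)

# Verify the factorization S_x = se2 |1 - b e^{-jw}|^2 / |1 - a e^{-jw}|^2 on a grid.
f = np.linspace(-0.5, 0.5, 1001)
e = np.exp(-2j * np.pi * f)
Sx = sw2 / np.abs(1 - a * e) ** 2 + sv2
fact = se2 * np.abs(1 - b * e) ** 2 / np.abs(1 - a * e) ** 2
```

The factorization matches the PSD pointwise, confirming the pair $(b,\sigma_\varepsilon^2)$.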
ex-ch09-09
Medium. State the Kolmogorov-Szegő formula for the one-step prediction error variance of a purely non-deterministic WSS process, and interpret it as a geometric mean.
The formula involves $\exp\Big(\int_{-1/2}^{1/2}\log S_x(f)\,df\Big)$.
Formula
For a purely non-deterministic WSS process with PSD $S_x(f)>0$ a.e. satisfying Paley-Wiener, the minimum one-step prediction-error variance is $\sigma_\infty^2=\exp\Big(\int_{-1/2}^{1/2}\log S_x(f)\,df\Big)$.
Geometric-mean interpretation
The arithmetic mean of $S_x$ is $\int_{-1/2}^{1/2}S_x(f)\,df=r_x[0]$ (signal power). The geometric mean is $\sigma_\infty^2$. The ratio is the achievable prediction gain: $r_x[0]/\sigma_\infty^2\ge 1$ by Jensen's inequality, with equality iff the process is white.
Consequence
Concentrating PSD energy (colored spectra) lowers $\sigma_\infty^2$: predictable structure reduces the innovations floor.
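For an AR(1) process driven by variance-$\sigma_w^2$ white noise, the log-integral of $|1-ae^{-j2\pi f}|^{-2}$ vanishes, so the formula should return exactly $\sigma_w^2$. A numerical check with illustrative parameters:

```python
import numpy as np

a, sw2 = 0.8, 1.0
f = (np.arange(4096) + 0.5) / 4096 - 0.5   # uniform midpoint grid on (-1/2, 1/2)
Ss = sw2 / np.abs(1 - a * np.exp(-2j * np.pi * f)) ** 2
geo = np.exp(np.mean(np.log(Ss)))          # geometric mean = exp(int log S)
arith = np.mean(Ss)                        # arithmetic mean = r_x[0]
```

The geometric mean recovers $\sigma_w^2=1$ while the arithmetic mean equals $r_x[0]=\sigma_w^2/(1-a^2)$, so the prediction gain is $1/(1-a^2)\approx 2.78$.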
ex-ch09-10
Easy. A random process has $r_x[0]=1$, $r_x[1]=0.5$, $r_x[2]=0.1$. Fit a second-order linear predictor $\hat x[n]=a_1x[n-1]+a_2x[n-2]$ by solving the Yule-Walker equations.
Yule-Walker: $Ra=r$, with $R=\begin{pmatrix}r_x[0]&r_x[1]\\ r_x[1]&r_x[0]\end{pmatrix}$, $r=\begin{pmatrix}r_x[1]\\ r_x[2]\end{pmatrix}$.
Set up the system
$\begin{pmatrix}1&0.5\\ 0.5&1\end{pmatrix}\begin{pmatrix}a_1\\ a_2\end{pmatrix}=\begin{pmatrix}0.5\\ 0.1\end{pmatrix}$.
Solve
Determinant $=1-0.5^2=0.75$. $a_1=(0.5-0.5\cdot 0.1)/0.75=0.6$, $a_2=(0.1-0.5\cdot 0.5)/0.75=-0.2$.
Prediction-error variance
$\sigma_e^2=r_x[0]-a_1r_x[1]-a_2r_x[2]=1-0.6\cdot 0.5-(-0.2)\cdot 0.1=0.72$.
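The same system can be solved with a linear-algebra routine; the autocorrelation values $r_x[0]=1$, $r_x[1]=0.5$, $r_x[2]=0.1$ used below are the assumed exercise data:

```python
import numpy as np

r0, r1, r2 = 1.0, 0.5, 0.1                 # assumed autocorrelation values
R = np.array([[r0, r1],
              [r1, r0]])                   # 2x2 Toeplitz autocorrelation matrix
r = np.array([r1, r2])
a_hat = np.linalg.solve(R, r)              # Yule-Walker: R a = r
mmse = r0 - a_hat @ r                      # prediction-error variance
```

This reproduces $a_1=0.6$, $a_2=-0.2$ and $\sigma_e^2=0.72$.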
ex-ch09-11
Hard. Consider the two-sided non-causal smoother for $x[n]=s[n]+v[n]$ with independent zero-mean WSS $s$ and $v$. Prove that the non-causal MMSE is strictly smaller than the causal MMSE whenever $S_s(f)S_v(f)>0$ on a set of positive measure, and equal only in pathological cases.
Express both MMSEs via the whitened cross-spectrum of the observation.
MMSE gap
One can show (Kailath-Sayed-Hassibi, Ch. 7) that $\mathrm{MMSE}_{\text{causal}}-\mathrm{MMSE}_{\text{non-causal}}=\sum_{k<0}|g[k]|^2$, where $g[k]$ are the coefficients of the whitened cross-spectrum $G(z)=S_{dx}(z)/\big(\sigma_\varepsilon L(z^{-1})\big)$, i.e. the energy of the anti-causal part of the whitened cross-spectrum.
Strict positivity
The right-hand side is non-negative and vanishes iff $g[k]=0$ for all $k<0$, that is, iff the whitened cross-correlation is purely causal. For the additive-noise model this fails unless $S_s(f)S_v(f)=0$ almost everywhere.
Operational reading
Access to the future strictly improves estimation whenever signal and noise overlap in frequency; the smoothing gap is a direct measure of that overlap.
ex-ch09-12
Medium. Derive the frequency response of the $k$-step Wiener predictor of $x[n+k]$ from $\{x[m],\,m\le n\}$, for a purely non-deterministic WSS $x[n]$ with innovations representation $x[n]=\sum_{m\ge 0}l[m]\,\varepsilon[n-m]$, $\varepsilon$ white with variance $\sigma^2$.
The $k$-step predictor keeps the causal part of $L(z)$ times $z^{k}$.
Setup
$x[n+k]=\sum_{m\ge 0}l[m]\,\varepsilon[n+k-m]$. The MMSE $k$-step predictor of $x[n+k]$ from $\{\varepsilon[j],\,j\le n\}$ equals the projection onto $\mathrm{span}\{\varepsilon[j]:j\le n\}$, namely $\hat x[n+k\mid n]=\sum_{m\ge k}l[m]\,\varepsilon[n+k-m]$.
Transfer function
Reindexing, the predictor has transfer function, in $z$-domain, $H_k(z)=\dfrac{1}{L(z)}\big[z^{k}L(z)\big]_+$, which reduces to $H_k(z)=\dfrac{1}{L(z)}\sum_{m\ge k}l[m]\,z^{k-m}$, where $[\cdot]_+$ keeps non-negative powers of $z^{-1}$.
Prediction error
$\mathbb{E}\{|x[n+k]-\hat x[n+k\mid n]|^2\}=\sigma^2\sum_{m=0}^{k-1}|l[m]|^2$, which grows with $k$ and saturates at $r_x[0]$.
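For an AR(1) process the innovations coefficients are $l[m]=a^m$, so the $k$-step predictor is $a^k x[n]$ and the error variance is $\sigma_w^2(1-a^{2k})/(1-a^2)$. A simulation check with illustrative parameters:

```python
import numpy as np

rng = np.random.default_rng(3)
a, N = 0.8, 100000
x = np.zeros(N)
for n in range(1, N):                      # AR(1) with unit-variance innovations
    x[n] = a * x[n-1] + rng.standard_normal()

ks = (1, 2, 5)
emp = [np.mean((x[k:] - a**k * x[:-k]) ** 2) for k in ks]   # empirical k-step MSE
theory = [(1 - a**(2 * k)) / (1 - a**2) for k in ks]        # sigma^2 sum a^{2m}
```

The empirical errors track the closed form and increase toward the saturation level $r_x[0]=1/(1-a^2)$.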
ex-ch09-13
Medium. Derive the MMSE of the one-step predictor for an AR($p$) process $x[n]=\sum_{i=1}^{p}a_i x[n-i]+w[n]$, $w[n]$ white with variance $\sigma_w^2$.
For an AR($p$) process the optimal one-step predictor is the regression part of the recursion itself.
Predictor
The true recursion shows that the AR($p$) predictor $\hat x[n\mid n-1]=\sum_{i=1}^{p}a_i x[n-i]$ uses only the past $p$ samples; since the resulting error $w[n]$ is orthogonal to the entire past, it is optimal.
MMSE
The prediction error is $x[n]-\hat x[n\mid n-1]=w[n]$, hence $\mathrm{MMSE}=\sigma_w^2$. This is also what the Kolmogorov-Szegő formula yields: $\exp\big(\int_{-1/2}^{1/2}\log S_x(f)\,df\big)=\sigma_w^2$, because $\int_{-1/2}^{1/2}\log|A(e^{j2\pi f})|^{-2}\,df=0$ for the minimum-phase $A(z)=1-\sum_{i=1}^{p}a_i z^{-i}$.
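A short simulation makes the point concrete; the AR(2) coefficients below are an illustrative stable choice:

```python
import numpy as np

rng = np.random.default_rng(2)
a1, a2, sw2 = 0.5, -0.3, 1.0               # stable AR(2), unit innovations variance
N = 100000
w = np.sqrt(sw2) * rng.standard_normal(N)
x = np.zeros(N)
for n in range(2, N):
    x[n] = a1 * x[n-1] + a2 * x[n-2] + w[n]

pred = a1 * x[1:-1] + a2 * x[:-2]          # one-step AR(2) predictor
mse = np.mean((x[2:] - pred) ** 2)         # should be close to sigma_w^2
```

The empirical one-step MSE matches $\sigma_w^2$ to sampling error.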
ex-ch09-14
Easy. Consider a narrowband desired signal $s[n]$ with PSD concentrated in $|f|\le f_0$, observed in independent wideband white noise of variance $\sigma_v^2$. Sketch the Wiener filter magnitude response and give the in-band / out-of-band gains.
Use $H(f)=\dfrac{S_s(f)}{S_s(f)+\sigma_v^2}$ and approximate in the two regimes.
In-band
For $|f|\le f_0$, $S_s(f)\gg\sigma_v^2$, so $H(f)\approx 1$ (signal passes).
Out-of-band
For $|f|>f_0$, $S_s(f)\ll\sigma_v^2$, so $H(f)\approx S_s(f)/\sigma_v^2\approx 0$ (noise suppressed).
Transition
Near the band edge $|f|=f_0$ the filter transitions smoothly; the sharpness of the transition depends on the roll-off of $S_s(f)$. The Wiener filter behaves as a signal-matched low-pass filter.
ex-ch09-15
Medium. Show that the Wiener filter is a contraction: $0\le H(f)\le 1$ for all $f$, hence $S_{\hat s}(f)\le S_x(f)$, where $S_{\hat s}$ and $S_x$ are the spectra of $\hat s[n]$ and $x[n]$ respectively. Interpret.
$H(f)=\dfrac{S_s(f)}{S_s(f)+S_v(f)}\in[0,1]$ since $S_s,S_v\ge 0$ in the additive-noise case.
Bound on $|H(f)|$
For $x[n]=s[n]+v[n]$ with $s,v$ independent, $H(f)=\dfrac{S_s(f)}{S_s(f)+S_v(f)}$, a non-negative numerator never exceeding its denominator, so $0\le H(f)\le 1$.
Power inequality
$S_{\hat s}(f)=|H(f)|^2S_x(f)\le S_x(f)$. Since $S_x(f)=S_s(f)+S_v(f)\ge S_s(f)$ (adding noise adds variance), this bounds the estimate's power by the observation power. Because additionally $|H(f)|^2S_x(f)=\dfrac{S_s(f)^2}{S_s(f)+S_v(f)}\le S_s(f)$, one also gets $\mathbb{E}\{|\hat s[n]|^2\}\le\mathbb{E}\{|s[n]|^2\}$.
Interpretation
The Wiener filter never amplifies: it attenuates at every frequency, and it passes no power at frequencies where the signal is absent ($H(f)=0$ wherever $S_s(f)=0$).
ex-ch09-16
Medium. The LMS algorithm updates a tap vector by $w_{n+1}=w_n+\mu\,e[n]\,x_n$, where $e[n]=d[n]-w_n^{\top}x_n$. Identify the deterministic counterpart of LMS that converges to the Wiener solution, and state the standard step-size condition for mean convergence.
The deterministic (expected-value) update is steepest descent on the MSE cost.
Use the eigenvalues of $R=\mathbb{E}\{x_n x_n^{\top}\}$.
Deterministic counterpart
Taking expectations (under the standard independence approximation) yields $\bar w_{n+1}=\bar w_n+\mu\,(p-R\bar w_n)$ with $p=\mathbb{E}\{d[n]\,x_n\}$, which is steepest descent on $J(w)=\mathbb{E}\{|d[n]-w^{\top}x_n|^2\}$. Its fixed point solves $Rw=p$, the Wiener-Hopf normal equations.
Step-size condition
Convergence in the mean requires $0<\mu<2/\lambda_{\max}(R)$. Slow-converging modes are dictated by $\lambda_{\min}(R)$, so the condition number $\lambda_{\max}/\lambda_{\min}$ controls the misadjustment-speed tradeoff.
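A minimal LMS sketch converging to the Wiener solution. The AR(1) input, the hypothetical unknown system `w_true`, and all parameter values are illustrative; since the measurement noise is independent of the input, the Wiener solution here coincides with `w_true`:

```python
import numpy as np

rng = np.random.default_rng(0)
N, M, mu = 50000, 4, 0.01                  # samples, taps, step size (mu << 2/lambda_max)
x = np.zeros(N)
for n in range(1, N):                      # correlated AR(1) input, rho = 0.7
    x[n] = 0.7 * x[n-1] + rng.standard_normal()

w_true = np.array([1.0, -0.5, 0.25, 0.1])  # hypothetical unknown FIR system
w = np.zeros(M)
for n in range(M - 1, N):
    u = x[n - M + 1:n + 1][::-1]           # regressor [x[n], ..., x[n-M+1]]
    d = w_true @ u + 0.1 * rng.standard_normal()   # noisy desired signal
    e = d - w @ u                          # a-priori error
    w = w + mu * e * u                     # LMS update
```

After convergence the taps fluctuate around the Wiener solution with a small misadjustment set by $\mu$.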
ex-ch09-17
Hard. Consider the \emph{smoothing} problem: estimate $s[n-L]$ from $\{x[m],\,m\le n\}$ for fixed lag $L\ge 0$. Derive the frequency response of the optimal lag-$L$ smoother and show that as $L\to\infty$ it converges (up to the delay) to the non-causal Wiener filter.
Use the causal Wiener filter with desired signal $d[n]=s[n-L]$ and then shift.
Reduce to causal
Estimating $s[n-L]$ from data up to time $n$ is equivalent (up to a delay) to causally estimating a delayed target from $x$. Apply the causal Wiener solution with cross-spectrum $S_{dx}(z)=z^{-L}S_{sx}(z)$.
Smoother formula
$H_L(z)=\dfrac{1}{\sigma_\varepsilon^2 B(z)}\left[\dfrac{z^{-L}S_{sx}(z)}{B(z^{-1})}\right]_+$, where $B(z)$ is the minimum-phase spectral factor of $S_x$ and $\sigma_\varepsilon^2$ the innovations variance.
Limit $L\to\infty$
As $L\to\infty$ the bracketed causal projection absorbs the entire spectrum $z^{-L}S_{sx}(z)/B(z^{-1})$ (with $B(z)$ the minimum-phase factor of $S_x$), yielding $H_L(z)\to z^{-L}\,\dfrac{S_{sx}(z)}{S_x(z)}$: the non-causal Wiener filter preceded by an $L$-sample delay.
Interpretation
Every extra unit of allowable delay lets the smoother exploit more of the anti-causal cross-correlation. The curve of MMSE vs. $L$ is the canonical smoothing tradeoff.
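The MMSE-vs-lag tradeoff can be illustrated with a finite-window analogue: solve the FIR normal equations for the target $s[n-L]$ using the regressor $[x[n],\dots,x[n-N+1]]$, so that larger $L$ gives the estimator more future samples relative to the target. The AR(1)-plus-white-noise model and all parameters are illustrative:

```python
import numpy as np

a, sw2, sv2, N = 0.8, 1.0, 0.5, 16
rs = (sw2 / (1 - a**2)) * a ** np.abs(np.arange(2 * N))   # r_s[k] for AR(1)
rx = rs[:N].copy(); rx[0] += sv2                          # r_x = r_s + sv2*delta
idx = np.abs(np.subtract.outer(np.arange(N), np.arange(N)))
R = rx[idx]                                               # Toeplitz R_x

def mmse_lag(L):
    # Estimate s[n-L] from x[n], ..., x[n-N+1]: p[k] = r_s[k-L].
    p = rs[np.abs(np.arange(N) - L)]
    h = np.linalg.solve(R, p)
    return rs[0] - h @ p

m0, m2, m4 = mmse_lag(0), mmse_lag(2), mmse_lag(4)
```

The MMSE decreases as the lag grows, i.e. as more future data becomes usable.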
ex-ch09-18
Medium. Show that the Wiener-Hopf normal equations for an FIR filter of length $N$ reduce to the matrix equation $R_x h=r_{dx}$, with $R_x$ Hermitian Toeplitz. Discuss when $R_x$ is singular and how to regularize.
Truncate the infinite Wiener-Hopf equation to lags $0,\dots,N-1$.
Finite truncation
Imposing $h[m]=0$ for $m<0$ and $m\ge N$, and restricting orthogonality to $k=0,\dots,N-1$: $\sum_{m=0}^{N-1}h[m]\,r_x[k-m]=r_{dx}[k]$, i.e. $R_x h=r_{dx}$ with $(R_x)_{k,m}=r_x[k-m]$ Hermitian Toeplitz.
Singularity
$R_x$ is singular iff the regressor $[x[n],\dots,x[n-N+1]]^{\top}$ lies almost surely in a proper subspace (e.g. $x$ is a deterministic sum of sinusoids with fewer than $N$ components). Then the estimator is underdetermined on the null space.
Regularization
Use diagonal loading (ridge), i.e. solve $(R_x+\delta I)h=r_{dx}$, or drop to a shorter filter. Diagonal loading is the MAP solution under a zero-mean Gaussian prior on $h$.
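A sketch of the finite-length solve with and without loading; the AR(1)-plus-white-noise autocorrelations and the loading level are illustrative assumptions:

```python
import numpy as np

a, sw2, sv2, N = 0.8, 1.0, 0.5, 8
rs = (sw2 / (1 - a**2)) * a ** np.abs(np.arange(N))       # r_s[k] for AR(1)
rx = rs.copy(); rx[0] += sv2                              # r_x = r_s + sv2*delta
idx = np.abs(np.subtract.outer(np.arange(N), np.arange(N)))
R = rx[idx]                                               # Toeplitz R_x from |i-j|
p = rs                                                    # r_dx[k] = r_s[k] for d = s

h = np.linalg.solve(R, p)                                 # normal equations
mmse = rs[0] - h @ p                                      # FIR Wiener MMSE
delta = 1e-3
h_reg = np.linalg.solve(R + delta * np.eye(N), p)         # diagonal loading
```

Here $R_x$ is well conditioned (the white-noise floor keeps its eigenvalues above $\sigma_v^2$), so the loaded solution barely moves; loading matters when the observation spectrum nearly vanishes.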