Ferkans — Interactive Telecom Tutor

What State Evolution Delivers

The payoff for introducing the Onsager term is that AMP admits a one-dimensional deterministic description of its per-iteration performance in the large-system limit. This description — state evolution (SE) — is a scalar recursion $\tau_{t+1}^2 = \Psi(\tau_t^2)$ whose fixed points predict the terminal MSE, the convergence speed, and the sharp phase-transition boundary between recoverable and unrecoverable signals.

The analytical power is considerable: we can predict AMP's MSE without running AMP, plot phase diagrams by sweeping $(\delta,\rho)$ , and design denoisers by minimising a scalar function.

Theorem: State Evolution (Bayati--Montanari)

Consider AMP with denoiser $\eta$ on the model of Definition DLinear Observation Model, with $N, M \to \infty$ at fixed $\delta = M/N$ . Assume $\eta(\cdot;\theta)$ is Lipschitz and the prior $p_X$ has finite second moment. Then, for every iteration $t \ge 0$ :

(a) Gaussianity of pseudo-data. In the sense of empirical distribution convergence, $\mathbf{A}^{H}\mathbf{r}^t + \hat{\mathbf{x}}^t \;\stackrel{d}{=}\; \mathbf{x} + \tau_t \mathbf{Z}, \qquad \mathbf{Z} \sim \mathcal{N}(\mathbf{0},\mathbf{I}_N), \quad \mathbf{Z} \perp \mathbf{x}.$

(b) Scalar recursion. The variance $\tau_t^2$ evolves as $\boxed{\tau_{t+1}^2 \;=\; \sigma^2 \;+\; \frac{1}{\delta}\,\mathbb{E}\!\left[\bigl(\eta(X+\tau_t Z;\theta_t) - X\bigr)^2\right],}$ with $\tau_0^2 = \sigma^2 + \frac{1}{\delta}\mathbb{E}[X^2]$ , $X \sim p_X$ , $Z \sim \mathcal{N}(0,1)$ independent.

Part (a) says that AMP behaves as if at every iteration we were denoising the signal $\mathbf{x}$ observed in an AWGN channel of variance $\tau_t^2$ . Part (b) is Pythagoras: the effective noise variance at the next iteration equals the physical noise $\sigma^2$ plus a rescaling ( $1/\delta$ ) of the scalar MSE achieved by the denoiser at the previous iteration.

Proof

Conditioning on the iterates (cavity argument)

Denote by $\mathbb{F}_t$ the $\sigma$ -algebra generated by $\{\hat{\mathbf{x}}^s, \mathbf{r}^s\}_{s \le t}$ . By the orthogonality of Gaussian projections, conditioning $\mathbf{A}$ on $\mathbb{F}_t$ leaves the complementary random projection Gaussian. This is the same cavity-method argument used in the TAP analysis of the SK model.

Onsager cancels the bias

Writing $\mathbf{A}^{H}\mathbf{r}^t$ as a sum of independent terms conditioned on $\mathbb{F}_t$ yields a linear combination of Gaussian increments plus a systematic bias $\delta^{-1}\langle\eta'(u_{t-1})\rangle\,\mathbf{r}^{t-1}$ . The Onsager term in the AMP residual is constructed to cancel this bias exactly, leaving a pure Gaussian.

Variance bookkeeping

The residual $\mathbf{r}^t$ has entries with variance $\tau_t^2 \delta$ (the factor $\delta$ comes from $\mathbb{E}\|\mathbf{r}^t\|^2/M$ ). Hence $\mathbf{A}^{H}\mathbf{r}^t + \hat{\mathbf{x}}^t - \mathbf{x}$ is a Gaussian of per-entry variance $\tau_t^2$ . The denoiser is then applied, producing error $\mathbb{E}[(\eta(X+\tau_t Z;\theta_t)-X)^2]$ per coordinate. Plugging this back through the residual update gives the claimed recursion.

Passage to the limit

Bayati--Montanari (2011) rigorously show that, for Lipschitz $\eta$ , the empirical distribution of $(\hat{\mathbf{x}}^t, \mathbf{x}, \mathbf{A}^{H}\mathbf{r}^t)$ converges almost surely to the joint distribution $(\eta(X+\tau_t Z;\theta_t), X, X+\tau_t Z)$ in Wasserstein-2 sense. This upgrades the heuristic cavity argument into a theorem.

,

Definition:
State-Evolution Fixed Point

A fixed point of state evolution is a value $\tau^2_\star \ge 0$ with $\tau^2_\star = \sigma^2 + \frac{1}{\delta}\,\mathbb{E}\!\left[(\eta(X+\tau_\star Z;\theta_\star)-X)^2\right].$ Define $\Psi(\tau^2) = \sigma^2 + \delta^{-1}\mathbb{E}[(\eta(X+\tau Z;\theta(\tau))-X)^2]$ . A fixed point is stable if $\Psi'(\tau^2_\star) < 1$ and unstable otherwise.

When $\Psi$ has multiple fixed points, AMP generically converges to the stable fixed point encountered first by the iteration — typically the one with the largest $\tau^2$ . This is why AMP can miss the global optimum even when state evolution admits a smaller fixed point.

Theorem: AMP MSE Equals State-Evolution Fixed Point

Under the hypotheses of Theorem TState Evolution (Bayati--Montanari), let $\tau^2_\star = \lim_{t\to\infty} \tau_t^2$ be the limit of the SE recursion. Then $\lim_{N\to\infty}\frac{1}{N}\|\hat{\mathbf{x}}^\infty - \mathbf{x}\|^2 \;=\; \mathbb{E}\!\left[(\eta(X+\tau_\star Z;\theta_\star)-X)^2\right] \;=\; \delta\,(\tau_\star^2 - \sigma^2),$ almost surely.

The terminal MSE of AMP is determined entirely by the one scalar $\tau^2_\star$ — a remarkable dimensionality reduction from an $N$ -dimensional distribution to a single number.

Proof

Convergence of the recursion

$\Psi$ is monotone (a mixture of MSE curves, each monotone in input noise) and bounded, so $\tau_t^2$ is monotone in $t$ and converges to $\tau^2_\star$ . Since $\Psi$ is continuous, the limit is a fixed point.

Passing limits inside

By Lipschitz continuity of $\eta$ , the expectation $\mathbb{E}[(\eta(X+\tau Z;\theta)-X)^2]$ is continuous in $\tau$ . Combined with the empirical-distribution convergence from Theorem TState Evolution (Bayati--Montanari), we obtain the stated equality almost surely.

Connection to the Replica Prediction

If $\eta$ is the MMSE denoiser for the prior $p_X$ matched to effective noise $\tau^2$ , the state-evolution fixed point coincides with the replica-symmetric prediction of the Bayes-optimal MMSE, which is conjectured (and in many cases proved, e.g.\ Reeves--Pfister 2016) to be the fundamental information-theoretic limit of the Bayesian CS problem.

In other words: when AMP converges, AMP matched to the prior is asymptotically Bayes-optimal. This elevates AMP from a heuristic iterative solver to a provably optimal estimator in the proportional asymptotic regime.

Example: State Evolution, Noiseless LASSO, and the Donoho--Tanner Curve

Take $\sigma^2 = 0$ and a sparse signal with i.i.d. entries from $(1-\rho)\delta_0 + \rho p_{\neq 0}$ . Using AMP with soft-thresholding tuned optimally, derive the state-evolution fixed-point equation and identify the boundary between successful and failed recovery.

Solution

Scalar MMSE of the soft-threshold

Let $\mathrm{mse}_{\mathrm{st}}(\tau^2;\lambda) = \mathbb{E}[(\eta_{\mathrm{st}}(X+\tau Z;\lambda\tau)-X)^2]$ be the scalar MSE of soft-thresholding on the Bernoulli--Gaussian signal with threshold $\lambda\tau$ . Direct computation (integrating the Gaussian density against the piecewise-linear soft-threshold) gives a closed-form expression depending on $\rho$ and $\lambda$ only.

Normalised SE recursion

With $\sigma^2=0$ the recursion becomes $\tau_{t+1}^2 = \frac{1}{\delta}\,\mathrm{mse}_{\mathrm{st}}(\tau_t^2;\lambda_t).$ Scaling: $\mathrm{mse}_{\mathrm{st}}(\tau^2;\lambda\tau) = \tau^2 M(\rho,\lambda)$ for a function $M$ independent of $\tau$ . The recursion is then $\tau_{t+1}^2 = \delta^{-1} M(\rho,\lambda)\,\tau_t^2$ , a geometric decrease iff $M(\rho,\lambda)/\delta < 1$ .

Phase transition

Optimising over $\lambda$ for each $(\delta,\rho)$ yields the Donoho--Tanner phase boundary $\rho_{\mathrm{DT}}(\delta) = \max_\lambda \frac{\delta - M(\rho,\lambda)}{\mathrm{const}},$ i.e., the curve of $(\delta,\rho)$ below which $M^\star(\rho)/\delta<1$ and AMP converges to $\tau=0$ (exact recovery) and above which the iteration has a nonzero stable fixed point $\tau>0$ (recovery fails). This curve matches exactly the Donoho--Tanner combinatorial geometry result on phase transitions of $\ell_1$ recovery.

State Evolution vs Empirical AMP

Compares the scalar state-evolution prediction $\tau_t^2$ (deterministic recursion) against the empirical per-iteration MSE from a single run of AMP at finite $N$ . As $N$ grows the two curves lock onto each other.

Parameters

N1000

\delta

0.5

\rho

0.15

SNR (dB)40

Iterations20

Phase Transition Diagram

State-evolution phase diagram in the $(\delta,\rho)$ plane for noiseless sparse recovery with a Bernoulli--Gaussian signal. Colour = predicted terminal MSE of AMP with optimally tuned soft-thresholding. The curve separates the "recovery" and "failure" regions — the Donoho--Tanner phase boundary.

Parameters

State-Evolution Cobweb Diagram

Animated cobweb plot of

\tau_{t+1}^2 = \Psi(\tau_t^2)

showing the iteration climbing (or descending) to its fixed point. For

(\delta,\rho)

below the DT curve the recursion descends to

\tau=0

; above it settles on a positive fixed point.

State-evolution trajectory: stable fixed points act as attractors for the AMP variance.

Common Mistake: State Evolution Is an Asymptotic Statement

Mistake:

Reading off the SE prediction for a specific finite- $N$ problem and being surprised when AMP's empirical MSE differs by 20% or more. Worse, tuning $\lambda$ assuming SE holds exactly at $N=100$ .

Correction:

State evolution predicts $\tau_t^2$ in the limit $N \to \infty$ . For $N \lesssim 500$ deviations of 5--15% are normal. For $N=2000$ - $5000$ the agreement is typically tight (<2%). Always sanity-check by running AMP at two values of $N$ and confirming the empirical trajectories are converging to the SE prediction as $N$ grows.

State Evolution as a Diagnostic Tool

Beyond its role as a performance predictor, state evolution is useful as a diagnostic. If AMP's empirical MSE diverges from the SE prediction, one of the assumptions has been violated — typically: (i) the matrix $\mathbf{A}$ is not i.i.d.\ (Section 20.4); (ii) $N$ is too small; (iii) the denoiser is not Lipschitz. Each produces a characteristic signature that an experienced practitioner learns to recognise.

Key Takeaway

State evolution reduces the analysis of AMP — a random, high-dimensional, nonlinear dynamical system — to a one-dimensional deterministic recursion $\tau^2_{t+1} = \sigma^2 + \delta^{-1}\mathbb{E}[(\eta(X+\tau_t Z)-X)^2]$ . This is the most striking analytical feature of AMP and the reason phase transitions and Bayes-optimality can be computed in closed form.

State Evolution

What State Evolution Delivers

Theorem: State Evolution (Bayati--Montanari)

Conditioning on the iterates (cavity argument)

Onsager cancels the bias

Variance bookkeeping

Passage to the limit

Definition: State-Evolution Fixed Point

Theorem: AMP MSE Equals State-Evolution Fixed Point

Convergence of the recursion

Passing limits inside

Connection to the Replica Prediction

Example: State Evolution, Noiseless LASSO, and the Donoho--Tanner Curve

Scalar MMSE of the soft-threshold

Normalised SE recursion

Phase transition

State Evolution vs Empirical AMP

Parameters

Phase Transition Diagram

Parameters

State-Evolution Cobweb Diagram

Common Mistake: State Evolution Is an Asymptotic Statement

State Evolution as a Diagnostic Tool

Key Takeaway

Definition:
State-Evolution Fixed Point