GAMP: Generalized Approximate Message Passing

Beyond Gaussian Measurements

AMP, OAMP, and VAMP all assume Gaussian measurement noise: $\mathbf{y} = \mathbf{A}\mathbf{x} + \mathbf{w}$ with $\mathbf{w} \sim \mathcal{CN}(\mathbf{0},\sigma^2\mathbf{I})$. But many measurements in the real world are manifestly non-Gaussian: Poisson photon counts in astronomy and medical imaging, 1-bit or few-bit quantized samples from low-power ADCs, log-linear counts in spectroscopy, and logistic outcomes in classification.

GAMP (Generalized AMP) extends the AMP framework to generalized linear models (GLMs), where each measurement depends on $z_i = \langle \mathbf{a}_i,\mathbf{x}\rangle$ through an arbitrary per-element likelihood $p(y_i|z_i)$. The algorithm retains AMP's two-scalar state-evolution structure but adds a new component, an output channel denoiser, that handles the non-Gaussian likelihood.

The beauty of GAMP is that it treats the nonlinearity at the measurement end and the prior at the signal end symmetrically: both are handled by local MMSE steps, one in signal space, one in measurement space.

Definition:

Generalized Linear Model (GLM)

A generalized linear model is a Markov chain $\mathbf{x} \to \mathbf{z} \to \mathbf{y}$ where $\mathbf{z} = \mathbf{A}\mathbf{x}$ is the linear image of the signal, and the measurement is produced element-wise through a fixed conditional distribution

$$p(\mathbf{y}|\mathbf{z}) = \prod_{i=1}^{M} p(y_i|z_i).$$

The likelihood family $p(\cdot|\cdot)$ characterizes the GLM:

  • $p(y|z) = \mathcal{N}(y;z,\sigma^2)$ → standard linear regression;
  • $p(y|z) = \mathrm{Poisson}(\lambda = e^z)$ → Poisson regression;
  • $p(y|z) = \mathrm{Bernoulli}(\mathrm{sigmoid}(z))$ → logistic regression;
  • $p(y|z) = \Pr\{y = \mathrm{sign}(z+w)\} = \Phi(yz/\sigma_w)$ with $w \sim \mathcal{N}(0,\sigma_w^2)$ → 1-bit compressed sensing.

The GLM abstraction is a modelling unifier: once GAMP knows how to handle any such per-element likelihood, a vast range of inverse problems reduces to instances of a single algorithm with a user-supplied output channel denoiser.
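
To make the abstraction concrete, here is a minimal Python sketch of these per-element log-likelihoods (our function names, assuming NumPy and SciPy are available; each returns $\log p(y|z)$ element-wise):

```python
import numpy as np
from scipy.stats import norm, poisson

def loglik_gaussian(y, z, sigma2=1.0):
    # Standard linear regression: y = z + N(0, sigma2)
    return norm.logpdf(y, loc=z, scale=np.sqrt(sigma2))

def loglik_poisson(y, z):
    # Poisson regression with log link: y ~ Poisson(exp(z))
    return poisson.logpmf(y, mu=np.exp(z))

def loglik_logistic(y, z):
    # Logistic regression: y in {0, 1}, P(y = 1 | z) = sigmoid(z)
    return y * z - np.log1p(np.exp(z))

def loglik_one_bit(y, z, sigma_w=1.0):
    # 1-bit CS: y in {-1, +1}, y = sign(z + w), w ~ N(0, sigma_w^2)
    return norm.logcdf(y * z / sigma_w)
```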

Definition:

GAMP Iteration

With $\mathbf{A}$ having i.i.d. zero-mean entries with variance $1/M$, GAMP iterates

$$\begin{aligned} \text{output:} \quad & \hat{z}_i^{(t)} = [\mathbf{A}\hat{\mathbf{x}}^{(t)}]_i - \tau_p^{(t)} \hat{s}_i^{(t-1)}, \\ & \hat{s}_i^{(t)} = g_{\text{out}}\big(y_i, \hat{z}_i^{(t)}, \tau_p^{(t)}\big), \qquad \tau_s^{(t)} = -\big\langle \partial_{\hat{z}}\, g_{\text{out}} \big\rangle, \\ \text{input:} \quad & \hat{r}_j^{(t)} = \hat{x}_j^{(t)} + \tau_r^{(t)} [\mathbf{A}^{\mathsf{H}}\hat{\mathbf{s}}^{(t)}]_j, \\ & \hat{x}_j^{(t+1)} = g_{\text{in}}\big(\hat{r}_j^{(t)}, \tau_r^{(t)}\big), \qquad \tau_r^{(t+1)} \text{ computed from } \tau_s^{(t)}. \end{aligned}$$
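
To make the recursion concrete, here is a minimal Python sketch of the loop (our illustrative code, not a reference implementation), assuming real-valued data, i.i.d. entries of $\mathbf{A}$ with variance $1/M$ (so $\sum_j A_{ij}^2 \approx N/M$ and $\sum_i A_{ij}^2 \approx 1$), and scalar variances; `g_out_gaussian` and `g_in_bg` are example denoisers for an AWGN channel and a Bernoulli-Gaussian prior:

```python
import numpy as np
from scipy.special import expit

def gamp(A, y, g_out, g_in, n_iter=20):
    """Minimal GAMP loop with scalar (uniform) variances.

    g_out(y, z_hat, tau_p) -> (s_hat, tau_s): output-channel denoiser
    g_in(r_hat, tau_r)     -> (x_hat, tau_x): input (signal-prior) denoiser
    """
    M, N = A.shape
    x_hat, tau_x = np.zeros(N), 1.0
    s_hat = np.zeros(M)
    for _ in range(n_iter):
        # Output side: Onsager-corrected prediction of z = A x.
        tau_p = (N / M) * tau_x
        z_hat = A @ x_hat - tau_p * s_hat
        s_hat, tau_s = g_out(y, z_hat, tau_p)
        # Input side: pseudo-observation r = x + N(0, tau_r).
        tau_r = 1.0 / tau_s              # since sum_i A_ij^2 ~= 1
        r_hat = x_hat + tau_r * (A.T @ s_hat)
        x_hat, tau_x = g_in(r_hat, tau_r)
    return x_hat

def g_out_gaussian(y, z_hat, tau_p, sigma2=0.01):
    # AWGN channel: the closed-form scalar update discussed below.
    return (y - z_hat) / (sigma2 + tau_p), 1.0 / (sigma2 + tau_p)

def g_in_bg(r_hat, tau_r, rho=0.1, sigma_x2=1.0):
    # MMSE denoiser for x ~ rho * N(0, sigma_x2) + (1 - rho) * delta_0.
    v = sigma_x2 + tau_r
    log_odds = (np.log(rho / (1 - rho)) + 0.5 * np.log(tau_r / v)
                + 0.5 * r_hat**2 * sigma_x2 / (tau_r * v))
    pi = expit(log_odds)                 # posterior support probability
    m, V = r_hat * sigma_x2 / v, tau_r * sigma_x2 / v
    x_hat = pi * m
    tau_x = np.mean(pi * (V + m**2) - x_hat**2)
    return x_hat, tau_x
```

Calling `gamp(A, y, g_out_gaussian, g_in_bg)` reproduces standard AMP behavior; swapping in a different `g_out` retargets the same loop to any GLM.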

The output channel denoiser $g_{\text{out}}(y,\hat{z},\tau_p)$ is the scaled MMSE correction to the prediction $\hat{z}$ (equivalently, the score $\partial_{\hat{z}} \log \int p(y|z)\,\mathcal{N}(z;\hat{z},\tau_p)\,dz$ of the marginal likelihood),

$$g_{\text{out}}(y,\hat{z},\tau_p) = \frac{1}{\tau_p}\left(\mathbb{E}[z|y,\hat{z},\tau_p] - \hat{z}\right),$$

where the expectation is under the distribution $z \sim \mathcal{N}(\hat{z},\tau_p)$ reweighted by the likelihood $p(y|z)$. The input denoiser $g_{\text{in}}$ is the familiar signal-prior MMSE estimator.

Setting $p(y|z) = \mathcal{N}(y;z,\sigma^2)$ recovers standard AMP with $g_{\text{out}}(y,\hat{z},\tau_p) = (y - \hat{z})/(\sigma^2 + \tau_p)$. Every other likelihood simply changes this scalar Bayesian update.

Theorem: GAMP State Evolution

For i.i.d. sub-Gaussian sensing matrices $\mathbf{A}$ and any GLM likelihood with bounded output MMSE, GAMP's iterates admit, in the large-system limit, a scalar state evolution

$$\tau_p^{(t+1)} = \mathcal{F}_{\text{in}}\big(\tau_r^{(t)}; p_X\big), \qquad \tau_r^{(t+1)} = \mathcal{F}_{\text{out}}\big(\tau_p^{(t+1)}; p(y|z), p_Z\big),$$

where $\mathcal{F}_{\text{in}}$ is (up to the sampling ratio $N/M$) the MSE of the input denoiser under effective noise $\tau_r$, and $\mathcal{F}_{\text{out}}$ is the inverse Fisher information of the scalar channel $z \sim \mathcal{N}(\hat{z},\tau_p)$, $y \sim p(y|z)$.

Once again the whole algorithm collapses to two scalar numbers at each iteration: the effective noise variance on the signal side ($\tau_r$) and on the measurement side ($\tau_p$). The presence of two variances (rather than one) reflects that GLM inference is a two-stage problem: decode the linear image from the measurements, then decode the signal from the linear image.
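
To see the two-map recursion in action, here is a hedged numerical sketch for the AWGN channel and Bernoulli-Gaussian prior (our code; $\mathcal{F}_{\text{out}}$ is closed-form for the Gaussian channel, $\mathcal{F}_{\text{in}}$ is estimated by Monte Carlo, and the parameter values are illustrative):

```python
import numpy as np
from scipy.special import expit

rng = np.random.default_rng(0)
rho, sigma_x2, sigma2, delta = 0.1, 1.0, 0.01, 0.5   # delta = M/N
n_mc = 200_000

def mse_in(tau_r):
    # F_in: Monte Carlo MSE of the Bernoulli-Gaussian MMSE denoiser.
    x = rng.normal(0, np.sqrt(sigma_x2), n_mc) * (rng.random(n_mc) < rho)
    r = x + rng.normal(0, np.sqrt(tau_r), n_mc)
    v = sigma_x2 + tau_r
    log_odds = (np.log(rho / (1 - rho)) + 0.5 * np.log(tau_r / v)
                + 0.5 * r**2 * sigma_x2 / (tau_r * v))
    x_hat = expit(log_odds) * r * sigma_x2 / v
    return np.mean((x_hat - x) ** 2)

tau_p = rho * sigma_x2 / delta       # initial measurement-side variance
for t in range(15):
    tau_r = sigma2 + tau_p           # F_out: inverse Fisher info, AWGN channel
    tau_p = mse_in(tau_r) / delta    # F_in, scaled by N/M = 1/delta
    print(f"t={t:2d}  tau_r={tau_r:.3e}  tau_p={tau_p:.3e}")
```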


Example: GAMP for 1-Bit Compressed Sensing

Consider 1-bit compressed sensing: the true signal $\mathbf{x}$ is sparse, the linear image is $z_i = \langle\mathbf{a}_i,\mathbf{x}\rangle$, and each measurement is observed as $y_i = \mathrm{sign}(z_i + w_i) \in \{-1,+1\}$ with $w_i \sim \mathcal{N}(0,\sigma_w^2)$, so that $p(y_i|z_i) = \Phi(y_i z_i/\sigma_w)$. Derive the output channel denoiser $g_{\text{out}}$.
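
A worked sketch of the derivation (ours; it uses the score identity from the definition of $g_{\text{out}}$ above). With prediction prior $z \sim \mathcal{N}(\hat z, \tau_p)$ and probit likelihood $p(y|z) = \Phi(yz/\sigma_w)$:

$$\begin{aligned} Z(y,\hat z) &= \int \Phi\!\Big(\frac{yz}{\sigma_w}\Big)\,\mathcal{N}(z;\hat z,\tau_p)\,dz = \Phi(\eta), \qquad \eta = \frac{y\hat z}{\sqrt{\sigma_w^2+\tau_p}},\\ \mathbb{E}[z\mid y,\hat z,\tau_p] &= \hat z + \tau_p\,\partial_{\hat z}\log Z = \hat z + \frac{y\,\tau_p}{\sqrt{\sigma_w^2+\tau_p}}\cdot\frac{\phi(\eta)}{\Phi(\eta)},\\ g_{\text{out}}(y,\hat z,\tau_p) &= \frac{\mathbb{E}[z\mid y,\hat z,\tau_p]-\hat z}{\tau_p} = \frac{y}{\sqrt{\sigma_w^2+\tau_p}}\cdot\frac{\phi(\eta)}{\Phi(\eta)}. \end{aligned}$$

Note the ratio $\phi(\eta)/\Phi(\eta)$: it decays to zero when $\hat z$ agrees strongly with the observed sign and grows when it disagrees, the saturating behavior visualized in the denoiser demo below.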

Definition:

EM-GAMP for Automatic Hyperparameter Tuning

EM-GAMP interleaves GAMP iterations with EM updates of unknown hyperparameters $\boldsymbol{\theta} = (\rho, \sigma_x^2, \sigma^2)$:

$$\boldsymbol{\theta}^{(t+1)} = \arg\max_{\boldsymbol{\theta}} \, \mathbb{E}_{p(\mathbf{x}|\mathbf{y};\boldsymbol{\theta}^{(t)})}\big[\log p(\mathbf{x},\mathbf{y};\boldsymbol{\theta})\big],$$

where the expectation over the posterior is approximated by the marginals produced at the end of the last GAMP run. The E-step is free, since GAMP already supplies the marginal moments, and the M-step reduces to closed-form updates for common parametric priors.

In practice one runs $K$ GAMP iterations, updates $\boldsymbol{\theta}$ once, and repeats. Convergence is typically reached in 5-20 outer iterations. EM-GAMP is a workhorse when priors are unknown but parametric.
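
For the Bernoulli-Gaussian prior used throughout, the M-step is indeed closed-form. A hedged Python sketch (the function and its argument convention are ours; it assumes the last GAMP run exposes per-coefficient posterior support probabilities `pi`, conditional means `m` and variances `V` of the nonzero component, and posterior moments `z_hat`, `tau_z` of $\mathbf{z} = \mathbf{A}\mathbf{x}$):

```python
import numpy as np

def em_update_bg(pi, m, V, y, z_hat, tau_z):
    """One EM M-step for a Bernoulli-Gaussian prior and AWGN likelihood."""
    rho = np.mean(pi)                                # expected fraction of nonzeros
    sigma_x2 = np.sum(pi * (m**2 + V)) / np.sum(pi)  # 2nd moment of active coeffs
    sigma2 = np.mean((y - z_hat)**2 + tau_z)         # AWGN noise-variance update
    return rho, sigma_x2, sigma2
```

In the outer loop, the updated values are fed back into the input and output denoisers for the next $K$ GAMP iterations.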

1-Bit Compressed Sensing with GAMP

Recover a sparse signal from 1-bit quantized measurements as a function of the oversampling ratio $M/N$. Compare GAMP against the AMP baseline that treats 1-bit measurements as real-valued (a model mismatch).

[Interactive demo. Parameters: 400, 0.1, 20.]
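
A self-contained sketch of this experiment (our code, not the page's interactive widget; the values mirror the parameters listed above, and the mismatched AMP baseline would amount to swapping `g_out_1bit` for a Gaussian `g_out`):

```python
import numpy as np
from scipy.stats import norm
from scipy.special import expit

rng = np.random.default_rng(1)
N, rho, n_iter = 400, 0.1, 20
sigma_w2, sigma_x2 = 0.01, 1.0

def g_out_1bit(y, z_hat, tau_p):
    # Probit output denoiser from the worked example above.
    c = np.sqrt(sigma_w2 + tau_p)
    eta = y * z_hat / c
    ratio = norm.pdf(eta) / np.maximum(norm.cdf(eta), 1e-12)
    s = y * ratio / c
    tau_s = np.mean(ratio * (eta + ratio)) / c**2   # -<d g_out / d z_hat>
    return s, tau_s

def g_in_bg(r, tau_r):
    # Same Bernoulli-Gaussian MMSE denoiser as in the GAMP sketch above.
    v = sigma_x2 + tau_r
    log_odds = (np.log(rho / (1 - rho)) + 0.5 * np.log(tau_r / v)
                + 0.5 * r**2 * sigma_x2 / (tau_r * v))
    pi = expit(log_odds)
    m, V = r * sigma_x2 / v, tau_r * sigma_x2 / v
    x_hat = pi * m
    tau_x = np.mean(pi * (V + m**2) - x_hat**2)
    return x_hat, tau_x

for M in (200, 400, 800, 1600):                      # sweep oversampling M/N
    A = rng.normal(0, 1 / np.sqrt(M), (M, N))
    x = rng.normal(0, np.sqrt(sigma_x2), N) * (rng.random(N) < rho)
    y = np.sign(A @ x + rng.normal(0, np.sqrt(sigma_w2), M))
    x_hat, tau_x, s_hat = np.zeros(N), rho * sigma_x2, np.zeros(M)
    for _ in range(n_iter):
        tau_p = (N / M) * tau_x
        z_hat = A @ x_hat - tau_p * s_hat
        s_hat, tau_s = g_out_1bit(y, z_hat, tau_p)
        tau_r = 1.0 / tau_s
        x_hat, tau_x = g_in_bg(x_hat + tau_r * (A.T @ s_hat), tau_r)
    # 1-bit measurements lose amplitude; compare up to scale.
    corr = abs(x_hat @ x) / (np.linalg.norm(x_hat) * np.linalg.norm(x) + 1e-12)
    print(f"M/N = {M/N:.1f}: normalized correlation = {corr:.3f}")
```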

Output Denoisers for Common GLMs

Visualize the output channel denoiser $g_{\text{out}}(y,\hat{z},\tau_p)$ as a function of $\hat{z}$ for three common likelihoods: Gaussian, Poisson, and 1-bit. Notice how the 1-bit denoiser saturates while the Gaussian one is linear.

[Interactive demo. Parameter: effective noise variance on $z$, default 1.]
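
A sketch of such a plot (our code; the $y$ values and channel parameters are arbitrary illustrative choices, and the Poisson curve is computed by numerical quadrature since its posterior mean has no closed form):

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

tau_p = 1.0                          # effective noise variance on z
z_hat = np.linspace(-4, 4, 400)
zg = np.linspace(-10, 10, 2001)      # integration grid for the Poisson case

def g_out_numeric(loglik, z_hat, tau_p):
    # Generic g_out via quadrature: posterior mean of z under
    # N(z; z_hat, tau_p) reweighted by the likelihood p(y|z).
    out = np.empty_like(z_hat)
    for k, zh in enumerate(z_hat):
        w = np.exp(loglik(zg) - 0.5 * (zg - zh)**2 / tau_p)
        out[k] = ((zg * w).sum() / w.sum() - zh) / tau_p
    return out

# Gaussian channel, y = 0.5, sigma^2 = 0.1: closed form, linear in z_hat.
g_gauss = (0.5 - z_hat) / (0.1 + tau_p)
# Poisson channel, y = 3, lambda = exp(z): numerical.
g_poiss = g_out_numeric(lambda z: 3 * z - np.exp(z), z_hat, tau_p)
# 1-bit channel, y = +1, sigma_w = 0.1: closed form from the worked example.
c = np.sqrt(0.1**2 + tau_p)
eta = z_hat / c
g_1bit = norm.pdf(eta) / norm.cdf(eta) / c

for g, lbl in [(g_gauss, "Gaussian"), (g_poiss, "Poisson"), (g_1bit, "1-bit")]:
    plt.plot(z_hat, g, label=lbl)
plt.xlabel(r"$\hat z$")
plt.ylabel(r"$g_{\rm out}$")
plt.legend()
plt.show()
```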
🚨 Critical Engineering Note

GAMP Is More Fragile Than OAMP

Even on i.i.d. Gaussian sensing matrices, GAMP frequently diverges without damping. The two coupled transfer functions $(\mathcal{F}_{\text{in}}, \mathcal{F}_{\text{out}})$ can create slowly converging or oscillating loops, particularly for heavy-tailed priors and strongly nonlinear output channels (1-bit, Poisson with low counts).

Standard remedies:

  • Damping on both scalar states with factor $\beta \approx 0.5$–$0.8$ (see the sketch after this list).
  • Variance flooring: clamp $\tau_p, \tau_r$ away from zero.
  • "Mean-only" damping: damp the vector iterates $\hat{\mathbf{x}}, \hat{\mathbf{s}}$ but not the precisions.
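
A minimal sketch of the first two remedies, with illustrative damping factor and floor value (names match the GAMP loop sketched earlier):

```python
def damp(new, old, beta=0.7):
    # Convex-combination damping; beta = 1 recovers the undamped update.
    return beta * new + (1 - beta) * old

# Inside the GAMP loop (names as in the sketch above):
#   s_hat = damp(s_new, s_hat)                  # damp vector iterates
#   x_hat = damp(x_new, x_hat)
#   tau_p = max(damp(tau_p_new, tau_p), 1e-10)  # damp and floor the variance
```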

For structured sensing matrices, the right-rotationally-invariant (RRI) counterparts of GAMP ("GVAMP") are more robust and should be preferred in production.

Common Mistake: Output Channel Sign Convention

Mistake:

Confusing the sign of $g_{\text{out}}$ with the sign of the residual $y-\hat{z}$. For Gaussian channels they agree; for Poisson or 1-bit channels the scaling is different and the sign can differ relative to the assumed operating point.

Correction:

Derive $g_{\text{out}}$ directly from the definition $g_{\text{out}}(y,\hat{z},\tau_p) = (\mathbb{E}[z|y,\hat{z},\tau_p] - \hat{z})/\tau_p$ and verify on a simple test case (scalar Gaussian likelihood with known SNR) that the output matches the closed-form solution. Then trust the derivation in the generalized case.
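
Such a check takes a few lines. A sketch (ours): estimate $\mathbb{E}[z|y,\hat z,\tau_p]$ by weighted Monte Carlo for a Gaussian likelihood and compare with the closed form $(y-\hat z)/(\sigma^2+\tau_p)$ from earlier:

```python
import numpy as np

rng = np.random.default_rng(0)
sigma2, tau_p, y, z_hat = 0.5, 2.0, 1.3, -0.4

# Sample z from the prediction prior N(z_hat, tau_p), weight by the
# likelihood p(y|z), and form the posterior mean of z.
z = rng.normal(z_hat, np.sqrt(tau_p), 1_000_000)
w = np.exp(-0.5 * (y - z)**2 / sigma2)
g_mc = ((z * w).sum() / w.sum() - z_hat) / tau_p

g_closed = (y - z_hat) / (sigma2 + tau_p)   # AWGN closed form
print(g_mc, g_closed)                        # should agree to ~3 decimals
```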

Historical Note: Rangan and the Birth of GAMP

2011-2013

Sundeep Rangan's 2011 ISIT paper "Generalized Approximate Message Passing for Estimation with Random Linear Mixing" marked the transition of AMP from a specialized sparse-recovery algorithm to a general-purpose Bayesian inference engine. The key insight, that the Onsager-corrected residual on the measurement side could be reused for any generalized linear model, opened the door to a flood of applications in quantized sensing, phase retrieval, and machine learning.

The subsequent EM-GAMP variant (Vila & Schniter, 2013) made GAMP parameter-free, eliminating the need to specify the prior or noise level in advance. For a decade GAMP was the default engine for Bayesian compressed sensing in applications ranging from hyperspectral unmixing to quantitative MRI.

GAMP (Generalized AMP)

An extension of AMP to generalized linear models with per-element likelihoods. Adds an output-channel denoiser to handle non-Gaussian measurements (1-bit, Poisson, etc.) while preserving the scalar state-evolution structure of AMP.

Related: Generalized linear model (GLM), EM-GAMP

Generalized linear model (GLM)

A statistical model where the measurement $y_i$ depends on a linear image $z_i = \langle\mathbf{a}_i,\mathbf{x}\rangle$ only through a fixed per-element likelihood $p(y_i|z_i)$. Generalizes linear regression to Poisson, logistic, quantized, and many other observation models.

Related: GAMP (Generalized AMP)

EM-GAMP

A variant of GAMP that alternates message-passing iterations with EM updates of the prior and noise hyperparameters. Enables automatic tuning without prior specification of $\rho$, $\sigma_x^2$, or $\sigma^2$.

Related: GAMP (Generalized AMP)

Quick Check

In GAMP, which of the following determines the output denoiser $g_{\text{out}}$?

  • The signal prior $p_X(\mathbf{x})$ only.
  • The per-element measurement likelihood $p(y|z)$ and the current state $\tau_p$.
  • The structure of the sensing matrix $\mathbf{A}$.
  • The signal sparsity $\rho$.

Why This Matters: 1-Bit GAMP in Massive MIMO

Massive MIMO uplinks with 1-bit ADCs are a rare wireless setting where a non-Gaussian likelihood dominates the receiver design. Each antenna's ADC outputs one bit per real dimension per sampling instant: the signs of the in-phase and quadrature components of the received signal plus noise. Channel estimation in this regime is exactly the 1-bit compressed-sensing problem, and GAMP with the 1-bit output denoiser is competitive with much more expensive algorithms.

The CommIT group's activity-detection pipeline (Ch 14) leverages this observation: the two-step GAMP structure (output denoiser for the quantized measurement, input denoiser for the sparse user activity) gives a principled low-complexity receiver that the group has extended to unsourced random access with large numbers of potential users.

See the full treatment in Chapter 14, Section sec-activity-detection.