The Bussgang Decomposition

Turning a Nonlinear Receiver into a Linear One

The quantizer QbQ_b is nonlinear, so the observation yq=Qb(Hx+w)\mathbf{y}_q = Q_b(\mathbf{H}\mathbf{x} + \mathbf{w}) has a non-Gaussian conditional distribution given x\mathbf{x}. Shannon capacity for a nonlinear channel has no closed form, and direct analysis of yq\mathbf{y}_q forces us through a jungle of multinomial distributions. The Bussgang decomposition sidesteps the jungle by writing the quantizer output as a linear function of the input plus an uncorrelated distortion:

yq=By+d,E[dyH]=0.\mathbf{y}_q = \mathbf{B}\mathbf{y} + \mathbf{d}, \qquad \mathbb{E}[\mathbf{d}\mathbf{y}^H] = \mathbf{0}.

The matrix B\mathbf{B} absorbs the "correlated" part of the nonlinearity; d\mathbf{d} contains the rest. Downstream, we treat d\mathbf{d} like additive noise — it is not Gaussian and not independent of y\mathbf{y} in general, but its zero cross-correlation with y\mathbf{y} is enough to compute linear-receiver SINRs. The idea is due to Bussgang (1952) for scalar Gaussian inputs through memoryless nonlinearities and was introduced to MIMO quantization analysis by Mezghani and Nossek (2012). It is now the standard tool for 1-bit and low-resolution massive MIMO.

,

Theorem: Bussgang's Theorem (Scalar Gaussian Input)

Let YN(0,σY2)Y \sim \mathcal{N}(0, \sigma_Y^2) and let g:RRg: \mathbb{R} \to \mathbb{R} be a memoryless (possibly nonlinear) function with E[g(Y)2]<\mathbb{E}[g(Y)^2] < \infty. Write Yq=g(Y)Y_q = g(Y). Then there exists a scalar BgRB_g \in \mathbb{R} such that Yq=BgY+dY_q = B_g\, Y + d with E[dY]=0\mathbb{E}[d\, Y] = 0. Explicitly,

Bg=E[Yg(Y)]σY2=E ⁣[g(Y)](Stein’s identity).B_g = \frac{\mathbb{E}[Y\, g(Y)]}{\sigma_Y^2} = \mathbb{E}\!\left[g'(Y)\right] \quad \text{(Stein's identity)}.

For the 1-bit quantizer g(y)=sign(y)g(y) = \operatorname{sign}(y) this gives B1=2πσY2.B_1 = \sqrt{\dfrac{2}{\pi\,\sigma_Y^2}}.

We are projecting YqY_q onto the subspace of random variables linear in YY. By the orthogonality principle, the residual is uncorrelated with YY. Stein's identity expresses this projection as an expected derivative of the nonlinearity — for a hard limiter the derivative is a delta at zero and the resulting integral is 2/(πσY2)\sqrt{2/(\pi\sigma_Y^2)}.

,

Definition:

Bussgang Matrix and Distortion Covariance (MIMO)

Let yCNr\mathbf{y} \in \mathbb{C}^{N_r} be zero-mean complex Gaussian with covariance Σy=E[yyH]\boldsymbol{\Sigma}_{y} = \mathbb{E}[\mathbf{y} \mathbf{y}^H] and yq=Qb(y)\mathbf{y}_q = Q_b(\mathbf{y}) the element-wise quantized vector. The Bussgang matrix is

BE[yqyH]Σy1.\mathbf{B} \triangleq \mathbb{E}[\mathbf{y}_q \mathbf{y}^H]\, \boldsymbol{\Sigma}_{y}^{-1}.

By construction the residual d=yqBy\mathbf{d} = \mathbf{y}_q - \mathbf{B} \mathbf{y} is uncorrelated with y\mathbf{y}, i.e. E[dyH]=0\mathbb{E}[\mathbf{d}\mathbf{y}^H] = \mathbf{0}. Its covariance is Σd=ΣyqBΣyBH\boldsymbol{\Sigma}_{d} = \boldsymbol{\Sigma}_{y_q} - \mathbf{B}\,\boldsymbol{\Sigma}_{y} \mathbf{B}^H. For a 1-bit quantizer applied element-wise, the diagonal entries of B\mathbf{B} are

Bnn=2π1[Σy]nn,B_{nn} = \sqrt{\tfrac{2}{\pi}}\, \frac{1}{\sqrt{[\boldsymbol{\Sigma}_{y}]_{nn}}},

and the quantized covariance follows the arcsine law

[Σyq]mn=2πarcsin ⁣([Σy]mn[Σy]mm[Σy]nn).[\boldsymbol{\Sigma}_{y_q}]_{mn} = \tfrac{2}{\pi}\arcsin\!\left( \tfrac{[\boldsymbol{\Sigma}_{y}]_{mn}} {\sqrt{[\boldsymbol{\Sigma}_{y}]_{mm}\,[\boldsymbol{\Sigma}_{y}]_{nn}}}\right).

The Bussgang matrix for the 1-bit quantizer depends only on the diagonal of Σy\boldsymbol{\Sigma}_{y}, so it is diagonal. The cross-antenna correlation is pushed into Σd\boldsymbol{\Sigma}_{d}, where the arcsine nonlinearity curbs the off-diagonals — a quantitative statement of "quantization de-correlates" used throughout the chapter.

,

Definition:

Bussgang Distortion Factor

For a uniform scalar quantizer with bb bits, optimized for a Gaussian input, define the Bussgang distortion factor

ρb1E[YQb(Y)]2σY2E[Qb(Y)2].\rho_b \triangleq 1 - \frac{\mathbb{E}[Y\,Q_b(Y)]^2} {\sigma_Y^2\,\mathbb{E}[Q_b(Y)^2]}.

For small resolution bb a table (Max 1960, Lloyd-Max optimized quantizers, Gaussian source) gives the following values, which are ubiquitous in 1-bit-MIMO papers:

bb ρb\rho_b κb=1ρb\kappa_b = 1-\rho_b (effective SNR scaling)
1 12/π0.36341 - 2/\pi \approx 0.3634 0.63660.6366
2 0.11750.1175 0.88250.8825
3 0.034540.03454 0.96550.9655
4 0.0094970.009497 0.99050.9905
5 0.0024990.002499 0.99750.9975

Every additional bit reduces ρb\rho_b by a factor of 4\approx 4, matching the usual "6 dB/bit" SQNR heuristic.

The factor κb=1ρb\kappa_b = 1 - \rho_b is the effective SNR attenuation of the quantizer: a signal with pre-quantization SNR SNR\text{SNR} emerges with κbSNR/(1κbSNR/(1+SNR))\kappa_b\,\text{SNR}/(1 - \kappa_b\,\text{SNR}/(1+\text{SNR})) post-quantization. At low SNR this simplifies to κbSNR\kappa_b\,\text{SNR}, so quantization scales the SNR by a fixed factor.

,

Theorem: Bussgang SINR for Linear Combining

Consider the KK-user massive-MIMO uplink with bb-bit ADCs on every antenna, perfect CSI, and Gaussian inputs. Apply a linear combiner vk\mathbf{v}_k to the quantized observation. Using the Bussgang decomposition, the achievable rate of user kk is

Rklog2 ⁣(1+κbvkHHk2PkκbjkvkHHj2Pj+vkH(κbσ2I+Σd)vk),R_k \,\geq\, \log_2\!\left(1 + \frac{\kappa_b\,|\mathbf{v}_k^H \mathbf{H}_{k}|^2\,P_k} {\kappa_b \sum_{j\neq k} |\mathbf{v}_k^H \mathbf{H}_{j}|^2 P_j + \mathbf{v}_k^H(\kappa_b \sigma^2\mathbf{I} + \boldsymbol{\Sigma}_{d}) \mathbf{v}_k}\right),

where κb=1ρb\kappa_b = 1 - \rho_b is the effective-gain factor from Definition DBussgang Distortion Factor and Σd\boldsymbol{\Sigma}_{d} is the Bussgang distortion covariance.

The linear-combining analysis we already know from Chapter 9 carries over verbatim, with two modifications: (i) desired and interfering powers shrink by κb\kappa_b, and (ii) an extra distortion term vkHΣdvk\mathbf{v}_k^H \boldsymbol{\Sigma}_{d} \mathbf{v}_k joins the thermal noise. In the massive-MIMO limit NrN_r \to \infty with i.i.d. users the distortion is diagonally dominant, so MRC and MMSE designs look almost identical to the infinite-precision case but with a constant penalty κb\kappa_b in the SINR denominator.

,

Effective SNR After bb-Bit Quantization

Sweep the per-antenna SNR and plot the post-Bussgang effective SNR SNReff(b)=κbSNR/(1+ρbSNR)\text{SNR}_{\text{eff}}(b) = \kappa_b\,\text{SNR}/(1 + \rho_b\,\text{SNR}) for b{1,2,3,4,}b \in \{1, 2, 3, 4, \infty\}. Observe the asymptotic SNR ceiling for small bb at high SNR — 1 bit caps around 4\sim 4 dB, 4 bits essentially match infinite precision over any reasonable range.

Parameters
-15
30

Example: Bussgang SINR in a Two-User Massive Uplink

A 1-bit massive-MIMO uplink has Nr=128N_r = 128 antennas and K=2K = 2 users. Both channels are i.i.d. Rayleigh with Hk2Nr\|\mathbf{H}_{k}\|^2 \approx N_r and both users transmit at per-antenna SNR SNR=5\text{SNR} = -5 dB (0.316\approx 0.316). The base station uses MRC. Estimate the SINR and rate of user 1.

Bussgang-LMMSE Detector for a 1-Bit Uplink

Complexity: O(Nr3+KNr2)\mathcal{O}(N_r^{3} + KN_r^{2}) per coherence block
Input: quantized observation y_q, channel H, powers P_k,
noise variance sigma^2, distortion factor rho_1 = 1 - 2/pi
Output: estimates x_hat for all users k = 1..K
// Step 1 — Covariance of the unquantized observation
Sigma_y <- H diag(P) H^H + sigma^2 I
// Step 2 — Bussgang diagonal matrix (1-bit)
for n = 1 .. N_r:
B[n,n] <- sqrt(2 / pi / Sigma_y[n,n])
// Step 3 — Quantized covariance via arcsine law
R_y <- diag(1/sqrt(diag(Sigma_y))) Sigma_y diag(1/sqrt(diag(Sigma_y)))
Sigma_yq <- (2/pi) * arcsin(R_y) // element-wise
// Step 4 — Distortion covariance
Sigma_d <- Sigma_yq - B * Sigma_y * B^H
// Step 5 — Bussgang-LMMSE combining
V <- (B H)^H ( B Sigma_y B^H + Sigma_d )^(-1)
// Step 6 — Decode
x_hat <- V * y_q
return x_hat

Steps 1–4 can be precomputed once per coherence block, amortizing their cost over hundreds of data symbols. The element-wise arcsine in Step 3 is the Van Vleck identity; for real-valued RyR_y it is a scalar operation on each off-diagonal entry.

,

Common Mistake: Distortion Is Uncorrelated — Not Independent

Mistake:

Because the Bussgang residual d\mathbf{d} is orthogonal to the input by construction, it is tempting to treat it as statistically independent Gaussian noise and plug it straight into mutual-information formulas.

Correction:

Orthogonality (zero cross-correlation) is not independence. d\mathbf{d} is a deterministic nonlinear function of y\mathbf{y}, so d\mathbf{d} and y\mathbf{y} share information through higher-order moments. What the Bussgang decomposition gives is (a) the linear part of the quantizer's input-output relationship exactly, and (b) a lower bound on the mutual information obtained by treating the distortion as the worst-case (Gaussian) noise with the correct covariance. The bound is tight at low SNR and becomes increasingly loose at high SNR — another reminder that 1-bit massive MIMO is a low-to-moderate SNR technology, not a high-SNR one.

Why This Matters: Bussgang Beyond Quantization

The Bussgang decomposition applies to any memoryless nonlinearity with finite second moment, not just ADC quantizers. It is the standard tool for analyzing nonlinear power amplifiers at the transmit side (Book CM Ch. 8), phase-noise distortion (Book FSP Ch. 9), and even nonlinear digital-predistortion residuals in modern base-station radios. The same linearization yout=Byin+d\mathbf{y}_{\text{out}} = \mathbf{B} \mathbf{y}_{\text{in}} + \mathbf{d} will reappear in Chapter 20 when we study phase-shifter quantization in hybrid beamforming — mirror image of the 1-bit ADC on the transmit side.

Key Takeaway

Bussgang linearizes any memoryless nonlinearity at the cost of a distortion term that is only uncorrelated with the input. For a bb-bit quantizer the result is a rate expression that looks like the infinite-precision one with two edits: desired and interfering powers scale by κb=1ρb\kappa_b = 1 - \rho_b, and an extra diagonal-dominant distortion covariance Σd\boldsymbol{\Sigma}_{d} is added to the noise. For 1-bit, κ1=2/π0.637\kappa_1 = 2/\pi \approx 0.637, recovering the 1.961.96 dB low-SNR loss of the previous section.

Bussgang decomposition

For any zero-mean Gaussian input y\mathbf{y} passed through a memoryless nonlinearity QQ, the identity Q(y)=By+dQ(\mathbf{y}) = \mathbf{B}\mathbf{y} + \mathbf{d} with E[dyH]=0\mathbb{E}[\mathbf{d}\mathbf{y}^H] = 0, where B\mathbf{B} is the Bussgang matrix and d\mathbf{d} the residual distortion. The workhorse for analyzing 1-bit and low-resolution MIMO receivers.

Related: 1-bit ADC, Mixed-ADC Massive MIMO Receiver, Arcsine Law for 1-Bit Correlation

Quick Check

For a 1-bit uniform quantizer applied to a zero-mean Gaussian input, which value of the Bussgang distortion factor ρ1\rho_1 (i.e., the residual power fraction) is correct?

ρ1=0\rho_1 = 0

ρ1=12/π0.363\rho_1 = 1 - 2/\pi \approx 0.363

ρ1=1/2\rho_1 = 1/2

ρ1=11/2π\rho_1 = 1 - 1/\sqrt{2\pi}