Broadcast Channel Capacity

The Broadcast Channel: One Transmitter, Many Receivers

The broadcast channel (BC) is the dual of the MAC: a single transmitter sends independent messages to KK receivers. This models the cellular downlink, Wi-Fi AP-to-STA transmission, and satellite broadcasting. Unlike the MAC, the BC capacity region was unknown for decades. The key breakthrough was superposition coding for the degraded BC (Cover, 1972) and dirty-paper coding (DPC) for the MIMO BC (Weingarten, Steinberg, and Shamai, 2006). The BC capacity region reveals a remarkable duality with the MAC: the MIMO BC capacity region can be computed via the MAC-BC duality, transforming a non-convex problem into a convex one.

Definition:

Degraded Broadcast Channel

A two-user broadcast channel is physically degraded if the channel outputs form a Markov chain:

X→Y1→Y2X \to Y_1 \to Y_2

meaning user 2's observation is a degraded version of user 1's.

Gaussian degraded BC: The transmitter sends XX with power constraint E[X2]≀PE[X^2] \leq P. User kk observes:

Yk=X+Zk,Zk∼N(0,Nk)Y_k = X + Z_k, \quad Z_k \sim \mathcal{N}(0, N_k)

with N1<N2N_1 < N_2 (user 1 has a better channel). This is a degraded BC because Y2=Y1+(Z2βˆ’Z1)Y_2 = Y_1 + (Z_2 - Z_1) can be written as a degraded version of Y1Y_1 when N1<N2N_1 < N_2.

The BC is stochastically degraded if the marginal distribution p(y2∣x)p(y_2 | x) can be expressed as βˆ‘y1p(y2∣y1)p(y1∣x)\sum_{y_1} p(y_2 | y_1) p(y_1 | x), even if the physical channel does not form a Markov chain.

The degraded BC model applies directly to the scalar Gaussian downlink where users experience different path losses. The MIMO BC is generally not degraded, requiring the more powerful DPC approach.

,

Theorem: Capacity Region of the Degraded Gaussian BC

The capacity region of the two-user degraded Gaussian BC (N1<N2N_1 < N_2, power PP) is the union over 0≀α≀10 \leq \alpha \leq 1 of:

R1≀12log⁑2 ⁣(1+(1βˆ’Ξ±)PN1+Ξ±P)=12log⁑2 ⁣(N1+PN1+Ξ±P)R_1 \leq \frac{1}{2}\log_2\!\left(1 + \frac{(1-\alpha)P}{N_1 + \alpha P}\right) = \frac{1}{2}\log_2\!\left(\frac{N_1 + P}{N_1 + \alpha P}\right)

R2≀12log⁑2 ⁣(1+Ξ±PN2)R_2 \leq \frac{1}{2}\log_2\!\left(1 + \frac{\alpha P}{N_2}\right)

where Ξ±P\alpha P is the power allocated to user 2's message and (1βˆ’Ξ±)P(1-\alpha)P to user 1's message in the superposition code.

The boundary is a concave curve parameterised by α∈[0,1]\alpha \in [0,1]:

  • Ξ±=0\alpha = 0: all power to user 1, R1=C1R_1 = C_1, R2=0R_2 = 0.
  • Ξ±=1\alpha = 1: all power to user 2, R1=0R_1 = 0, R2=12log⁑2(1+P/N2)R_2 = \frac{1}{2}\log_2(1 + P/N_2).

Superposition coding encodes two messages at different "layers." User 2 (weaker) decodes only its own message, treating user 1's signal as noise. User 1 (stronger) first decodes user 2's message (which it can do because it has a better channel), subtracts it, and then decodes its own message interference-free. The power split Ξ±\alpha controls the rate trade-off.

,

Definition:

Dirty-Paper Coding (DPC)

Dirty-paper coding (Costa, 1983) addresses the channel:

Y=X+S+ZY = X + S + Z

where SS is interference known non-causally at the transmitter (but not at the receiver), XX has power constraint PP, and Z∼N(0,N)Z \sim \mathcal{N}(0, N).

Costa's theorem: The capacity is:

C=12log⁑2 ⁣(1+PN)C = \frac{1}{2}\log_2\!\left(1 + \frac{P}{N}\right)

β€” the same as if SS were absent. The transmitter can completely pre-cancel the known interference without additional power, by encoding against it using a structured lattice or random binning code.

The optimal auxiliary random variable is U=X+Ξ±SU = X + \alpha S where Ξ±=P/(P+N)\alpha = P/(P + N), and the encoding is based on random binning in the joint typicality framework.

DPC is named after the analogy: writing on dirty paper (known stains SS) does not reduce the amount of information that can be conveyed, as long as the writer knows the stain pattern. DPC is the information-theoretic foundation of precoding at the base station in MIMO downlink (Chapter 17).

Theorem: MAC-BC Duality and MIMO BC Capacity

Consider the MIMO BC with ntn_t transmit antennas, KK single-antenna users, and channels hkH\mathbf{h}_k^H (row vectors). The capacity region of the MIMO BC under sum power constraint PP is:

CBC=⋃K1,…,KKβͺ°0tr(βˆ‘kKk)≀P{(R1,…,RK):Rk≀log⁑2 ⁣(∣I+βˆ‘j≀khkHKjhk∣∣I+βˆ‘j<khkHKjhk∣)}\mathcal{C}_{\mathrm{BC}} = \bigcup_{\substack{\mathbf{K}_1, \ldots, \mathbf{K}_K \succeq 0 \\ \mathrm{tr}(\sum_k \mathbf{K}_k) \leq P}} \left\{ (R_1, \ldots, R_K) : R_k \leq \log_2\!\left(\frac{|\mathbf{I} + \sum_{j \leq k} \mathbf{h}_k^H \mathbf{K}_j \mathbf{h}_k|}{|\mathbf{I} + \sum_{j < k} \mathbf{h}_k^H \mathbf{K}_j \mathbf{h}_k|}\right) \right\}

achieved by DPC with encoding order K,Kβˆ’1,…,1K, K-1, \ldots, 1.

MAC-BC duality (Vishwanath, Jindal, Goldsmith 2003): The capacity region of the MIMO BC with sum power PP equals the capacity region of the dual MIMO MAC (same channels, reversed direction) with sum power PP:

CBC(P)=⋃p1,…,pKβ‰₯0βˆ‘kpk≀PCMAC(p1,…,pK)\mathcal{C}_{\mathrm{BC}}(P) = \bigcup_{\substack{p_1, \ldots, p_K \geq 0 \\ \sum_k p_k \leq P}} \mathcal{C}_{\mathrm{MAC}}(p_1, \ldots, p_K)

This duality transforms the non-convex BC optimisation into the convex MAC problem, enabling efficient computation.

The MAC-BC duality states that any rate tuple achievable on the BC with DPC is also achievable on the dual MAC with SIC, and vice versa, under the same sum power constraint. This is remarkable because the BC uses DPC (non-linear precoding) while the MAC uses SIC (non-linear decoding), yet they achieve the same rate region.

,

Superposition Coding for the Broadcast Channel

Visualise the two-layer superposition coding strategy: the transmitter encodes messages at different power levels (cloud centres for the weak user, satellites for the strong user). The strong user decodes the weak user's message first, subtracts it, then decodes its own message interference-free.
Superposition coding: user 2 (weak) gets power Ξ±P\alpha P in cloud centres; user 1 (strong) gets (1βˆ’Ξ±)P(1-\alpha)P as satellites.

Degraded Gaussian BC Capacity Region

Visualise the capacity region of the two-user degraded Gaussian broadcast channel. The curve shows the achievable rate pairs (R1,R2)(R_1, R_2) parameterised by the power split Ξ±\alpha. User 1 has noise variance N1N_1 (stronger user) and user 2 has N2N_2 (weaker user). Adjust the total power PP and noise variances to see how the region changes. The slider Ξ±\alpha highlights the current operating point on the boundary.

Parameters
10
1
3
0.3

MIMO BC Sum Capacity

Compute and visualise the sum capacity of the MIMO broadcast channel as a function of SNR. The plot compares DPC (optimal), zero-forcing beamforming, and time-division to KK users. Adjust the number of transmit antennas ntn_t, number of users KK, and SNR to observe the multiplexing gain. At high SNR, ZF approaches DPC, while TDMA saturates at log⁑(SNR)\log(\text{SNR}) regardless of KK.

Parameters
4
2
15

Example: Superposition Coding Rate Computation

A base station with power P=20P = 20 serves two users with noise variances N1=1N_1 = 1 (near user) and N2=10N_2 = 10 (far user).

(a) Compute the capacity region boundary for α∈{0,0.3,0.5,0.7,1}\alpha \in \{0, 0.3, 0.5, 0.7, 1\}. (b) What power split maximises the sum rate? (c) Compare with TDMA (time-sharing between two single-user transmissions).

Quick Check

In dirty-paper coding for the channel Y=X+S+ZY = X + S + Z where SS is known at the transmitter, what is the capacity?

12log⁑2(1+P/(N+ΟƒS2))\frac{1}{2}\log_2(1 + P/(N + \sigma_S^2)) because SS acts as additional noise

12log⁑2(1+(P+ΟƒS2)/N)\frac{1}{2}\log_2(1 + (P + \sigma_S^2)/N) because the transmitter can use SS as extra power

12log⁑2(1+P/N)\frac{1}{2}\log_2(1 + P/N) because the known interference can be completely pre-cancelled

12log⁑2(1+P/N)βˆ’12log⁑2(1+ΟƒS2/N)\frac{1}{2}\log_2(1 + P/N) - \frac{1}{2}\log_2(1 + \sigma_S^2/N) due to partial pre-cancellation

Common Mistake: Wrong Superposition Decoding Order

Mistake:

Having the weak user decode the strong user's message first and subtract it, then decode its own β€” i.e., reversing the SIC order in the BC.

Correction:

In the degraded BC, the strong user (lower noise) decodes the weak user's cloud-centre message first (which it can do because its channel is better), subtracts it, and decodes its own. The weak user decodes only its own message, treating the strong user's signal as noise. Reversing this order fails because the weak user cannot reliably decode the strong user's message (it sees it through a noisier channel). This asymmetry is fundamental to superposition coding.

Historical Note: Costa's Dirty-Paper Coding Theorem

1983

Max Costa's 1983 paper "Writing on Dirty Paper" proved one of the most surprising results in information theory: known interference at the transmitter can be completely pre-cancelled at no cost in rate. The result was initially met with scepticism β€” it seemed too good to be true that arbitrarily strong interference could be pre-subtracted without using any additional power. The proof uses random binning and Gel'fand-Pinsker coding. Costa's theorem lay dormant for nearly two decades until Caire and Shamai (2003) and Weingarten, Steinberg, and Shamai (2006) showed that DPC achieves the capacity region of the MIMO broadcast channel, transforming it from a theoretical curiosity into the foundation of modern multi-user MIMO precoding.

Why This Matters: From DPC Theory to Practical MIMO Precoding

The MIMO BC capacity region (achieved by DPC) provides the theoretical benchmark for all practical downlink precoding schemes. The MIMO book develops the practical side: zero-forcing and regularised ZF precoding that approach DPC performance in the massive MIMO regime (Nt≫KN_t \gg K), JSDM (Adhikary/Nam/Ahn/Caire) for FDD massive MIMO that exploits channel covariance structure, and cell-free massive MIMO where distributed precoding must account for fronthaul constraints. The gap between DPC and linear precoding shrinks with more antennas β€” a key insight that makes massive MIMO practical.

Superposition Coding

A coding strategy for the broadcast channel where the transmitter encodes multiple messages at different power levels (layers). Stronger receivers decode and strip weaker users' messages first, then decode their own. Achieves the capacity of the degraded BC.

Related: Dirty-Paper Coding (DPC)

Dirty-Paper Coding (DPC)

A coding technique for channels with non-causally known interference at the transmitter. Costa's theorem shows that known interference can be pre-cancelled without rate loss. DPC achieves the capacity region of the MIMO broadcast channel.

Related: Superposition Coding, MAC-BC Duality

MAC-BC Duality

The capacity region of the MIMO BC with sum power PP equals the union of MAC capacity regions over all power allocations summing to PP. This transforms the non-convex BC optimisation into a convex problem.

Related: Dirty-Paper Coding (DPC)