The MIMO Broadcast Channel

From Discrete to Gaussian MIMO

We now turn to the broadcast channel that matters most for modern wireless systems: the Gaussian MIMO broadcast channel. This is the downlink of a multi-antenna base station serving multiple single- or multi-antenna users.

The MIMO BC is not degraded in general β€” two users with different spatial channel matrices H1\mathbf{H}_{1} and H2\mathbf{H}_{2} cannot be ordered by a degradation relation unless their channels happen to have a special structure. This means superposition coding alone is insufficient. The remarkable result, due to Weingarten, Steinberg, and Shamai (2006), is that dirty-paper coding (DPC) achieves the entire capacity region of the Gaussian MIMO BC.

The point is that DPC transforms the MIMO BC into an effective degraded channel by pre-canceling the interference caused by each user's signal to the subsequently encoded users β€” exactly Costa's result from Chapter 12, applied sequentially.

Definition:

The KK-User Gaussian MIMO Broadcast Channel

The transmitter has ntn_t antennas and serves KK receivers, each with nr,kn_{r,k} antennas. The channel to user kk is: yk=Hkx+zk,k=1,…,K\mathbf{y}_k = \mathbf{H}_{k} \mathbf{x} + \mathbf{z}_k, \quad k = 1, \ldots, K where:

  • x∈Cnt\mathbf{x} \in \mathbb{C}^{n_t} is the transmitted signal with power constraint E[βˆ₯xβˆ₯2]≀P\mathbb{E}[\|\mathbf{x}\|^2] \leq P
  • Hk∈Cnr,kΓ—nt\mathbf{H}_{k} \in \mathbb{C}^{n_{r,k} \times n_t} is the channel matrix to user kk
  • zk∼CN(0,Οƒk2I)\mathbf{z}_k \sim \mathcal{CN}(\mathbf{0}, \sigma^2_{k} \mathbf{I}) is i.i.d. Gaussian noise

The encoder sends independent messages M1,…,MKM_1, \ldots, M_K at rates R1,…,RKR_{1}, \ldots, R_{K}, and each receiver kk decodes only MkM_k.

MIMO broadcast channel

The downlink of a multi-antenna transmitter serving multiple receivers simultaneously. Each receiver sees the transmitted signal through its own spatial channel matrix, making the channel non-degraded in general.

Related: Dirty-paper coding, MIMO MAC, Linear precoding

Definition:

DPC Encoding for the MIMO BC

Fix an encoding order Ο€=(Ο€(1),…,Ο€(K))\pi = (\pi(1), \ldots, \pi(K)), a permutation of {1,…,K}\{1, \ldots, K\}. The DPC encoder processes users in the order Ο€(1),Ο€(2),…,Ο€(K)\pi(1), \pi(2), \ldots, \pi(K):

  1. User Ο€(1)\pi(1): Encode MΟ€(1)M_{\pi(1)} using a Gaussian codebook with covariance KΟ€(1)\mathbf{K}_{\pi(1)}, producing xΟ€(1)\mathbf{x}_{\pi(1)}.
  2. User Ο€(k)\pi(k), kβ‰₯2k \geq 2: The signals xΟ€(1),…,xΟ€(kβˆ’1)\mathbf{x}_{\pi(1)}, \ldots, \mathbf{x}_{\pi(k-1)} are known non-causally at the encoder (since the encoder generated them). Treat sk=βˆ‘j=1kβˆ’1HΟ€(k)xΟ€(j)\mathbf{s}_k = \sum_{j=1}^{k-1} \mathbf{H}_{\pi(k)} \mathbf{x}_{\pi(j)} as "interference known at the encoder" and apply Costa's dirty-paper coding to pre-cancel it. Encode MΟ€(k)M_{\pi(k)} with covariance KΟ€(k)\mathbf{K}_{\pi(k)}.
  3. Transmit: x=βˆ‘k=1KxΟ€(k)\mathbf{x} = \sum_{k=1}^K \mathbf{x}_{\pi(k)} with βˆ‘k=1Ktr(KΟ€(k))≀P\sum_{k=1}^K \text{tr}(\mathbf{K}_{\pi(k)}) \leq P.

Theorem: DPC Achievable Rate Region

For a fixed encoding order Ο€\pi and covariance matrices K1,…,KK\mathbf{K}_1, \ldots, \mathbf{K}_K with βˆ‘ktr(Kk)≀P\sum_k \text{tr}(\mathbf{K}_k) \leq P, DPC achieves the rate tuple: RΟ€(k)=log⁑det⁑(I+HΟ€(k)(βˆ‘j=kKKΟ€(j))HΟ€(k)H(σπ(k)2I+HΟ€(k)(βˆ‘j=k+1KKΟ€(j))HΟ€(k)H)βˆ’1)R_{\pi(k)} = \log \det \left( \mathbf{I} + \mathbf{H}_{\pi(k)} \left( \sum_{j=k}^K \mathbf{K}_{\pi(j)} \right) \mathbf{H}_{\pi(k)}^{H} \left( \sigma^2_{\pi(k)} \mathbf{I} + \mathbf{H}_{\pi(k)} \left( \sum_{j=k+1}^K \mathbf{K}_{\pi(j)} \right) \mathbf{H}_{\pi(k)}^{H} \right)^{-1} \right)

The full DPC region RDPC\mathcal{R}_{\text{DPC}} is the convex hull over all encoding orders Ο€\pi and all valid covariance allocations {Kk}\{\mathbf{K}_k\}.

Each user Ο€(k)\pi(k) sees the signals of users Ο€(k+1),…,Ο€(K)\pi(k+1), \ldots, \pi(K) as interference (since those are encoded later and their interference is not pre-canceled), but the signals of Ο€(1),…,Ο€(kβˆ’1)\pi(1), \ldots, \pi(k-1) are pre-canceled via DPC. The rate expression is a generalized log⁑det⁑(I+SNR)\log\det(\mathbf{I} + \text{SNR}) formula with the effective noise including only the non-canceled interference.

,

Theorem: DPC Achieves the MIMO BC Capacity Region

The capacity region of the KK-user Gaussian MIMO broadcast channel equals the DPC region: CBC=RDPCC_{\text{BC}} = \mathcal{R}_{\text{DPC}}

That is, dirty-paper coding is optimal β€” no other coding scheme can achieve rates outside the DPC region.

The converse is the hard part. The proof, due to Weingarten, Steinberg, and Shamai (2006), uses a channel enhancement argument: enhance the noise at each receiver to create a degraded BC that has the same capacity region as the original, then apply the known converse for degraded BCs. The enhancement is possible because the Gaussian distribution is the worst-case noise (entropy-power inequality argument).

Historical Note: The Resolution of the MIMO BC Capacity

2003-2006

The MIMO BC capacity problem was one of the most actively pursued questions in information theory during 2000-2006. Caire and Shamai (2003) showed that DPC achieves the sum capacity of the MISO BC (single-antenna receivers). Vishwanath, Jindal, and Goldsmith (2003) and Viswanath and Tse (2003) independently established DPC optimality for the sum rate via MAC-BC duality. The full capacity region (all rate tuples, not just sum rate) was finally established by Weingarten, Steinberg, and Shamai in 2006, using the elegant channel enhancement technique.

It is worth noting that the practical impact of this result extends far beyond the specific DPC scheme. The capacity region characterization justifies the design of practical MU-MIMO precoding schemes (zero-forcing, regularized ZF, MMSE precoding) that approximate DPC at lower complexity β€” see Book telecom, Chapter 17.

⚠️Engineering Note

DPC vs. Practical Linear Precoding

While DPC is information-theoretically optimal for the MIMO BC, its implementation complexity is prohibitive for real systems. Practical 5G NR base stations use linear precoding (zero-forcing, MMSE, or regularized ZF) instead.

The gap between DPC and zero-forcing precoding is at most log⁑2(K)\log_2(K) bits per user for KK users, but in practice the gap is much smaller β€” typically 1-3 dB for the sum rate at moderate SNR. For massive MIMO systems (nt≫Kn_t \gg K), the gap vanishes asymptotically because the users' channels become nearly orthogonal.

The practical message: DPC tells us the limit of what is possible; linear precoding gets us most of the way there with orders of magnitude less complexity.

Practical Constraints
  • β€’

    DPC requires non-causal knowledge of all users' messages β€” impractical in real-time systems

  • β€’

    Linear precoding requires only CSI, not message knowledge

  • β€’

    Massive MIMO with MRT or ZF approaches DPC performance

πŸŽ“CommIT Contribution(2003)

DPC Sum-Capacity Optimality for the MISO BC

G. Caire, S. Shamai (Shitz) β€” IEEE Trans. Information Theory

Caire and Shamai established that dirty-paper coding achieves the sum capacity of the MISO broadcast channel (single-antenna receivers). This was a key stepping stone toward the full MIMO BC capacity region, later proved by Weingarten, Steinberg, and Shamai (2006). The paper introduced the connection between DPC and Costa's writing-on-dirty-paper theorem in the multiuser MIMO context, and showed that the sum rate with DPC can far exceed that of linear precoding, especially when user channels are not orthogonal.

DPCMISO BCsum capacityView Paper β†’

Why This Matters: Connection to MU-MIMO Precoding

The MIMO BC capacity region established here is the theoretical foundation for all multi-user MIMO downlink techniques used in 4G LTE and 5G NR. The DPC encoding order corresponds to successive interference pre-cancellation, while practical systems approximate this with linear precoding (ZF, MMSE). The MAC-BC duality (Section 16.4) is used in practical beamforming design: it is easier to optimize the uplink and then transform the solution to the downlink.

See full treatment in The $K$-User MIMO Broadcast Channel

DPC Rate Region for the 2-User MISO BC

Visualize the DPC capacity region for a two-user MISO broadcast channel (nt=2n_t = 2, single-antenna receivers). Adjust the channel vectors and SNR to see how the capacity region changes with channel geometry.

Parameters
30

Angle of channel vector $\mathbf{h}_1$ from the $x$-axis

120

Angle of channel vector $\mathbf{h}_2$ from the $x$-axis

15

Transmit SNR in dB

DPC Pre-Cancellation for the MIMO BC

Step-by-step animation of dirty-paper coding for the two-user MIMO BC. The encoder processes users sequentially, pre-canceling interference for later users via Costa's theorem. The last user sees an interference-free channel.

Common Mistake: Encoding Order Matters for DPC

Mistake:

Assuming that the DPC rate region is the same for all encoding orders.

Correction:

Different encoding orders favor different users. User Ο€(1)\pi(1) (encoded first, canceled last) sees the most interference and typically gets the lowest rate. User Ο€(K)\pi(K) (encoded last, all interference pre-canceled) sees a clean channel. The full DPC region is the convex hull over all K!K! encoding orders, which allows time-sharing between orderings.

Example: DPC for the 2-User MISO BC

Consider a MISO BC with nt=2n_t = 2 antennas, two single-antenna users with channels h1=[1,0]T\mathbf{h}_1 = [1, 0]^T and h2=[0,1]T\mathbf{h}_2 = [0, 1]^T (orthogonal channels), and total power P=10P = 10. Noise variance Οƒ2=1\sigma^2 = 1 for both users. Compute the DPC sum rate and compare with zero-forcing.

Quick Check

In the DPC encoding for the MIMO BC, user Ο€(K)\pi(K) (the last user encoded) achieves the rate of a point-to-point MIMO channel. Why?

All interference from previously encoded users is pre-canceled via Costa's theorem

User Ο€(K)\pi(K) has the strongest channel

The noise at user Ο€(K)\pi(K) is zero

Channel enhancement

A converse proof technique where the noise at each receiver is reduced (enhanced) to create a degraded broadcast channel whose capacity region contains that of the original channel. The enhancement is chosen so that the capacity region is preserved, allowing the use of the simpler degraded-channel converse.

Related: Entropy power inequality, Degraded Broadcast Channel