The MIMO Broadcast Channel
From Discrete to Gaussian MIMO
We now turn to the broadcast channel that matters most for modern wireless systems: the Gaussian MIMO broadcast channel. This is the downlink of a multi-antenna base station serving multiple single- or multi-antenna users.
The MIMO BC is not degraded in general β two users with different spatial channel matrices and cannot be ordered by a degradation relation unless their channels happen to have a special structure. This means superposition coding alone is insufficient. The remarkable result, due to Weingarten, Steinberg, and Shamai (2006), is that dirty-paper coding (DPC) achieves the entire capacity region of the Gaussian MIMO BC.
The point is that DPC transforms the MIMO BC into an effective degraded channel by pre-canceling the interference caused by each user's signal to the subsequently encoded users β exactly Costa's result from Chapter 12, applied sequentially.
Definition: The -User Gaussian MIMO Broadcast Channel
The -User Gaussian MIMO Broadcast Channel
The transmitter has antennas and serves receivers, each with antennas. The channel to user is: where:
- is the transmitted signal with power constraint
- is the channel matrix to user
- is i.i.d. Gaussian noise
The encoder sends independent messages at rates , and each receiver decodes only .
MIMO broadcast channel
The downlink of a multi-antenna transmitter serving multiple receivers simultaneously. Each receiver sees the transmitted signal through its own spatial channel matrix, making the channel non-degraded in general.
Related: Dirty-paper coding, MIMO MAC, Linear precoding
Definition: DPC Encoding for the MIMO BC
DPC Encoding for the MIMO BC
Fix an encoding order , a permutation of . The DPC encoder processes users in the order :
- User : Encode using a Gaussian codebook with covariance , producing .
- User , : The signals are known non-causally at the encoder (since the encoder generated them). Treat as "interference known at the encoder" and apply Costa's dirty-paper coding to pre-cancel it. Encode with covariance .
- Transmit: with .
Theorem: DPC Achievable Rate Region
For a fixed encoding order and covariance matrices with , DPC achieves the rate tuple:
The full DPC region is the convex hull over all encoding orders and all valid covariance allocations .
Each user sees the signals of users as interference (since those are encoded later and their interference is not pre-canceled), but the signals of are pre-canceled via DPC. The rate expression is a generalized formula with the effective noise including only the non-canceled interference.
Costa's theorem per user
For user , the interference is known non-causally at the encoder. By Costa's theorem (Chapter 12), this interference can be pre-canceled at no rate cost, as if it were not present. The effective channel for user is therefore:
Gaussian codebook achieves the rate
With Gaussian , the achievable rate is the mutual information of a Gaussian MIMO channel with input covariance and noise-plus-interference covariance . The standard formula applies.
Power constraint
Since with independent components, .
Theorem: DPC Achieves the MIMO BC Capacity Region
The capacity region of the -user Gaussian MIMO broadcast channel equals the DPC region:
That is, dirty-paper coding is optimal β no other coding scheme can achieve rates outside the DPC region.
The converse is the hard part. The proof, due to Weingarten, Steinberg, and Shamai (2006), uses a channel enhancement argument: enhance the noise at each receiver to create a degraded BC that has the same capacity region as the original, then apply the known converse for degraded BCs. The enhancement is possible because the Gaussian distribution is the worst-case noise (entropy-power inequality argument).
Achievability (DPC)
Follows from the DPC construction above and Costa's theorem. Each user sees an effective point-to-point channel with Gaussian codebook achieving the rate.
Converse via channel enhancement
The key idea: for each user , enhance (reduce) the noise covariance from to some such that the resulting channel becomes physically degraded: (after appropriate basis transformations). This enhancement can only increase the capacity region, so any outer bound for the enhanced (degraded) channel is also an outer bound for the original channel.
Degraded channel converse
For the enhanced degraded MIMO BC, superposition coding is optimal (Bergmans' converse extended to the MIMO case). The optimal input distribution is Gaussian (entropy maximization under covariance constraint). The resulting capacity region matches the DPC region of the original channel.
Enhancement preserves capacity
The crucial step: the enhancement is chosen so that the capacity region of the enhanced channel equals that of the original channel. This requires showing that the optimal covariance matrices for the original channel remain optimal under enhancement β which follows from the KKT conditions of the optimization.
Historical Note: The Resolution of the MIMO BC Capacity
2003-2006The MIMO BC capacity problem was one of the most actively pursued questions in information theory during 2000-2006. Caire and Shamai (2003) showed that DPC achieves the sum capacity of the MISO BC (single-antenna receivers). Vishwanath, Jindal, and Goldsmith (2003) and Viswanath and Tse (2003) independently established DPC optimality for the sum rate via MAC-BC duality. The full capacity region (all rate tuples, not just sum rate) was finally established by Weingarten, Steinberg, and Shamai in 2006, using the elegant channel enhancement technique.
It is worth noting that the practical impact of this result extends far beyond the specific DPC scheme. The capacity region characterization justifies the design of practical MU-MIMO precoding schemes (zero-forcing, regularized ZF, MMSE precoding) that approximate DPC at lower complexity β see Book telecom, Chapter 17.
DPC vs. Practical Linear Precoding
While DPC is information-theoretically optimal for the MIMO BC, its implementation complexity is prohibitive for real systems. Practical 5G NR base stations use linear precoding (zero-forcing, MMSE, or regularized ZF) instead.
The gap between DPC and zero-forcing precoding is at most bits per user for users, but in practice the gap is much smaller β typically 1-3 dB for the sum rate at moderate SNR. For massive MIMO systems (), the gap vanishes asymptotically because the users' channels become nearly orthogonal.
The practical message: DPC tells us the limit of what is possible; linear precoding gets us most of the way there with orders of magnitude less complexity.
- β’
DPC requires non-causal knowledge of all users' messages β impractical in real-time systems
- β’
Linear precoding requires only CSI, not message knowledge
- β’
Massive MIMO with MRT or ZF approaches DPC performance
DPC Sum-Capacity Optimality for the MISO BC
Caire and Shamai established that dirty-paper coding achieves the sum capacity of the MISO broadcast channel (single-antenna receivers). This was a key stepping stone toward the full MIMO BC capacity region, later proved by Weingarten, Steinberg, and Shamai (2006). The paper introduced the connection between DPC and Costa's writing-on-dirty-paper theorem in the multiuser MIMO context, and showed that the sum rate with DPC can far exceed that of linear precoding, especially when user channels are not orthogonal.
Why This Matters: Connection to MU-MIMO Precoding
The MIMO BC capacity region established here is the theoretical foundation for all multi-user MIMO downlink techniques used in 4G LTE and 5G NR. The DPC encoding order corresponds to successive interference pre-cancellation, while practical systems approximate this with linear precoding (ZF, MMSE). The MAC-BC duality (Section 16.4) is used in practical beamforming design: it is easier to optimize the uplink and then transform the solution to the downlink.
See full treatment in The $K$-User MIMO Broadcast Channel
DPC Rate Region for the 2-User MISO BC
Visualize the DPC capacity region for a two-user MISO broadcast channel (, single-antenna receivers). Adjust the channel vectors and SNR to see how the capacity region changes with channel geometry.
Parameters
Angle of channel vector $\mathbf{h}_1$ from the $x$-axis
Angle of channel vector $\mathbf{h}_2$ from the $x$-axis
Transmit SNR in dB
DPC Pre-Cancellation for the MIMO BC
Common Mistake: Encoding Order Matters for DPC
Mistake:
Assuming that the DPC rate region is the same for all encoding orders.
Correction:
Different encoding orders favor different users. User (encoded first, canceled last) sees the most interference and typically gets the lowest rate. User (encoded last, all interference pre-canceled) sees a clean channel. The full DPC region is the convex hull over all encoding orders, which allows time-sharing between orderings.
Example: DPC for the 2-User MISO BC
Consider a MISO BC with antennas, two single-antenna users with channels and (orthogonal channels), and total power . Noise variance for both users. Compute the DPC sum rate and compare with zero-forcing.
Orthogonal channels simplification
With orthogonal channels, each user's signal does not interfere with the other. The optimal strategy is simply to allocate power to user 1 and to user 2, with and .
DPC rates
With either encoding order: The DPC encoding order is irrelevant because there is no cross-interference to pre-cancel. Sum rate: .
Comparison with ZF
Zero-forcing with orthogonal channels is identical to DPC β each antenna serves one user with no interference. The ZF and DPC sum rates coincide: By concavity of , equal power allocation is optimal, giving bits/channel use.
Remark on non-orthogonal channels
When channels are not orthogonal (e.g., and are close in angle), DPC strictly outperforms ZF because it can pre-cancel cross-interference instead of nulling it out β ZF wastes degrees of freedom to avoid interference, while DPC uses them constructively.
Quick Check
In the DPC encoding for the MIMO BC, user (the last user encoded) achieves the rate of a point-to-point MIMO channel. Why?
All interference from previously encoded users is pre-canceled via Costa's theorem
User has the strongest channel
The noise at user is zero
User is encoded last, so the encoder knows all other codewords non-causally and pre-cancels their interference completely. The effective channel is interference-free.
Channel enhancement
A converse proof technique where the noise at each receiver is reduced (enhanced) to create a degraded broadcast channel whose capacity region contains that of the original channel. The enhancement is chosen so that the capacity region is preserved, allowing the use of the simpler degraded-channel converse.
Related: Entropy power inequality, Degraded Broadcast Channel