MIMO Channel Capacity
Multiple Antennas: The Spatial Dimension
So far we have studied scalar fading channels with a single transmit and receive antenna. The capacity scales as at high SNR β one additional bit per 3 dB increase. Can we do better without more bandwidth or more power?
The answer is a resounding yes: by using transmit and receive antennas, the high-SNR capacity scales as . This is the spatial multiplexing gain β we get independent parallel channels "for free" by exploiting the spatial dimension. This discovery, due to Foschini (1996) and Telatar (1999), revolutionized wireless communications and motivated the massive MIMO systems deployed in 5G networks today.
Definition: MIMO Channel Model
MIMO Channel Model
The MIMO (multiple-input multiple-output) channel with transmit antennas and receive antennas is described by
where:
- is the transmitted signal vector with covariance constraint and total power constraint ,
- is the channel matrix,
- is additive Gaussian noise,
- is the received signal vector.
Each entry represents the complex channel gain from transmit antenna to receive antenna . The noise is spatially white with power per receive antenna. The total transmit power constraint is .
MIMO channel
A wireless channel with transmit and receive antennas, modeled as where is the channel matrix.
Related: Spatial multiplexing gain
Theorem: MIMO Capacity with CSIR (Fixed Channel)
For a fixed (deterministic) MIMO channel with CSIR, the capacity is
If the transmitter also knows (CSIT), the optimal input covariance is obtained by water-filling over the singular values of .
The mutual information for Gaussian inputs is , and Gaussian inputs maximize under the covariance constraint. The formula is the natural matrix extension of the scalar .
Write and compute each term.
Use the fact that is Gaussian with covariance .
Recall that for complex Gaussian.
Mutual information with Gaussian inputs
For and given :
The mutual information is
Compute the differential entropies
Using the complex Gaussian entropy formula :
Therefore
Optimality of Gaussian inputs
For any input distribution with covariance ,
Since Gaussian maximizes under a fixed covariance (Chapter 5), and is Gaussian when is Gaussian, the maximum is achieved by Gaussian inputs.
Optimization over $\mathbf{K}_x$
The capacity is the maximum over all valid :
The objective is concave in (log-det is concave on positive definite matrices), and the constraint set is convex, so this is a convex optimization problem.
Theorem: SVD Decomposition into Parallel Channels
Let the SVD of be , where , are unitary, and has singular values with .
With CSIT, the MIMO channel decomposes into parallel independent sub-channels:
where , , . The capacity is
with water-filling power allocation
The SVD rotates the input and output spaces so that the MIMO channel becomes independent scalar channels, one per singular value. The -th sub-channel has gain and receives power . The strongest sub-channel () carries the most data; weak sub-channels may receive zero power. This is the water-filling solution from Chapter 12, applied to the singular values of .
Apply the SVD transformation
Substitute into the channel model and left-multiply by :
Define , , . Since is unitary, (noise statistics are preserved). The channel becomes
which decouples into .
Apply water-filling
Each sub-channel is an independent AWGN channel with gain . The optimal input covariance in the rotated basis is . From Chapter 12, the optimal power allocation is water-filling:
and the capacity is the sum of sub-channel capacities.
Theorem: Optimal Input Without CSIT for i.i.d. Rayleigh
For an ergodic MIMO channel with i.i.d. Rayleigh fading ( independently) and CSIR but no CSIT, the capacity-achieving input covariance is
The ergodic capacity is
where .
Without CSIT, the transmitter cannot align its signal with the channel. Since the channel is isotropically distributed (i.i.d. entries), no direction in the transmit space is preferred. The best the transmitter can do is spread power equally across all antennas β the isotropic input . Any other allocation would waste power by favoring directions that are no better on average.
Symmetry argument
For i.i.d. Rayleigh , the distribution of is unitarily invariant: has the same distribution as for any unitary and .
Suppose is optimal. For any unitary , achieves the same ergodic rate (by the unitary invariance of ). By the concavity of in , the average over all unitaries achieves at least as high a rate. Therefore is optimal.
Theorem: Telatar's Linear Capacity Scaling
For the i.i.d. Rayleigh MIMO channel with transmit and receive antennas, the ergodic capacity at high SNR scales as
The factor is the spatial multiplexing gain: the MIMO channel supports independent data streams, each contributing bits per channel use.
The channel matrix has at most nonzero singular values. At high SNR, each singular value contributes approximately to the capacity (the water-filling level is high enough that all sub-channels are active). So the total capacity grows times faster than the scalar channel. Doubling the number of antennas (on both sides) roughly doubles the high-SNR capacity without any increase in bandwidth or transmit power.
Eigenvalue decomposition
Let and let be the nonzero eigenvalues of (or equivalently of ). The ergodic capacity is
High-SNR expansion
At high SNR, for all such that . Since all eigenvalues are positive with probability 1 (the matrix is full rank a.s. when ):
The second term is a constant independent of , so the capacity grows as at high SNR.
Spatial multiplexing gain
The pre-log factor in the high-SNR capacity of a MIMO channel. For an MIMO channel with i.i.d. Rayleigh fading, the spatial multiplexing gain is , meaning the capacity scales as .
Related: MIMO channel
Example: Capacity of a MIMO Channel
Consider a MIMO channel with
and noise power . Compute the capacity with CSIT (water-filling over singular values) for total power .
Compute the SVD
The eigenvalues of (since is symmetric here) are found from the characteristic equation:
The singular values are and .
Water-filling power allocation
The sub-channel gains are and . The water-filling equations are:
Both channels are active (since ). From :
Compute capacity
P_1 = P_2 = 5C_{\text{equal}} = \log(8.5) + \log(3.5) \approx 3.087 + 1.807 = 4.894$ bits/use.
Water-filling provides only a marginal gain here (0.017 bits) because the sub-channel gains are not vastly different.
MIMO Capacity Scaling with Antenna Number
Observe how the ergodic capacity of an i.i.d. Rayleigh MIMO channel scales with the number of antennas. At high SNR, the capacity grows linearly with . Adjust the SNR and antenna configuration to explore the spatial multiplexing gain.
Parameters
Plot $n_t = n_r = 1, 2, \ldots, n_{\max}$
MIMO Ergodic Capacity vs. SNR
Compare the ergodic capacity of different MIMO configurations as a function of SNR. The slopes at high SNR reveal the spatial multiplexing gains: a system gains capacity 4 times faster per dB than a system.
Parameters
Historical Note: The MIMO Revolution: Foschini and Telatar
1996-1999The idea that multiple antennas could provide a linear capacity increase was first demonstrated by Gerard Foschini at Bell Labs in his 1996 paper introducing the BLAST (Bell Labs Layered Space-Time) architecture. Foschini showed through simulation that a MIMO system in Rayleigh fading could achieve spectral efficiencies of 40 bits/s/Hz at 24 dB SNR β a tenfold improvement over SISO.
Emre Telatar, also at Bell Labs, provided the rigorous information-theoretic foundation in his landmark 1999 paper "Capacity of Multi-Antenna Gaussian Channels." Telatar derived the ergodic capacity in closed form (as an expectation over random eigenvalues), proved the optimality of isotropic input for i.i.d. Rayleigh fading without CSIT, and established the scaling law.
These results were independently corroborated by the work of Marzetta and Hochwald at Bell Labs. Together, they launched the field of MIMO communications, which led to MIMO being incorporated into every modern wireless standard: 802.11n (Wi-Fi 4), LTE, LTE-Advanced, and 5G NR.
MIMO Broadcast Channel Capacity
While this chapter treats the point-to-point MIMO channel, the CommIT group has made fundamental contributions to the multiuser MIMO setting. Weingarten, Steinberg, and Shamai (with contributions from Caire) established the capacity region of the MIMO broadcast channel (downlink), proving that dirty paper coding (DPC) achieves the capacity region. This result extended the single-user MIMO capacity theory to the multiuser setting and provided the theoretical foundation for MU-MIMO precoding techniques used in LTE-Advanced and 5G NR.
The duality between the MIMO broadcast channel and the multiple access channel (uplink), established in companion work, showed that the capacity-achieving strategy can be computed via a dual MAC optimization β a computationally tractable approach. This duality is covered in the MIMO book (Book MIMO, Chapters 8-10).
SISO vs. MIMO Capacity
| Property | SISO () | MIMO () |
|---|---|---|
| Channel model | ||
| Degrees of freedom | 1 | |
| High-SNR capacity | ||
| Optimal input (no CSIT, Rayleigh) | ||
| Benefit from CSIT | Water-filling over fading states | Water-filling over singular values + fading states |
| Diversity order (quasi-static) | 1 | Up to |
| Array gain | None | (receive beamforming) or (transmit beamforming with CSIT) |
Common Mistake: MIMO Multiplexing Gain Requires Rich Scattering
Mistake:
Assuming the capacity scaling holds for any MIMO channel, regardless of the propagation environment.
Correction:
The linear scaling requires the channel matrix to be full rank, which happens with probability 1 for i.i.d. Rayleigh fading (rich scattering). In a pure line-of-sight (LoS) environment, is rank-1 regardless of the number of antennas, and the capacity is
which provides an array gain () but only growth β no spatial multiplexing. In practice, mixed LoS + scattered environments (Rician fading) provide intermediate behavior.
From MIMO Theory to Massive MIMO Practice
Telatar's capacity formula suggests that capacity grows linearly with the number of antennas. In theory, a base station with antennas serving a single user with antenna achieves an array gain of 128 (21 dB) but no multiplexing gain. If instead it serves single-antenna users simultaneously (MU-MIMO), the system achieves a multiplexing gain of 16.
In practice, the scaling is limited by:
- Channel estimation overhead: estimating requires at least pilot symbols, consuming resources.
- Pilot contamination: in multi-cell systems, users in neighboring cells reuse pilot sequences, creating interference that does not vanish as (Marzetta 2010).
- Hardware impairments: phase noise and quantization errors in low-cost RF chains degrade the effective channel.
- Spatial correlation: real channels are rarely i.i.d.; correlation reduces the effective rank and the multiplexing gain.
- β’
In 5G NR, the maximum number of antenna ports per TRP is 32 (Rel-15) or 64 (Rel-17)
- β’
CSI-RS overhead scales linearly with the number of antenna ports in FDD
- β’
TDD massive MIMO exploits channel reciprocity to avoid downlink pilot scaling
Quick Check
For a i.i.d. Rayleigh MIMO channel at high SNR, how does the capacity scale compared to a SISO channel at the same SNR?
4 dB higher (array gain only)
4 times the capacity (4x multiplexing gain)
16 times the capacity ( gain)
Same capacity but more reliable
With and i.i.d. Rayleigh fading, the high-SNR capacity is , which is approximately 4 times the SISO capacity at high SNR.
Key Takeaway
The MIMO channel decomposes via SVD into parallel sub-channels. With CSIT, water-filling over singular values is optimal. Without CSIT but with i.i.d. Rayleigh fading, isotropic input is optimal. Telatar's result shows that ergodic capacity scales as at high SNR β the spatial multiplexing gain that launched the MIMO revolution.