MIMO Channel Capacity

Multiple Antennas: The Spatial Dimension

So far we have studied scalar fading channels with a single transmit and receive antenna. The capacity scales as 12log⁑(SNR)\frac{1}{2}\log(\text{SNR}) at high SNR β€” one additional bit per 3 dB increase. Can we do better without more bandwidth or more power?

The answer is a resounding yes: by using ntn_t transmit and nrn_r receive antennas, the high-SNR capacity scales as min⁑(nt,nr)β‹…log⁑(SNR)\min(n_t, n_r) \cdot \log(\text{SNR}). This is the spatial multiplexing gain β€” we get min⁑(nt,nr)\min(n_t, n_r) independent parallel channels "for free" by exploiting the spatial dimension. This discovery, due to Foschini (1996) and Telatar (1999), revolutionized wireless communications and motivated the massive MIMO systems deployed in 5G networks today.

Definition:

MIMO Channel Model

The MIMO (multiple-input multiple-output) channel with ntn_t transmit antennas and nrn_r receive antennas is described by

y=H x+z,\mathbf{y} = \mathbf{H}\,\mathbf{x} + \mathbf{z},

where:

  • x∈Cnt\mathbf{x} \in \mathbb{C}^{n_t} is the transmitted signal vector with covariance constraint E[xxH]=Kx\mathbb{E}[\mathbf{x}\mathbf{x}^H] = \mathbf{K}_x and total power constraint tr(Kx)≀P\text{tr}(\mathbf{K}_x) \leq P,
  • H∈CnrΓ—nt\mathbf{H} \in \mathbb{C}^{n_r \times n_t} is the channel matrix,
  • z∼CN(0,NInr)\mathbf{z} \sim \mathcal{CN}(\mathbf{0}, N\mathbf{I}_{n_r}) is additive Gaussian noise,
  • y∈Cnr\mathbf{y} \in \mathbb{C}^{n_r} is the received signal vector.

Each entry [H]ij[\mathbf{H}]_{ij} represents the complex channel gain from transmit antenna jj to receive antenna ii. The noise is spatially white with power NN per receive antenna. The total transmit power constraint is tr(Kx)≀P\text{tr}(\mathbf{K}_x) \leq P.

MIMO channel

A wireless channel with ntn_t transmit and nrn_r receive antennas, modeled as y=Hx+z\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{z} where H∈CnrΓ—nt\mathbf{H} \in \mathbb{C}^{n_r \times n_t} is the channel matrix.

Related: Spatial multiplexing gain

Theorem: MIMO Capacity with CSIR (Fixed Channel)

For a fixed (deterministic) MIMO channel H\mathbf{H} with CSIR, the capacity is

C(H)=max⁑Kxβͺ°0tr(Kx)≀Plog⁑det⁑ ⁣(Inr+1NH Kx HH).C(\mathbf{H}) = \max_{\substack{\mathbf{K}_x \succeq \mathbf{0} \\ \text{tr}(\mathbf{K}_x) \leq P}} \log\det\!\left(\mathbf{I}_{n_r} + \frac{1}{N}\mathbf{H}\,\mathbf{K}_x\,\mathbf{H}^{H}\right).

If the transmitter also knows H\mathbf{H} (CSIT), the optimal input covariance is obtained by water-filling over the singular values of H\mathbf{H}.

The mutual information for Gaussian inputs is I(x;y)=h(y)βˆ’h(z)I(\mathbf{x}; \mathbf{y}) = h(\mathbf{y}) - h(\mathbf{z}), and Gaussian inputs maximize h(y)h(\mathbf{y}) under the covariance constraint. The log⁑det⁑\log\det formula is the natural matrix extension of the scalar 12log⁑(1+SNR)\frac{1}{2}\log(1 + \text{SNR}).

,

Theorem: SVD Decomposition into Parallel Channels

Let the SVD of H\mathbf{H} be H=UΞ›VH\mathbf{H} = \mathbf{U}\boldsymbol{\Lambda}\mathbf{V}^H, where U∈CnrΓ—nr\mathbf{U} \in \mathbb{C}^{n_r \times n_r}, V∈CntΓ—nt\mathbf{V} \in \mathbb{C}^{n_t \times n_t} are unitary, and Ξ›βˆˆRnrΓ—nt\boldsymbol{\Lambda} \in \mathbb{R}^{n_r \times n_t} has singular values Οƒ1β‰₯Οƒ2β‰₯β‹―β‰₯Οƒr>0\sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_r > 0 with r=rank(H)≀min⁑(nt,nr)r = \text{rank}(\mathbf{H}) \leq \min(n_t, n_r).

With CSIT, the MIMO channel decomposes into rr parallel independent sub-channels:

y~k=Οƒkx~k+z~k,k=1,…,r,\tilde{y}_k = \sigma_k \tilde{x}_k + \tilde{z}_k, \quad k = 1, \ldots, r,

where y~=UHy\tilde{\mathbf{y}} = \mathbf{U}^H \mathbf{y}, x~=VHx\tilde{\mathbf{x}} = \mathbf{V}^H \mathbf{x}, z~=UHz\tilde{\mathbf{z}} = \mathbf{U}^H \mathbf{z}. The capacity is

C(H)=βˆ‘k=1rlog⁑ ⁣(1+Οƒk2PkN),C(\mathbf{H}) = \sum_{k=1}^r \log\!\left(1 + \frac{\sigma_k^2 P_k}{N}\right),

with water-filling power allocation

Pk=(Ξ½βˆ’NΟƒk2) ⁣+,βˆ‘k=1rPk≀P.P_k = \left(\nu - \frac{N}{\sigma_k^2}\right)^{\!+}, \quad \sum_{k=1}^r P_k \leq P.

The SVD rotates the input and output spaces so that the MIMO channel becomes rr independent scalar channels, one per singular value. The kk-th sub-channel has gain Οƒk\sigma_k and receives power PkP_k. The strongest sub-channel (Οƒ1\sigma_1) carries the most data; weak sub-channels may receive zero power. This is the water-filling solution from Chapter 12, applied to the singular values of H\mathbf{H}.

Theorem: Optimal Input Without CSIT for i.i.d. Rayleigh

For an ergodic MIMO channel with i.i.d. Rayleigh fading ([H]ij∼CN(0,1)[\mathbf{H}]_{ij} \sim \mathcal{CN}(0, 1) independently) and CSIR but no CSIT, the capacity-achieving input covariance is

Kxβˆ—=Pnt Int.\mathbf{K}_x^* = \frac{P}{n_t}\,\mathbf{I}_{n_t}.

The ergodic capacity is

Cerg=EH ⁣[log⁑det⁑ ⁣(Inr+SNRntHHH)],C_{\text{erg}} = \mathbb{E}_{\mathbf{H}}\!\left[\log\det\!\left(\mathbf{I}_{n_r} + \frac{\text{SNR}}{n_t}\mathbf{H}\mathbf{H}^{H}\right)\right],

where SNR=P/N\text{SNR} = P/N.

Without CSIT, the transmitter cannot align its signal with the channel. Since the channel is isotropically distributed (i.i.d. entries), no direction in the transmit space is preferred. The best the transmitter can do is spread power equally across all antennas β€” the isotropic input Kx=(P/nt)I\mathbf{K}_x = (P/n_t)\mathbf{I}. Any other allocation would waste power by favoring directions that are no better on average.

Theorem: Telatar's Linear Capacity Scaling

For the i.i.d. Rayleigh MIMO channel with ntn_t transmit and nrn_r receive antennas, the ergodic capacity at high SNR scales as

Cergβ‰ˆmin⁑(nt,nr)β‹…log⁑ ⁣(SNRnt)+O(1)asΒ SNRβ†’βˆž.C_{\text{erg}} \approx \min(n_t, n_r) \cdot \log\!\left(\frac{\text{SNR}}{n_t}\right) + O(1) \quad \text{as } \text{SNR} \to \infty.

The factor min⁑(nt,nr)\min(n_t, n_r) is the spatial multiplexing gain: the MIMO channel supports min⁑(nt,nr)\min(n_t, n_r) independent data streams, each contributing log⁑(SNR/nt)\log(\text{SNR}/n_t) bits per channel use.

The channel matrix H\mathbf{H} has at most min⁑(nt,nr)\min(n_t, n_r) nonzero singular values. At high SNR, each singular value contributes approximately log⁑(SNR)\log(\text{SNR}) to the capacity (the water-filling level is high enough that all sub-channels are active). So the total capacity grows min⁑(nt,nr)\min(n_t, n_r) times faster than the scalar channel. Doubling the number of antennas (on both sides) roughly doubles the high-SNR capacity without any increase in bandwidth or transmit power.

Spatial multiplexing gain

The pre-log factor in the high-SNR capacity of a MIMO channel. For an nrΓ—ntn_r \times n_t MIMO channel with i.i.d. Rayleigh fading, the spatial multiplexing gain is min⁑(nt,nr)\min(n_t, n_r), meaning the capacity scales as min⁑(nt,nr)β‹…log⁑(SNR)\min(n_t, n_r) \cdot \log(\text{SNR}).

Related: MIMO channel

Example: Capacity of a 2Γ—22 \times 2 MIMO Channel

Consider a 2Γ—22 \times 2 MIMO channel with

H=(10.50.51)\mathbf{H} = \begin{pmatrix} 1 & 0.5 \\ 0.5 & 1 \end{pmatrix}

and noise power N=1N = 1. Compute the capacity with CSIT (water-filling over singular values) for total power P=10P = 10.

MIMO Capacity Scaling with Antenna Number

Observe how the ergodic capacity of an i.i.d. Rayleigh MIMO channel scales with the number of antennas. At high SNR, the capacity grows linearly with min⁑(nt,nr)\min(n_t, n_r). Adjust the SNR and antenna configuration to explore the spatial multiplexing gain.

Parameters
15
8

Plot $n_t = n_r = 1, 2, \ldots, n_{\max}$

MIMO Ergodic Capacity vs. SNR

Compare the ergodic capacity of different MIMO configurations as a function of SNR. The slopes at high SNR reveal the spatial multiplexing gains: a 4Γ—44 \times 4 system gains capacity 4 times faster per dB than a 1Γ—11 \times 1 system.

Parameters
4
4
30

Historical Note: The MIMO Revolution: Foschini and Telatar

1996-1999

The idea that multiple antennas could provide a linear capacity increase was first demonstrated by Gerard Foschini at Bell Labs in his 1996 paper introducing the BLAST (Bell Labs Layered Space-Time) architecture. Foschini showed through simulation that a 10Γ—1010 \times 10 MIMO system in Rayleigh fading could achieve spectral efficiencies of 40 bits/s/Hz at 24 dB SNR β€” a tenfold improvement over SISO.

Emre Telatar, also at Bell Labs, provided the rigorous information-theoretic foundation in his landmark 1999 paper "Capacity of Multi-Antenna Gaussian Channels." Telatar derived the ergodic capacity in closed form (as an expectation over random eigenvalues), proved the optimality of isotropic input for i.i.d. Rayleigh fading without CSIT, and established the min⁑(nt,nr)β‹…log⁑(SNR)\min(n_t, n_r) \cdot \log(\text{SNR}) scaling law.

These results were independently corroborated by the work of Marzetta and Hochwald at Bell Labs. Together, they launched the field of MIMO communications, which led to MIMO being incorporated into every modern wireless standard: 802.11n (Wi-Fi 4), LTE, LTE-Advanced, and 5G NR.

πŸŽ“CommIT Contribution(2006)

MIMO Broadcast Channel Capacity

H. Weingarten, Y. Steinberg, S. Shamai (Shitz), G. Caire β€” IEEE Trans. Information Theory, vol. 52, no. 9

While this chapter treats the point-to-point MIMO channel, the CommIT group has made fundamental contributions to the multiuser MIMO setting. Weingarten, Steinberg, and Shamai (with contributions from Caire) established the capacity region of the MIMO broadcast channel (downlink), proving that dirty paper coding (DPC) achieves the capacity region. This result extended the single-user MIMO capacity theory to the multiuser setting and provided the theoretical foundation for MU-MIMO precoding techniques used in LTE-Advanced and 5G NR.

The duality between the MIMO broadcast channel and the multiple access channel (uplink), established in companion work, showed that the capacity-achieving strategy can be computed via a dual MAC optimization β€” a computationally tractable approach. This duality is covered in the MIMO book (Book MIMO, Chapters 8-10).

MIMObroadcast-channelDPCcapacity-regionView Paper β†’

SISO vs. MIMO Capacity

PropertySISO (1Γ—11 \times 1)MIMO (ntΓ—nrn_t \times n_r)
Channel modelY=HX+ZY = HX + Zy=Hx+z\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{z}
Degrees of freedom1min⁑(nt,nr)\min(n_t, n_r)
High-SNR capacity12log⁑(SNR)\frac{1}{2}\log(\text{SNR})min⁑(nt,nr)β‹…log⁑(SNR/nt)\min(n_t, n_r) \cdot \log(\text{SNR}/n_t)
Optimal input (no CSIT, Rayleigh)X∼N(0,P)X \sim \mathcal{N}(0, P)x∼CN(0,(P/nt)I)\mathbf{x} \sim \mathcal{CN}(\mathbf{0}, (P/n_t)\mathbf{I})
Benefit from CSITWater-filling over fading statesWater-filling over singular values + fading states
Diversity order (quasi-static)1Up to ntnrn_t n_r
Array gainNonenrn_r (receive beamforming) or ntn_t (transmit beamforming with CSIT)

Common Mistake: MIMO Multiplexing Gain Requires Rich Scattering

Mistake:

Assuming the min⁑(nt,nr)β‹…log⁑(SNR)\min(n_t, n_r) \cdot \log(\text{SNR}) capacity scaling holds for any MIMO channel, regardless of the propagation environment.

Correction:

The linear scaling requires the channel matrix H\mathbf{H} to be full rank, which happens with probability 1 for i.i.d. Rayleigh fading (rich scattering). In a pure line-of-sight (LoS) environment, H=aratH\mathbf{H} = \mathbf{a}_r \mathbf{a}_t^H is rank-1 regardless of the number of antennas, and the capacity is

CLoS=log⁑(1+ntnrβ‹…SNR),C_{\text{LoS}} = \log(1 + n_t n_r \cdot \text{SNR}),

which provides an array gain (ntnrn_t n_r) but only 1Γ—log⁑(SNR)1 \times \log(\text{SNR}) growth β€” no spatial multiplexing. In practice, mixed LoS + scattered environments (Rician fading) provide intermediate behavior.

⚠️Engineering Note

From MIMO Theory to Massive MIMO Practice

Telatar's capacity formula suggests that capacity grows linearly with the number of antennas. In theory, a base station with nt=128n_t = 128 antennas serving a single user with nr=1n_r = 1 antenna achieves an array gain of 128 (21 dB) but no multiplexing gain. If instead it serves K=16K = 16 single-antenna users simultaneously (MU-MIMO), the system achieves a multiplexing gain of 16.

In practice, the scaling is limited by:

  1. Channel estimation overhead: estimating H∈CnrΓ—nt\mathbf{H} \in \mathbb{C}^{n_r \times n_t} requires at least ntn_t pilot symbols, consuming resources.
  2. Pilot contamination: in multi-cell systems, users in neighboring cells reuse pilot sequences, creating interference that does not vanish as ntβ†’βˆžn_t \to \infty (Marzetta 2010).
  3. Hardware impairments: phase noise and quantization errors in low-cost RF chains degrade the effective channel.
  4. Spatial correlation: real channels are rarely i.i.d.; correlation reduces the effective rank and the multiplexing gain.
Practical Constraints
  • β€’

    In 5G NR, the maximum number of antenna ports per TRP is 32 (Rel-15) or 64 (Rel-17)

  • β€’

    CSI-RS overhead scales linearly with the number of antenna ports in FDD

  • β€’

    TDD massive MIMO exploits channel reciprocity to avoid downlink pilot scaling

πŸ“‹ Ref: 3GPP TS 38.214, Section 5.2

Quick Check

For a 4Γ—44 \times 4 i.i.d. Rayleigh MIMO channel at high SNR, how does the capacity scale compared to a SISO channel at the same SNR?

4 dB higher (array gain only)

4 times the capacity (4x multiplexing gain)

16 times the capacity (ntβ‹…nrn_t \cdot n_r gain)

Same capacity but more reliable

Key Takeaway

The MIMO channel y=Hx+z\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{z} decomposes via SVD into min⁑(nt,nr)\min(n_t, n_r) parallel sub-channels. With CSIT, water-filling over singular values is optimal. Without CSIT but with i.i.d. Rayleigh fading, isotropic input (P/nt)I(P/n_t)\mathbf{I} is optimal. Telatar's result shows that ergodic capacity scales as min⁑(nt,nr)β‹…log⁑(SNR)\min(n_t, n_r) \cdot \log(\text{SNR}) at high SNR β€” the spatial multiplexing gain that launched the MIMO revolution.

MIMO Capacity Scaling with Antennas

MIMO ergodic capacity grows approximately linearly with min⁑(nt,nr)\min(n_t, n_r) at high SNR β€” spatial multiplexing. The animation shows capacity computed via Monte Carlo for i.i.d. Rayleigh channels alongside the linear scaling reference.