MIMO Capacity: Fading Channels

From Deterministic to Random Channels

In practice, the channel matrix H\mathbf{H} is random --- it changes with time, frequency, and user location. Section 15.3 gave the capacity for a known, fixed H\mathbf{H}. Now we ask: what rate can we sustain when H\mathbf{H} is drawn from a random distribution?

The answer depends critically on the delay constraint and the channel state information (CSI) available at the transmitter. Two complementary capacity metrics emerge: ergodic capacity (long codewords that experience all channel states) and outage capacity (short codewords that see a single channel realisation).

Definition:

Ergodic MIMO Capacity

The ergodic capacity of a fading MIMO channel is the capacity averaged over all channel realisations:

Cerg=EH ⁣[max⁑Q(H):β€…β€Štr(Q)≀Plog⁑2det⁑ ⁣(I+1Οƒ2HQHH)]C_{\mathrm{erg}} = \mathbb{E}_{\mathbf{H}}\!\left[\max_{\mathbf{Q}(\mathbf{H}):\; \mathrm{tr}(\mathbf{Q}) \leq P} \log_2 \det\!\left(\mathbf{I} + \frac{1}{\sigma^2}\mathbf{H}\mathbf{Q}\mathbf{H}^{H}\right)\right]

This is achievable when codewords span many independent fading realisations (ergodic regime), i.e., the coding block length is much larger than the coherence time.

When the transmitter has no CSI (only the receiver knows H\mathbf{H}), the optimal input is Q=(P/nt)Int\mathbf{Q} = (P/n_t)\mathbf{I}_{n_t} (equal power, isotropic transmission), and the capacity simplifies to

Cerg=EH ⁣[log⁑2det⁑ ⁣(Inr+SNRntHHH)]C_{\mathrm{erg}} = \mathbb{E}_{\mathbf{H}}\!\left[\log_2 \det\!\left(\mathbf{I}_{n_r} + \frac{\text{SNR}}{n_t}\mathbf{H}\mathbf{H}^{H}\right)\right]

In the no-CSIT case, transmitting isotropically is optimal because without knowledge of H\mathbf{H}, no direction is preferred. This is the most common scenario in practice (e.g., downlink in FDD systems without channel feedback).

Theorem: Telatar's Formula for Ergodic MIMO Capacity

For an i.i.d. Rayleigh fading MIMO channel with ntn_t transmit and nrn_r receive antennas, no CSIT, and SNR=P/Οƒ2\text{SNR} = P/\sigma^2, the ergodic capacity is

Cerg=E ⁣[βˆ‘i=1mlog⁑2 ⁣(1+SNRntΞ»i)]C_{\mathrm{erg}} = \mathbb{E}\!\left[\sum_{i=1}^{m} \log_2\!\left(1 + \frac{\text{SNR}}{n_t}\lambda_i\right)\right]

where m=min⁑(nt,nr)m = \min(n_t, n_r), and Ξ»1,…,Ξ»m\lambda_1, \ldots, \lambda_m are the mm nonzero eigenvalues of the Wishart matrix W\mathbf{W}:

W={HHHifΒ nr≀ntHHHifΒ nr>nt\mathbf{W} = \begin{cases} \mathbf{H}\mathbf{H}^{H} & \text{if } n_r \leq n_t \\ \mathbf{H}^{H}\mathbf{H} & \text{if } n_r > n_t \end{cases}

The joint density of {λi}\{\lambda_i\} is the unordered eigenvalue distribution of a complex Wishart matrix with parameters (m,n)(m, n) where n=max⁑(nt,nr)n = \max(n_t, n_r):

f(Ξ»1,…,Ξ»m)=Km,n∏i<j(Ξ»iβˆ’Ξ»j)2∏i=1mΞ»inβˆ’meβˆ’Ξ»if(\lambda_1, \ldots, \lambda_m) = K_{m,n} \prod_{i<j}(\lambda_i - \lambda_j)^2 \prod_{i=1}^{m} \lambda_i^{n-m} e^{-\lambda_i}

where Km,nK_{m,n} is a normalisation constant.

The capacity is the sum of rates across min⁑(nt,nr)\min(n_t, n_r) spatial sub-channels, each with a random gain Ξ»i\lambda_i drawn from the Wishart distribution. The Vandermonde term ∏i<j(Ξ»iβˆ’Ξ»j)2\prod_{i<j}(\lambda_i - \lambda_j)^2 ensures eigenvalue repulsion --- the eigenvalues tend to spread out, which is good for spatial multiplexing.

,

Definition:

Outage Capacity

When codewords cannot span multiple fading realisations (non-ergodic regime, e.g., slow fading), the instantaneous mutual information I(H)I(\mathbf{H}) is a random variable. The Ξ΅\varepsilon-outage capacity CΞ΅C_\varepsilon is defined as

Pr⁑ ⁣[log⁑2det⁑ ⁣(I+SNRntHHH)<CΞ΅]=Ξ΅\Pr\!\left[\log_2 \det\!\left(\mathbf{I} + \frac{\text{SNR}}{n_t}\mathbf{H}\mathbf{H}^{H}\right) < C_\varepsilon\right] = \varepsilon

i.e., CΞ΅C_\varepsilon is the rate that can be supported with probability 1βˆ’Ξ΅1 - \varepsilon. Typical values are Ξ΅=0.01\varepsilon = 0.01 or Ξ΅=0.05\varepsilon = 0.05 (1% or 5% outage).

Outage capacity is the relevant metric for delay-sensitive applications (voice, real-time video) where the codeword length is comparable to or shorter than the coherence time. MIMO diversity helps by making the outage probability decay faster with SNR.

Ergodic Capacity and Capacity CDF

Plot the CDF of the instantaneous mutual information for different antenna configurations. The ergodic capacity is the mean, and the outage capacity is read from the CDF at a given outage probability.

Parameters
4
4
20
0.05

Outage probability (vertical line on CDF)

MIMO Capacity Scaling with Antenna Count

Animate how the ergodic capacity grows as the number of antennas increases from 1 to nmax⁑n_{\max} (with nt=nrn_t = n_r). Observe the approximately linear scaling Cβ‰ˆmin⁑(nt,nr)log⁑2(1+SNR/nt)C \approx \min(n_t, n_r) \log_2(1 + \text{SNR}/n_t) for i.i.d. Rayleigh channels.

Parameters
8

Maximum number of antennas (both Tx and Rx)

20

Example: Ergodic Capacity of 4Γ—44 \times 4 i.i.d. Rayleigh Channel

Compute the ergodic capacity of a 4Γ—44 \times 4 MIMO system at SNR=20\text{SNR} = 20 dB with i.i.d. Rayleigh fading and no CSIT. Compare with: (a) a SISO channel at the same SNR, (b) the deterministic capacity with a perfectly conditioned channel (Οƒi=1\sigma_i = 1 for all ii).

Common Mistake: Ergodic Capacity Is Not Achievable in Slow Fading

Mistake:

Quoting ergodic capacity as the "MIMO capacity" for a system operating in a slow-fading environment where the channel is approximately constant over the entire codeword duration.

Correction:

In slow fading, the relevant metric is outage capacity, which is always lower than ergodic capacity. For example, a 2Γ—22 \times 2 system at 20 dB SNR might have ergodic capacity of 12 bits/s/Hz but only 7 bits/s/Hz at 1% outage. The gap depends on the diversity order: more antennas (higher diversity) make the outage CDF steeper, bringing outage capacity closer to ergodic capacity.

Common Mistake: Capacity Scales Linearly with min⁑(nt,nr)\min(n_t, n_r), Not nt+nrn_t + n_r

Mistake:

Claiming that doubling the total number of antennas doubles the capacity. For example, expecting a 4Γ—14 \times 1 system (5 total antennas) to have higher capacity than a 2Γ—22 \times 2 system (4 total antennas).

Correction:

At high SNR, capacity scales as min⁑(nt,nr)log⁑2(SNR/nt)\min(n_t, n_r) \log_2(\text{SNR}/n_t). The 2Γ—22 \times 2 system has min⁑(2,2)=2\min(2,2) = 2 multiplexing streams, while the 4Γ—14 \times 1 (MISO) system has only min⁑(4,1)=1\min(4,1) = 1 stream. The MISO system gets a beamforming (array) gain of log⁑2(nt)=2\log_2(n_t) = 2 bits, but the 2Γ—22\times 2 gets a multiplexing gain of 2Γ—2\times the high-SNR slope. At 20 dB, the 2Γ—22 \times 2 vastly outperforms the 4Γ—14 \times 1 in capacity.

Quick Check

For an i.i.d. Rayleigh fading ntΓ—nrn_t \times n_r MIMO channel with no CSIT, what is the optimal transmit strategy?

Isotropic transmission: Q=(P/nt)Int\mathbf{Q} = (P/n_t)\mathbf{I}_{n_t}

Beamforming along the strongest eigenvector of E[HHH]\mathbb{E}[\mathbf{H}^{H}\mathbf{H}]

Concentrate all power on one antenna

Water-filling based on the channel statistics Rt\mathbf{R}_{t}

Ergodic capacity

The capacity of a fading channel averaged over the channel distribution, achievable when codewords span many independent fading realisations.

Related: Outage capacity, MIMO capacity

Outage capacity

The rate CΞ΅C_\varepsilon supportable with probability at least 1βˆ’Ξ΅1 - \varepsilon over a fading channel. Relevant for delay-constrained communication in slow fading.

Related: Ergodic capacity

Wishart matrix

A random matrix of the form W=HHH\mathbf{W} = \mathbf{H}\mathbf{H}^{H} where H\mathbf{H} has i.i.d. Gaussian entries. The eigenvalue distribution of Wishart matrices governs MIMO capacity statistics.

Related: Ergodic capacity