MIMO Capacity: Deterministic Channels

Capacity as the Ultimate Limit

Having established the MIMO input-output model and the SVD decomposition into parallel channels, we can now answer the fundamental question: what is the maximum achievable rate?

For a known (deterministic) channel H\mathbf{H}, the capacity is achieved by choosing the optimal input covariance matrix Q=E[xxH]\mathbf{Q} = \mathbb{E}[\mathbf{x}\mathbf{x}^H] subject to a power constraint. The solution combines the SVD decomposition from Section 15.1 with the water-filling principle from Chapter 11, yielding one of the most elegant results in information theory.

Definition:

MIMO Capacity (Deterministic Channel)

For a deterministic MIMO channel y=Hx+n\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{n} with n∼CN(0,Οƒ2I)\mathbf{n} \sim \mathcal{CN}(\mathbf{0}, \sigma^2 \mathbf{I}) and power constraint tr(Q)≀P\mathrm{tr}(\mathbf{Q}) \leq P, the MIMO capacity is

C=max⁑Qβͺ°0,β€…β€Štr(Q)≀Plog⁑2det⁑ ⁣(Inr+1Οƒ2HQHH)bits/s/HzC = \max_{\mathbf{Q} \succeq 0,\; \mathrm{tr}(\mathbf{Q}) \leq P} \log_2 \det\!\left(\mathbf{I}_{n_r} + \frac{1}{\sigma^2} \mathbf{H}\mathbf{Q}\mathbf{H}^{H}\right) \quad \text{bits/s/Hz}

where Qβͺ°0\mathbf{Q} \succeq 0 means Q\mathbf{Q} is positive semidefinite (valid covariance matrix).

This is the mutual information I(x;y)I(\mathbf{x}; \mathbf{y}) maximised over Gaussian inputs. Gaussian inputs are optimal because the noise is Gaussian (maximum entropy property).

,

Definition:

Parallel Channel Decomposition

Using the SVD H=UΞ£VH\mathbf{H} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H and the transformation from Theorem 15.1.1, the MIMO channel decomposes into r=rank(H)r = \mathrm{rank}(\mathbf{H}) parallel scalar channels:

y~i=Οƒix~i+n~i,i=1,…,r\tilde{y}_i = \sigma_i \tilde{x}_i + \tilde{n}_i, \qquad i = 1, \ldots, r

Each sub-channel has gain Οƒi2\sigma_i^2 (the ii-th eigenvalue of HHH\mathbf{H}^{H}\mathbf{H}). The MIMO capacity problem reduces to optimally allocating total power PP across these rr parallel Gaussian channels, which is solved by water-filling.

Theorem: MIMO Capacity via SVD and Water-Filling

The capacity of the deterministic MIMO channel y=Hx+n\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{n} with H=UΞ£VH\mathbf{H} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H and power constraint PP is

C=βˆ‘i=1rlog⁑2 ⁣(1+piβˆ—Οƒi2Οƒ2)bits/s/HzC = \sum_{i=1}^{r} \log_2\!\left(1 + \frac{p_i^* \sigma_i^2}{\sigma^2}\right) \quad \text{bits/s/Hz}

where the optimal power allocation {piβˆ—}\{p_i^*\} is given by water-filling:

piβˆ—=(ΞΌβˆ’Οƒ2Οƒi2)+p_i^* = \left(\mu - \frac{\sigma^2}{\sigma_i^2}\right)^+

and ΞΌ\mu is chosen so that βˆ‘i=1rpiβˆ—=P\sum_{i=1}^{r} p_i^* = P. Here (x)+=max⁑(x,0)(x)^+ = \max(x, 0).

The optimal input covariance matrix is Qβˆ—=V diag(p1βˆ—,…,prβˆ—,0,…) VH\mathbf{Q}^* = \mathbf{V}\, \mathrm{diag}(p_1^*, \ldots, p_r^*, 0, \ldots)\, \mathbf{V}^H.

Water-filling allocates more power to stronger sub-channels (large Οƒi2\sigma_i^2) and less to weaker ones, even shutting off the weakest sub-channels entirely. At high SNR, the water level ΞΌ\mu is high relative to all Οƒ2/Οƒi2\sigma^2/\sigma_i^2, so power is approximately equal across sub-channels. At low SNR, it is better to concentrate power on the strongest sub-channel (beamforming).

,

SVD Decomposition into Parallel Sub-Channels

SVD Decomposition into Parallel Sub-Channels
The SVD transforms the MIMO channel H\mathbf{H} into r=rank(H)r = \mathrm{rank}(\mathbf{H}) parallel scalar sub-channels. Precoding with V\mathbf{V} at the transmitter and combining with UH\mathbf{U}^H at the receiver diagonalises the channel. Water-filling allocates power piβˆ—p_i^* to each sub-channel based on its gain Οƒi\sigma_i.

SVD Decomposition of a MIMO Channel

Watch how the SVD decomposes a 4Γ—44 \times 4 MIMO channel into parallel sub-channels, with water-filling power allocation adapting to the singular value spread.
The animation shows the singular values of a random channel realisation, the resulting parallel sub-channels, and how water-filling allocates more power to stronger modes.

SVD Parallel Channels and Water-Filling

Visualise the SVD decomposition of a MIMO channel into parallel sub-channels and the water-filling power allocation. Adjust the channel singular values and total SNR to see how power is distributed.

Parameters
4
4
15

Type of channel realisation to generate

Example: Water-Filling Power Allocation

A 3Γ—33 \times 3 MIMO channel has singular values Οƒ1=2.0\sigma_1 = 2.0, Οƒ2=1.0\sigma_2 = 1.0, Οƒ3=0.3\sigma_3 = 0.3. The noise variance is Οƒ2=1\sigma^2 = 1 and total power is P=10P = 10.

(a) Find the water-filling power allocation.

(b) Compute the capacity.

(c) Compare with equal power allocation.

Common Mistake: Equal Power Allocation Is Near-Optimal at High SNR

Mistake:

Spending significant effort on water-filling at high SNR, or conversely, using equal power allocation at low SNR where it is highly suboptimal.

Correction:

At high SNR, the water level ΞΌ\mu is much larger than all Οƒ2/Οƒi2\sigma^2/\sigma_i^2, so piβˆ—β‰ˆP/rp_i^* \approx P/r for all active sub-channels. Equal power allocation loses very little.

At low SNR, water-filling concentrates all power on the strongest sub-channel (beamforming), which can provide several dB of gain over equal allocation. The transition occurs around SNRβ‰ˆΟƒ12/Οƒr2\text{SNR} \approx \sigma_1^2/\sigma_r^2 (ratio of strongest to weakest eigenvalue).

Quick Check

A 2Γ—22 \times 2 MIMO channel has singular values Οƒ1=3\sigma_1 = 3 and Οƒ2=0\sigma_2 = 0. What is the capacity at SNR =P/Οƒ2=20= P/\sigma^2 = 20 dB with optimal power allocation?

C=log⁑2(1+100Γ—9)=log⁑2(901)β‰ˆ9.82C = \log_2(1 + 100 \times 9) = \log_2(901) \approx 9.82 bits/s/Hz

C=2log⁑2(1+50Γ—9)β‰ˆ17.6C = 2 \log_2(1 + 50 \times 9) \approx 17.6 bits/s/Hz

C=log⁑2(1+100Γ—3)=log⁑2(301)β‰ˆ8.23C = \log_2(1 + 100 \times 3) = \log_2(301) \approx 8.23 bits/s/Hz

C=0C = 0 bits/s/Hz because one singular value is zero

Water-filling

The optimal power allocation strategy for parallel Gaussian channels that allocates more power to stronger sub-channels and less (or none) to weaker ones: piβˆ—=(ΞΌβˆ’Οƒ2/Οƒi2)+p_i^* = (\mu - \sigma^2/\sigma_i^2)^+.

Related: MIMO capacity, Spatial multiplexing

MIMO capacity

The maximum mutual information achievable over a MIMO channel, optimised over the input covariance matrix: C=max⁑Qlog⁑2det⁑(I+1Οƒ2HQHH)C = \max_{\mathbf{Q}} \log_2\det(\mathbf{I} + \frac{1}{\sigma^2}\mathbf{H}\mathbf{Q}\mathbf{H}^{H}).

Related: Water-filling, Channel rank