Parallel Gaussian Channels and Water-Filling

From One Channel to Many

Real wideband systems rarely face a single flat channel. An OFDM system sees KK parallel sub-channels with different gains; a MIMO system decomposes (via SVD) into parallel spatial streams. The fundamental question becomes: given a total power budget, how should we allocate power across sub-channels of unequal quality?

The answer β€” water-filling β€” is one of the most beautiful results in information theory, and it emerges naturally from the KKT conditions of a convex optimization. The same structure reappears in MIMO capacity (Chapter 13), OFDM power loading (Book telecom, Ch. 14), and even rate-distortion theory (Chapter 6) in its "reverse water-filling" form.

Definition:

Parallel Gaussian Channel

The parallel Gaussian channel consists of KK independent sub-channels:

Yk=GkXk+Zk,k=1,…,K,Y_k = G_k X_k + Z_{k}, \quad k = 1, \ldots, K,

where Gk∈CG_k \in \mathbb{C} is the (known) gain of sub-channel kk and Zk∼CN(0,N0)Z_{k} \sim \mathcal{CN}(0, N_0) are independent noise samples.

The total power is constrained: βˆ‘k=1KE[∣Xk∣2]≀Es\sum_{k=1}^K \mathbb{E}[|X_k|^2] \leq E_s, where Esk=E[∣Xk∣2]{E_s}_{k} = \mathbb{E}[|X_k|^2] is the power allocated to sub-channel kk.

Parallel Gaussian channel

A set of KK independent AWGN sub-channels with different gains G1,…,GKG_1, \ldots, G_K sharing a total power budget. Arises naturally in OFDM (frequency sub-carriers), MIMO (spatial streams after SVD), and DSL (tones).

Related: Water-filling, AWGN channel

Theorem: Capacity of the Parallel Gaussian Channel

The capacity of the parallel Gaussian channel is

C(Es)=max⁑Es1,…,EsKβ‰₯0βˆ‘kEsk≀Esβˆ‘k=1Klog⁑ ⁣(1+∣Gk∣2EskN0).C(E_s) = \max_{\substack{{E_s}_{1}, \ldots, {E_s}_{K} \geq 0 \\ \sum_k {E_s}_{k} \leq E_s}} \sum_{k=1}^K \log\!\left(1 + \frac{|G_k|^2 {E_s}_{k}}{N_0}\right).

The optimal power allocation is given by the water-filling solution:

Eskβˆ—=[Ξ½βˆ’N0∣Gk∣2]+,{E_s}_{k}^* = \left[\nu - \frac{N_0}{|G_k|^2}\right]_+,

where Ξ½\nu is chosen so that βˆ‘kEskβˆ—=Es\sum_k {E_s}_{k}^* = E_s. The resulting capacity is

C(Es)=βˆ‘k=1K[log⁑ ⁣(Ξ½β€‰βˆ£Gk∣2N0)]+.C(E_s) = \sum_{k=1}^K \left[\log\!\left(\frac{\nu\, |G_k|^2}{N_0}\right)\right]_+.

Water-filling allocates more power to stronger sub-channels and less (or zero) to weak ones. The geometric picture is vivid: invert the channel gains to form a "bowl" with depth N0/∣Gk∣2N_0/|G_k|^2, then "pour water" up to a common level Ξ½\nu. The water depth at each position is the allocated power. Deep nulls (weak sub-channels) receive no water β€” it is better to shut them off entirely.

Water-filling

The optimal power allocation for parallel Gaussian channels: Eskβˆ—=[Ξ½βˆ’N0/∣Gk∣2]+{E_s}_{k}^* = [\nu - N_0/|G_k|^2]_+. Allocates more power to stronger sub-channels, shutting off the weakest ones entirely. Named for the geometric analogy of pouring water over an uneven surface.

Related: Parallel Gaussian channel

Example: Water-Filling with Three Sub-Channels

Consider K=3K = 3 sub-channels with gains ∣G1∣2=10|G_1|^2 = 10, ∣G2∣2=2|G_2|^2 = 2, ∣G3∣2=0.1|G_3|^2 = 0.1, noise power N0=1N_0 = 1, and total power Es=1E_s = 1. Find the water-filling power allocation and the capacity.

Why Convexity Matters Here

The water-filling solution is not just "a good idea" β€” it is provably optimal because the underlying optimization is convex. This has important consequences:

  1. Uniqueness. There is exactly one optimal power allocation (up to degenerate cases where a channel is exactly at the threshold).
  2. KKT sufficiency. Any point satisfying the KKT conditions is globally optimal β€” no need to check second-order conditions or worry about local maxima.
  3. Efficient computation. The water level Ξ½\nu can be found by a simple bisection on the power constraint, or even in closed form for small KK.

This "convexity reflex" β€” recognizing when a problem is convex and exploiting it β€” is a recurring theme throughout information theory.

Water-Filling Power Allocation

Watch water pour over the inverted channel gains N0/∣Gk∣2N_0/|G_k|^2. As total power increases, the water level ν\nu rises and progressively activates weaker sub-channels. The optimal power on each sub-channel equals the water depth above its bowl level.

Water-Filling Power Allocation

Visualize the water-filling solution for parallel Gaussian channels. The "bowl" shows the inverted channel gains N0/∣Gk∣2N_0/|G_k|^2, and the water level ν\nu determines the power allocation. Adjust the total power to see how weak channels are progressively activated as power increases.

Parameters
8
5

From ISI Channels to Parallel Channels via OFDM

The parallel Gaussian channel model is not merely theoretical β€” it is the operational model for OFDM (Orthogonal Frequency Division Multiplexing), the dominant wideband modulation in 4G, 5G, Wi-Fi, and DSL.

Consider a discrete-time channel with intersymbol interference (ISI): Yi=βˆ‘β„“=0Lgβ„“Xiβˆ’β„“+ZiY_i = \sum_{\ell=0}^L g_\ell X_{i-\ell} + Z_{i}, where (g0,…,gL)(g_0, \ldots, g_L) is the finite impulse response. By prepending a cyclic prefix (CP) of length LL to each block of KK symbols, linear convolution becomes circular convolution. The resulting KΓ—KK \times K circulant matrix is diagonalized by the DFT:

G=FHdiag(G0,…,GKβˆ’1)F,\mathbf{G} = \mathbf{F}^H \text{diag}(G_0, \ldots, G_{K-1}) \mathbf{F},

where Gk=βˆ‘β„“=0Lgβ„“eβˆ’j2Ο€kβ„“/KG_k = \sum_{\ell=0}^L g_\ell e^{-j2\pi k\ell/K}. Applying IDFT at the transmitter and DFT at the receiver converts the ISI channel into KK parallel Gaussian sub-channels β€” and water-filling gives the optimal power allocation.

Definition:

Capacity of the ISI Channel via OFDM

For a discrete-time ISI channel with KK subcarriers and cyclic prefix of length LL, the capacity per channel use is

CK(Es)=KK+Lβ‹…1Kβˆ‘k=0Kβˆ’1[log⁑ ⁣(Ξ½β€‰βˆ£Gk∣2N0)]+,C_{K}(E_s) = \frac{K}{K+L} \cdot \frac{1}{K}\sum_{k=0}^{K-1} \left[\log\!\left(\frac{\nu\, |G_k|^2}{N_0}\right)\right]_+,

where the factor K/(K+L)K/(K+L) accounts for the rate loss due to the cyclic prefix overhead.

In the limit Kβ†’βˆžK \to \infty:

C(P)=βˆ«βˆ’1/21/2[log⁑ ⁣(Ξ½β€‰βˆ£G(ΞΎ)∣2N0)]+dΞΎ,C(\mathcal{P}) = \int_{-1/2}^{1/2} \left[\log\!\left(\frac{\nu\, |G(\xi)|^2}{N_0}\right)\right]_+ d\xi,

where G(ΞΎ)=βˆ‘β„“=0Lgβ„“eβˆ’j2πξℓG(\xi) = \sum_{\ell=0}^L g_\ell e^{-j2\pi\xi\ell} is the DTFT of the impulse response and Ξ½\nu satisfies βˆ«βˆ’1/21/2[Ξ½βˆ’N0/∣G(ΞΎ)∣2]+dΞΎ=P\int_{-1/2}^{1/2} [\nu - N_0/|G(\xi)|^2]_+ d\xi = \mathcal{P}.

Water-Filling Algorithm

Complexity: O(Klog⁑K)O(K \log K) due to the initial sort. The repeat loop runs at most KK iterations.
Input: Channel gains ∣G1∣2,…,∣GK∣2|G_1|^2, \ldots, |G_K|^2, noise power N0N_0, total power EsE_s
Output: Power allocation Es1βˆ—,…,EsKβˆ—{E_s}_{1}^*, \ldots, {E_s}_{K}^*
1. Sort sub-channels by effective noise level: N0/∣GΟ€(1)∣2≀⋯≀N0/∣GΟ€(K)∣2N_0/|G_{\pi(1)}|^2 \leq \cdots \leq N_0/|G_{\pi(K)}|^2
2. Set A←{1,…,K}\mathcal{A} \leftarrow \{1, \ldots, K\} (active set)
3. repeat
a. Compute water level: Ξ½=1∣A∣(Es+βˆ‘k∈AN0∣Gk∣2)\nu = \frac{1}{|\mathcal{A}|}\left(E_s + \sum_{k \in \mathcal{A}} \frac{N_0}{|G_k|^2}\right)
b. Compute power: Esk=Ξ½βˆ’N0/∣Gk∣2{E_s}_{k} = \nu - N_0/|G_k|^2 for k∈Ak \in \mathcal{A}
c. if βˆƒβ€‰k∈A\exists\, k \in \mathcal{A} with Esk<0{E_s}_{k} < 0:
Remove the worst channel from A\mathcal{A}
d. else: STOP
4. Set Esk=0{E_s}_{k} = 0 for kβˆ‰Ak \notin \mathcal{A}
5. return (Es1βˆ—,…,EsKβˆ—)({E_s}_{1}^*, \ldots, {E_s}_{K}^*)

In practice, a bisection search on Ξ½\nu is numerically simpler.

OFDM Water-Filling over a Frequency-Selective Channel

Visualize water-filling over the frequency response of a multipath channel. The inverted channel N0/∣G(ξ)∣2N_0/|G(\xi)|^2 forms the "bowl," and water is poured to level ν\nu. Deep fades receive no power. Adjust total power to see how sub-carriers are activated.

Parameters
4
64
10
⚠️Engineering Note

Cyclic Prefix Overhead in Practice

The factor K/(K+L)K/(K+L) in the OFDM capacity accounts for the rate loss due to the cyclic prefix. In 5G NR, the normal CP length is approximately L=K/14L = K/14 for the 15 kHz subcarrier spacing, giving an overhead of about 7%. For extended CP (used in high-delay-spread environments), the overhead increases to about 25%.

The choice of KK (FFT size) involves a tradeoff: larger KK reduces CP overhead but increases sensitivity to Doppler and requires longer processing blocks. 5G NR supports FFT sizes from 128 to 4096 with subcarrier spacings from 15 to 240 kHz.

Practical Constraints
  • β€’

    Normal CP overhead in 5G NR: ~7% for 15 kHz SCS

  • β€’

    Extended CP overhead: ~25%, used for high delay spread

  • β€’

    Larger FFT reduces CP overhead but increases Doppler sensitivity

πŸ“‹ Ref: 3GPP TS 38.211

Common Mistake: Equal Power Allocation Is Not Optimal

Mistake:

Distributing power equally across all sub-channels: Esk=Es/K{E_s}_{k} = E_s/K for all kk.

Correction:

Equal power allocation ignores the channel gains and wastes power on weak sub-channels. Water-filling allocates more power to stronger channels and shuts off the weakest ones. The capacity gap between equal power and water-filling grows with channel variability. However, at high SNR the gap shrinks because all channels become active and the power differences become small relative to N0/∣Gk∣2N_0/|G_k|^2.

πŸŽ“CommIT Contribution(1998)

BICM Capacity Analysis

G. Caire, G. Taricco, E. Biglieri β€” IEEE Trans. Inform. Theory, vol. 44, no. 3, pp. 927-946

Caire, Taricco, and Biglieri introduced the mutual information analysis of Bit-Interleaved Coded Modulation (BICM), showing that BICM achieves a different (generally lower) mutual information than ideal coded modulation, but with significant practical advantages in complexity and flexibility. The BICM mutual information analysis is directly related to the parallel Gaussian channel capacity framework developed in this section β€” each bit level of the modulation can be viewed as a parallel sub-channel with its own effective SNR.

This work was later extended by Guill'en i F`abregas, Mart'inez, and Caire, who showed that BICM can be viewed as mismatched decoding and derived the corresponding error exponents.

BICMcoded modulationparallel channelsView Paper β†’

Why This Matters: OFDM in 4G LTE and 5G NR

The parallel Gaussian channel and water-filling framework is the information-theoretic foundation of OFDM-based systems. In 4G LTE and 5G NR, each OFDM subcarrier is a parallel sub-channel, and the resource allocation problem (which subcarriers to use, how much power on each) is a practical instantiation of water-filling.

In practice, perfect water-filling is approximated by adaptive modulation and coding (AMC): the base station measures the channel quality on each subcarrier group (resource block) and selects the modulation order and code rate accordingly. This is "quantized water-filling" β€” a practical version of the continuous solution.

See Book telecom, Ch. 14 for the full treatment of OFDM and Book telecom, Ch. 17 for multiuser OFDMA resource allocation.

Quick Check

In a water-filling solution with K=5K = 5 sub-channels and total power Es=2E_s = 2, suppose two channels have N0/∣Gk∣2>νN_0/|G_k|^2 > \nu. How many channels are active?

5

3

2

Cannot determine without the exact gains