Parallel Gaussian Channels and Water-Filling
From One Channel to Many
Real wideband systems rarely face a single flat channel. An OFDM system sees parallel sub-channels with different gains; a MIMO system decomposes (via SVD) into parallel spatial streams. The fundamental question becomes: given a total power budget, how should we allocate power across sub-channels of unequal quality?
The answer β water-filling β is one of the most beautiful results in information theory, and it emerges naturally from the KKT conditions of a convex optimization. The same structure reappears in MIMO capacity (Chapter 13), OFDM power loading (Book telecom, Ch. 14), and even rate-distortion theory (Chapter 6) in its "reverse water-filling" form.
Definition: Parallel Gaussian Channel
Parallel Gaussian Channel
The parallel Gaussian channel consists of independent sub-channels:
where is the (known) gain of sub-channel and are independent noise samples.
The total power is constrained: , where is the power allocated to sub-channel .
Parallel Gaussian channel
A set of independent AWGN sub-channels with different gains sharing a total power budget. Arises naturally in OFDM (frequency sub-carriers), MIMO (spatial streams after SVD), and DSL (tones).
Related: Water-filling, AWGN channel
Theorem: Capacity of the Parallel Gaussian Channel
The capacity of the parallel Gaussian channel is
The optimal power allocation is given by the water-filling solution:
where is chosen so that . The resulting capacity is
Water-filling allocates more power to stronger sub-channels and less (or zero) to weak ones. The geometric picture is vivid: invert the channel gains to form a "bowl" with depth , then "pour water" up to a common level . The water depth at each position is the allocated power. Deep nulls (weak sub-channels) receive no water β it is better to shut them off entirely.
Independence gives additive capacity
Since the sub-channels are independent, the mutual information decomposes:
where the last equality uses the fact that the capacity-achieving input for each sub-channel is Gaussian: .
Formulate as convex optimization
We maximize a sum of concave functions (each is concave in ) over a convex constraint set (the simplex , ).
This is a convex problem, so the KKT conditions are necessary and sufficient. The point is that convexity guarantees a unique global optimum β the water-filling solution is not a heuristic but provably optimal.
KKT conditions yield water-filling
The Lagrangian is
Setting :
Complementary slackness: , . If , then and
Setting gives .
Water-filling
The optimal power allocation for parallel Gaussian channels: . Allocates more power to stronger sub-channels, shutting off the weakest ones entirely. Named for the geometric analogy of pouring water over an uneven surface.
Related: Parallel Gaussian channel
Example: Water-Filling with Three Sub-Channels
Consider sub-channels with gains , , , noise power , and total power . Find the water-filling power allocation and the capacity.
Compute the inverse gains
, , .
Try all three channels active
If all channels are active: , so . But then . Contradiction β channel 3 cannot be active.
Try two channels active
With channels 1 and 2 only: , so .
, , . Check: .
Compute capacity
|G_3|^2 = 0.1$) is shut off entirely β the power is better spent strengthening the already-good channels.
Why Convexity Matters Here
The water-filling solution is not just "a good idea" β it is provably optimal because the underlying optimization is convex. This has important consequences:
- Uniqueness. There is exactly one optimal power allocation (up to degenerate cases where a channel is exactly at the threshold).
- KKT sufficiency. Any point satisfying the KKT conditions is globally optimal β no need to check second-order conditions or worry about local maxima.
- Efficient computation. The water level can be found by a simple bisection on the power constraint, or even in closed form for small .
This "convexity reflex" β recognizing when a problem is convex and exploiting it β is a recurring theme throughout information theory.
Water-Filling Power Allocation
Water-Filling Power Allocation
Visualize the water-filling solution for parallel Gaussian channels. The "bowl" shows the inverted channel gains , and the water level determines the power allocation. Adjust the total power to see how weak channels are progressively activated as power increases.
Parameters
From ISI Channels to Parallel Channels via OFDM
The parallel Gaussian channel model is not merely theoretical β it is the operational model for OFDM (Orthogonal Frequency Division Multiplexing), the dominant wideband modulation in 4G, 5G, Wi-Fi, and DSL.
Consider a discrete-time channel with intersymbol interference (ISI): , where is the finite impulse response. By prepending a cyclic prefix (CP) of length to each block of symbols, linear convolution becomes circular convolution. The resulting circulant matrix is diagonalized by the DFT:
where . Applying IDFT at the transmitter and DFT at the receiver converts the ISI channel into parallel Gaussian sub-channels β and water-filling gives the optimal power allocation.
Definition: Capacity of the ISI Channel via OFDM
Capacity of the ISI Channel via OFDM
For a discrete-time ISI channel with subcarriers and cyclic prefix of length , the capacity per channel use is
where the factor accounts for the rate loss due to the cyclic prefix overhead.
In the limit :
where is the DTFT of the impulse response and satisfies .
Water-Filling Algorithm
Complexity: due to the initial sort. The repeat loop runs at most iterations.In practice, a bisection search on is numerically simpler.
OFDM Water-Filling over a Frequency-Selective Channel
Visualize water-filling over the frequency response of a multipath channel. The inverted channel forms the "bowl," and water is poured to level . Deep fades receive no power. Adjust total power to see how sub-carriers are activated.
Parameters
Cyclic Prefix Overhead in Practice
The factor in the OFDM capacity accounts for the rate loss due to the cyclic prefix. In 5G NR, the normal CP length is approximately for the 15 kHz subcarrier spacing, giving an overhead of about 7%. For extended CP (used in high-delay-spread environments), the overhead increases to about 25%.
The choice of (FFT size) involves a tradeoff: larger reduces CP overhead but increases sensitivity to Doppler and requires longer processing blocks. 5G NR supports FFT sizes from 128 to 4096 with subcarrier spacings from 15 to 240 kHz.
- β’
Normal CP overhead in 5G NR: ~7% for 15 kHz SCS
- β’
Extended CP overhead: ~25%, used for high delay spread
- β’
Larger FFT reduces CP overhead but increases Doppler sensitivity
Common Mistake: Equal Power Allocation Is Not Optimal
Mistake:
Distributing power equally across all sub-channels: for all .
Correction:
Equal power allocation ignores the channel gains and wastes power on weak sub-channels. Water-filling allocates more power to stronger channels and shuts off the weakest ones. The capacity gap between equal power and water-filling grows with channel variability. However, at high SNR the gap shrinks because all channels become active and the power differences become small relative to .
BICM Capacity Analysis
Caire, Taricco, and Biglieri introduced the mutual information analysis of Bit-Interleaved Coded Modulation (BICM), showing that BICM achieves a different (generally lower) mutual information than ideal coded modulation, but with significant practical advantages in complexity and flexibility. The BICM mutual information analysis is directly related to the parallel Gaussian channel capacity framework developed in this section β each bit level of the modulation can be viewed as a parallel sub-channel with its own effective SNR.
This work was later extended by Guill'en i F`abregas, Mart'inez, and Caire, who showed that BICM can be viewed as mismatched decoding and derived the corresponding error exponents.
Why This Matters: OFDM in 4G LTE and 5G NR
The parallel Gaussian channel and water-filling framework is the information-theoretic foundation of OFDM-based systems. In 4G LTE and 5G NR, each OFDM subcarrier is a parallel sub-channel, and the resource allocation problem (which subcarriers to use, how much power on each) is a practical instantiation of water-filling.
In practice, perfect water-filling is approximated by adaptive modulation and coding (AMC): the base station measures the channel quality on each subcarrier group (resource block) and selects the modulation order and code rate accordingly. This is "quantized water-filling" β a practical version of the continuous solution.
See Book telecom, Ch. 14 for the full treatment of OFDM and Book telecom, Ch. 17 for multiuser OFDMA resource allocation.
Quick Check
In a water-filling solution with sub-channels and total power , suppose two channels have . How many channels are active?
5
3
2
Cannot determine without the exact gains
A channel is active if and only if . Since two channels have inverse gains above the water level, they receive zero power. The remaining channels are active.