Ergodic Capacity with CSIR Only
What Can We Achieve Without Telling the Transmitter?
In many wireless systems, the receiver can estimate the channel through pilot symbols, but feeding this information back to the transmitter is costly or infeasible (especially in FDD systems or high-mobility scenarios). This raises a natural question: if only the receiver knows the channel, what is the best rate we can reliably achieve?
The remarkable answer is that for ergodic fading, the capacity with CSIR only is achieved by a simple strategy: transmit at constant power regardless of the channel state and let the decoder, which knows , handle the varying SNR. No transmitter adaptation is needed for the ergodic rate.
Theorem: Ergodic Capacity with CSIR Only
Consider the scalar fading channel with , average power constraint , and ergodic fading with . If the receiver knows (CSIR) but the transmitter does not, the capacity is
where . The capacity is achieved by i.i.d. Gaussian inputs (constant power, independent of ).
The decoder knows , so for each fading realization , the channel looks like an AWGN channel with SNR . By coding across many fading states, the effective rate averages to . The transmitter does not need to know because it uses constant power β the law of large numbers does the averaging.
Think of the fading channel as a collection of parallel AWGN sub-channels, one for each fading state.
For a given , the mutual information is . Average over .
Show that constant-power Gaussian input maximizes for each state, so no power adaptation helps.
Conditional mutual information
For a given fading realization , the channel is with . This is an AWGN channel with gain . From Chapter 10, the mutual information for input distribution with is maximized by , yielding
Notice that the maximizing input distribution does not depend on . This is the key observation.
Averaging over fading states
Since the fading is ergodic, the coding theorem for compound channels tells us that the capacity is
Since maximizes for every simultaneously, it also maximizes the expectation. Therefore
Achievability
Achievability follows from random coding over a block of length that spans many independent fading realizations. Generate a codebook of codewords, each i.i.d. . The decoder, knowing , uses joint typicality decoding.
For each fading state , the channel behaves as , and the decoder knows . By the ergodic theorem, almost surely.
Standard random coding analysis shows that any rate below this limit is achievable with vanishing error probability.
Theorem: Fading Reduces Ergodic Capacity (Jensen's Inequality)
For the ergodic fading channel with CSIR only,
Equality holds if and only if is deterministic (no fading). The gap measures the capacity penalty due to fading.
The function is strictly concave. By Jensen's inequality, the average of a concave function of a random variable is less than or equal to the concave function of the average. Intuitively, the capacity loss from deep fades is not compensated by the gain from strong fades β the saturates, so strong states help less than weak states hurt.
Apply Jensen's inequality
The function is strictly concave in (since ). Applying Jensen's inequality to :
where the last equality uses .
Equality condition
Equality in Jensen's inequality holds if and only if is a constant a.s., which requires a.s. (no fading). For any non-degenerate fading distribution, the inequality is strict.
The Fading Penalty Is Often Modest
While fading always hurts compared to AWGN (at the same average SNR), the loss is often surprisingly small for ergodic fading:
- For Rayleigh fading at dB, the ergodic capacity is about 90% of the AWGN capacity.
- At low SNR, the penalty is negligible because is nearly linear, so Jensen's inequality is nearly tight.
- At high SNR, the penalty approaches a constant gap (in bits) rather than growing with .
The real impact of fading manifests in the outage regime (Section 13.4), where the occasional deep fade causes catastrophic failure for a single codeword.
Example: Ergodic Capacity of Rayleigh Fading Channel
Compute the ergodic capacity of a Rayleigh fading channel () with CSIR only at dB. Compare with the AWGN capacity.
Set up the integral
For Rayleigh fading, with PDF for . The ergodic capacity is
Evaluate numerically
At (i.e., 10 dB), numerical integration gives
The AWGN capacity at the same SNR is
Compare
The fading penalty is bits, or about 8.7% of the AWGN capacity. This confirms that the ergodic capacity loss from Rayleigh fading is modest at moderate SNR.
In closed form, the Rayleigh ergodic capacity can be written using the exponential integral: , where .
Ergodic Capacity: Fading vs. AWGN
Compare the ergodic capacity of a Rayleigh fading channel (CSIR only) with the AWGN channel capacity as a function of . Observe that fading always reduces capacity (Jensen's inequality) but the gap remains modest, especially at low and moderate SNR.
Parameters
Upper limit of the SNR range in decibels
Distribution of the fading gain $|H|^2$
Low-SNR and High-SNR Behavior
The ergodic capacity has clean asymptotic forms:
Low SNR (): Since for small ,
Fading causes no capacity loss at low SNR. Intuitively, the log is nearly linear, so Jensen's inequality is nearly tight.
High SNR (): We have , so
The gap to AWGN capacity is , which is a constant independent of . For Rayleigh fading, bits, where is the Euler-Mascheroni constant. The high-SNR gap is thus about 0.42 bits.
Common Mistake: Constant Power Is Optimal Only for Ergodic Capacity
Mistake:
Concluding from the CSIR ergodic capacity result that constant-power transmission is always optimal, regardless of the performance metric.
Correction:
Constant power is optimal for ergodic capacity because the mutual information is maximized by for every simultaneously. But for other metrics β such as minimizing outage probability, maximizing delay-limited capacity, or optimizing throughput with hybrid ARQ β power adaptation with CSIT can provide significant gains. The next section shows that water-filling over fading states increases ergodic capacity when CSIT is available.
Quick Check
For the ergodic fading channel with CSIR only, why does constant-power transmission achieve capacity?
Because the transmitter has no information to adapt to
Because maximizes for every , so no state-dependent power control can help
Because Jensen's inequality forces equal power allocation
Because the channel is memoryless
The optimal input for each AWGN sub-channel (with gain ) is Gaussian with power , regardless of . Since the same input distribution is optimal for every fading state, there is nothing to be gained by adapting the power to .
Channel Estimation Overhead Reduces Effective Rate
The ergodic capacity formula assumes perfect CSIR, but in practice the receiver must estimate the channel from pilot symbols. In an OFDM system with time slots per coherence interval and frequency bins per coherence bandwidth, a block of resource elements is available. Of these, must be used for pilots, leaving for data.
The effective throughput is approximately
where accounts for the estimation error. For rapidly varying channels (small ), the overhead can consume a significant fraction of the resources.
- β’
Pilot density must satisfy to estimate all transmit dimensions
- β’
Channel estimation error adds an irreducible noise floor proportional to
- β’
In 5G NR, DMRS patterns allocate 1-4 OFDM symbols per slot for channel estimation
Ergodic capacity
The maximum achievable rate for reliable communication over a fading channel when the codeword spans many independent fading realizations. Given by for the scalar fading channel with CSIR.
Related: CSIR, Rayleigh fading
Key Takeaway
The ergodic capacity with CSIR only is , achieved by constant-power Gaussian transmission. Fading always reduces capacity compared to AWGN (by Jensen's inequality), but the loss is modest for ergodic channels. The transmitter does not need to know the channel to achieve the ergodic rate β CSIR alone suffices.