Coding Gain vs. Bandwidth Efficiency
The Two Currencies of the AWGN Channel
Every communication scheme over an AWGN channel spends two resources: bandwidth and power. Bandwidth limits how many independent complex dimensions per second we can use; power limits how much energy per dimension we can pour in. The point is that these two resources are not interchangeable in a continuous way — they trade off along the Shannon curve bits per dimension, and every uncoded scheme sits below this curve by an amount that we can measure in two natural ways.
Either we fix the rate (bits per channel use) and ask how much more power the uncoded scheme needs than Shannon — this gives a power gap in dB at fixed rate. Or we fix the SNR and ask how many fewer bits per dimension the uncoded scheme carries — this gives a rate gap at fixed SNR.
A good code recovers some of that gap. Historically, practitioners quote the recovered power at fixed rate, and call it the coding gain. Understanding what drives coding gain, and how it trades against bandwidth expansion, is the first step toward the design criteria we will pursue throughout this book.
Definition: Spectral Efficiency
Spectral Efficiency
Let a coded modulation scheme transmit information bits per -dimensional signal vector at rate signal vectors per second over a real bandwidth Hz. Its spectral efficiency is
or equivalently, bits per second per hertz of (complex-baseband) bandwidth whenever .
By convention in this book, we always measure in bits per 2D signal-space dimension, so that an uncoded -QAM scheme has , 8-PSK has , BPSK has bit/2D (since BPSK uses only one real dimension per two available).
The choice of 2D as the reference dimension is historical and pragmatic: most passband schemes naturally deliver two real dimensions per symbol (in-phase and quadrature), and spectral-efficiency tables in standards documents are usually written in these units.
Definition: Coding Gain
Coding Gain
Fix a target symbol- or codeword-error probability and a spectral efficiency . Let be the SNR required by a reference uncoded scheme of the same to reach , and let be the SNR required by the coded scheme to reach the same at the same . The coding gain of the coded scheme over the reference is
Two variants are in common use:
- Asymptotic coding gain. Taking , the ratio tends to times a rate penalty, giving a limiting asymptotic gain independent of the operating .
- Real (operational) coding gain. At any finite the coding gain is smaller than the asymptotic value because the coded scheme's error probability is controlled by the full distance spectrum and the number of nearest neighbors, not just .
vs. : Two Normalizations, Same Story
The ratio is the SNR per transmitted symbol; is the SNR per information bit. They are related by
where is the number of information bits per channel use. When comparing schemes of different spectral efficiency, is the meaningful axis (as in the Shannon limit curve). When comparing schemes at the same , either axis works and the horizontal shift is identical.
Uncoded AWGN BER Curves for BPSK, QPSK, and M-QAM
Bit-error probability on the AWGN channel for the standard constellations. At the same bit-error rate, different constellations require very different , and the horizontal gap between any two curves is the rate penalty of the higher-order constellation. Every dB we later recover with a code is measured against these baseline curves.
Parameters
Theorem: Uncoded BER for Gray-Labeled Square M-QAM
For a square -QAM constellation with , even, Gray labeling, and maximum-likelihood detection over AWGN, the bit-error probability satisfies
where the approximation holds at high SNR and absorbs a double-count of corner points. In particular, the exponent in the function is , which decreases as grows — this is the quantitative statement of the bandwidth-for-power tradeoff.
Each Gray-labeled symbol error typically flips a single bit, which is why the factor appears. The argument of the function is , and for square QAM . Substituting yields the displayed expression.
Compute for a square -QAM of energy — use that the constellation is a rectangular grid of spacing .
Express the symbol-error probability as a union bound over the nearest neighbors of a typical interior point.
Convert symbol error to bit error by dividing by , which is valid only under Gray labeling at high SNR.
Minimum distance of square M-QAM
Place the -QAM constellation on the grid on each axis times . Its average energy is
Solving for the minimum distance, .
Symbol-error probability via nearest-neighbor union bound
An interior constellation point has four nearest neighbors at distance . A boundary point has fewer. Under ML detection on AWGN with noise variance per real dimension, each pairwise decision is
Averaging the number of nearest neighbors over the constellation gives the factor , which accounts for the reduced neighborhood at boundary points.
Per-bit conversion under Gray labeling
With Gray labeling, at high SNR the dominant symbol error events flip exactly one bit out of . Dividing by and substituting yields
Example: The Cost of Going from 16-QAM to 64-QAM
A system currently transmits at bits/2D using 16-QAM at a target . The operator wants to go to bits/2D by switching to 64-QAM while keeping the same . How much additional will this require, approximately?
Express the argument of $\ntn{qfn}$ in each case
From the theorem above, the argument of the function is . For the same we need the argument to be approximately the same value, call it . So
Take the ratio
The factor evaluates to for and for . Their ratio is , which in decibels is dB.
Interpret
Going from 16-QAM to 64-QAM costs roughly 4.5 dB in for the same . This is the rate penalty we spend for every additional bit per dimension we demand — and it is the gap that a good code hopes to recover.
Definition: Power-Limited vs. Bandwidth-Limited Regimes
Power-Limited vs. Bandwidth-Limited Regimes
We say an operating point is power-limited if (many degrees of freedom, little power per dimension) and bandwidth-limited if (few dimensions, lots of power per dimension). The boundary bit/2D separates the two regimes.
In the power-limited regime, the Shannon limit approaches dB and additional bandwidth is essentially free, so codes with large bandwidth expansion (e.g., low-rate turbo, LDPC, and orthogonal signaling) are the right choice. In the bandwidth-limited regime, every doubling of costs roughly 3 dB in power and bandwidth is scarce; here the goal of coding is to recover power without consuming bandwidth — this is the territory of coded modulation.
The split between "binary coding over a binary input channel" and "coded modulation over a QAM channel" tracks this boundary almost exactly. Below 1 bit/2D, concatenation of a binary capacity-approaching code with BPSK is already near-optimal; above 1 bit/2D, doing the same with a QAM mapper loses a sizeable chunk of capacity unless the mapping is designed with care.
Key Takeaway
Coding gain and bandwidth expansion are two separate knobs. A code of rate expands bandwidth by and can therefore recover power in the power-limited regime. A coded modulation scheme keeps the bandwidth fixed and recovers power by exploiting the signal-space structure directly — this is the only option in the bandwidth-limited regime, where binary coding over a QAM mapper already costs too much.
Quick Check
Two uncoded systems operate at the same bit-error rate on an AWGN channel. System A is 4-QAM; system B is 64-QAM. Approximately how much more does System B require than System A?
about 2 dB
about 5 dB
about 8 dB
about 12 dB
The argument of the function in the high-SNR BER expression scales as : it equals for 4-QAM and for 64-QAM, a ratio of , i.e. about dB. In practice the gap at is approximately dB.
Common Mistake: Coding gain is measured against a matched-rate uncoded reference
Mistake:
Reporting that a code achieves "5 dB of coding gain" without specifying the reference constellation and the target error rate.
Correction:
Coding gain is always relative. A 5 dB gain over uncoded 16-QAM at is not the same thing as 5 dB of gain over uncoded QPSK. Always state (i) the baseline constellation, (ii) the target error probability, and (iii) the spectral efficiency at which the comparison is made.
Coding gain
The horizontal distance in dB between the -vs- curve of a coded scheme and that of a reference uncoded scheme of the same spectral efficiency, at a fixed target error probability.
Related: Shaping gain, Spectral Efficiency, Minimum Euclidean distance
Spectral efficiency
Number of information bits transmitted per two-dimensional signal-space symbol, or equivalently per hertz of complex baseband bandwidth. Denoted in this book.
Related: Coding Gain, Capacity Gap
Why This Matters: From the Spectral-Efficiency Plane to 5G NR MCS Tables
Modern cellular standards encode the coded-modulation tradeoff as an MCS (modulation and coding scheme) table: every MCS index picks a constellation (QPSK, 16-QAM, 64-QAM, 256-QAM) and an LDPC code rate, together implementing a specific operating point. The base station chooses the MCS by estimating the channel SNR and selecting the highest- entry whose required SNR is below the measured one. The structure of the table — a staircase of (constellation, code rate) pairs climbing through the spectral-efficiency plane — is a direct engineering reflection of the theory we develop in this chapter.