BICM on Fading Channels: The Diversity Theorem
From AWGN to Fading: What Changes
Everything in s01–s02 was AWGN: a deterministic channel where the PEP decays as with a constant , and where "bigger " is the only design knob. On a fading channel, the physics is different: the received signal is with random amplitude , and when is small the entire transmission is lost. The PEP is then an EXPECTATION over the fading realisation, and its high-SNR behaviour is not exponential but power-law: . The exponent is the diversity order — the number of independent fading samples that must all be bad for a decoding error to occur.
The central result of this section — and arguably the central result of the entire Caire–Taricco–Biglieri 1998 paper — is an exact formula for the diversity order of BICM: This formula replaces as the primary design criterion on fading. It tells the designer: pick a binary code with large and a labelling with large . The "diversity product" is what you want to maximise, not the AWGN exponent.
The point is that on fading, capacity alone does not tell the full story. Even if BICM's capacity is close to CM capacity, the operating SNR at a target BER depends on the DIVERSITY order achievable. The formula above is the design criterion, and it factors cleanly into "binary code" and "labelling" contributions — which is exactly what makes BICM modular.
Definition: Fully-Interleaved Rayleigh Fading Channel
Fully-Interleaved Rayleigh Fading Channel
The fully-interleaved Rayleigh fading channel has input-output relation at symbol time where is the transmitted constellation symbol, is the channel gain, and is complex Gaussian noise. The fading amplitudes satisfy (unit-mean exponential, the squared magnitude of a unit-variance random variable), and the sequence is i.i.d. — i.e., the interleaver is long enough to fully de-correlate consecutive symbol fadings. The receiver has perfect channel state information (CSI) .
"Fully interleaved" is an idealisation: it assumes the interleaver length is infinite relative to the coherence time . Section 5 quantifies what happens when is finite.
Definition: Diversity Order
Diversity Order
The diversity order of a scheme over a fading channel is provided the limit exists. Operationally, is the high-SNR slope of the log–log BER curve. A BER versus plot in dB–dB coordinates becomes a straight line at high SNR with slope . The coding gain is the horizontal (SNR) offset of that line relative to a reference curve with the same slope.
On AWGN the diversity order is infinite (the BER decays exponentially, not polynomially) — is a fading-specific concept. For an uncoded system with a single antenna, (one Rayleigh fade can wipe out the symbol). To exceed one must spread each information bit across multiple independent fading realisations — which is exactly what a binary code with bit-disagreements does when bits are fully interleaved.
Definition: Minimum Number of Distinct Bit Positions
Minimum Number of Distinct Bit Positions
For a labelling , define and the minimum number of distinct bit positions Informally, counts the smallest number of distinct constellation positions that differ in a single label bit across label pairs, minimised over bit positions and bit-value flips.
For most "reasonable" labellings on symmetric QAM/PSK constellations, — a single bit flip often corresponds to a single nearest-neighbour move. Both Gray and Set-Partition labellings on square QAM achieve . A labelling with would need every single bit flip to move across at least two distinct constellation positions simultaneously, a rare property.
Theorem: BICM Diversity Order on Fully-Interleaved Rayleigh Fading
Consider BICM with labelling and binary code of minimum Hamming distance on a fully-interleaved Rayleigh fading channel with receiver CSI. For any two codewords at Hamming distance , the PEP is bounded at high SNR by up to constants, where and are fading-specific averaging quantities. Consequently the diversity order of the BICM codeword error probability is For Gray labelling on square QAM, , so : the binary code's minimum Hamming distance converts directly into diversity order.
Here is the mental picture. The fully-interleaved channel turns the single fading channel into independent parallel sub-channels — one per differing coded bit. Each differing bit position is mapped to a label position with distinct constellation choices; in the Gray case, means each such bit goes through a single fading realisation. The diversity order is the number of independent fadings the error event must jointly survive, which is .
Compare this with the CM (no-BICM) case: on fading, TCM with SP labelling achieves diversity where is the trellis free distance in symbols. BICM with Gray achieves exactly — it trades the SP geometric structure for a simpler, more interleaver-friendly one.
Start from the conditional AWGN PEP (Thm. 1 of s01) applied to a single fading realisation, with effective SNR .
Average over the Rayleigh fading: for exponential with unit mean.
Under the fully-interleaved assumption, the differing coded bits see INDEPENDENT realisations; the joint PEP becomes a product of such factors.
Within each bit channel , the expectation further factorises over label-pair alternatives. Apply Jensen to get a lower bound on the exponent involving .
The high-SNR behaviour of is , so the diversity exponent is the total count of factors: .
Step 1: Conditional PEP given fading
Condition on the fading realisation . By Thm. 1 of s01 applied to the equivalent AWGN channel with per-symbol SNR , the conditional PEP for at Hamming distance is bounded by where is the conditional squared distance for the label pair visited by the interleaver at time .
Step 2: Averaging over Rayleigh fading
Take expectation over the i.i.d. unit-mean exponential . Using the MGF for , each factor contributes At high SNR (small ) this is approximately , i.e., decays as per bit position.
Step 3: Subset averaging inside a bit position
For a given bit position and a given bit flip , the "distance" depends on which specific label pair was visited. A second averaging over these possibilities gives At high SNR, the TOP factors of this product decay as each, contributing per bit position. Positions with fewer distinct alternatives (i.e., ) do not exist by definition of .
Step 4: Combine over $d$ bit disagreements
There are bit positions at which and disagree, and by the fully-interleaved assumption each sees INDEPENDENT fading and INDEPENDENT label-pair averaging. Multiplying: where is a labelling-dependent constant (the coding gain).
Step 5: Minimise over $d$ for the codeword error probability
The codeword error probability is dominated at high SNR by the pairs at the minimum Hamming distance . Therefore which is the stated formula.
BICM Diversity Order on Fading Channels
Sections V and VI of the Caire–Taricco–Biglieri 1998 paper establish the diversity formula for BICM on fully-interleaved Rayleigh fading. Here is the minimum Hamming distance of the binary code and is the minimum, over bit positions , of the minimum number of DISTINCT constellation positions whose labels differ in bit between opposite bit values .
For square QAM with Gray labelling, , and the formula collapses to : the binary code's Hamming distance IS the diversity order, directly. This is the clean operational conclusion that allows modern wireless standards to use a single LDPC or convolutional code across QAM orders without redesigning the fading performance: the code contributes , the Gray mapper contributes , and the product is the fading slope.
The theorem tells the code designer: on fully-interleaved fading, BICM with Gray labelling has exactly the same diversity as a binary code running over a parallel-channel model. No geometric labelling trickery is needed. This clarity is what made the BICM framework — rather than multilevel coding or trellis-coded modulation — the de facto standard for wireless communications after 2000. The result is among the most operationally important theorems in the theory of coded modulation, and the paper that introduced it remains one of the most-cited papers in IEEE Trans. Information Theory.
Key Takeaway
The BICM diversity order is . On a fully-interleaved Rayleigh fading channel, the high-SNR slope of the BICM BER curve is determined by the product of the binary code's minimum Hamming distance and the labelling's minimum-distinct-bit-position count. For Gray labelling on square QAM, and the slope equals . Design rule on fading: pick a code with large . The labelling choice affects only the coding gain, not the diversity slope.
BICM BER on Rayleigh Fading: Diversity Slopes
Average BER of BICM on fully-interleaved Rayleigh fading, computed from the MGF-based PEP bound, for a family of binary codes with minimum Hamming distances under Gray labelling (). Each curve is asymptotically a straight line of slope on the log–log plot — this is the diversity theorem made visible. Compare the curves to see that doubling doubles the diversity slope; increasing ( size) hurts the coding gain but does NOT change the slope.
Parameters
Gray vs Set-Partition on Rayleigh: Diversity Slope Is the Same
At fixed binary code , plot the BICM BER on fully-interleaved Rayleigh fading for Gray and Set-Partition labellings on 16-QAM. Both curves have slope at high SNR (since for 16-QAM), but Gray is a constant number of dB better because its coding gain is higher. This is a different story from AWGN (where Gray won on slope — no, slope is infinite on AWGN — wins on exponent), and different from BICM-ID (Ch. 8) where SP wins at low BER once iterative feedback is added.
Parameters
Example: Computing for Gray 16-QAM
Show directly from the definition that for square Gray-labelled 16-QAM, and compute for a rate-, constraint-length-5 convolutional code with free distance .
Gray 16-QAM labelling structure
Arrange the 16 constellation points on a grid with the standard Gray-coded PAM labelling on each axis. Bit 0 (I-axis LSB) flips between "left outer" and "left inner" — specifically, the constellation points and have bit 0 opposite, for every .
Find a bit position with a single-distinct-pair flip
Look at bit 0 of the I-axis label. For we have the set ; for , the set . For the specific points — same , adjacent I-axis bits differing in bit 0 — we have only ONE pair at the nearest-neighbour distance . Therefore for this specific bit-flip direction involving nearest neighbours.
Verify for other bit positions
By the symmetry of the Gray-PAM labelling, the same single-pair condition holds at bits 1, 2, 3 (with appropriate relabelling for the Q-axis). Hence .
Diversity order of the convolutional BICM
By Thm. 3, on fully-interleaved Rayleigh the convolutional-code + Gray-16-QAM BICM has diversity order This is the same diversity order as the binary code running over seven independent Rayleigh fades — the mapper contributes no additional geometric diversity, only coding gain.
Example: Diversity of a Rate-1/2, K=5 Convolutional Code with 64-QAM BICM
The industry-standard rate-, constraint-length- convolutional code (generators , NASA/CCSDS, widely used in HSPA and satellite) has free distance . It is paired with 64-QAM and Gray labelling in a BICM system. Compute the BICM diversity order on fully-interleaved Rayleigh fading, and estimate the SNR needed for .
Diversity order
For Gray-labelled 64-QAM, (same argument as the 16-QAM example: each bit position has nearest-neighbour pairs at the I- or Q-minimum-distance). Hence
High-SNR BER estimate
The dominant-pair approximation gives where is the input-weight-at-free-distance coefficient (for this code, , from the standard transfer function) and (rate , binary input). Setting , : In practice the hidden constants and 64-QAM spectral-efficiency penalty push the required to about dB at BER — consistent with LTE 64-QAM field measurements in mobility.
Comparison: what would diversity $d = 1$ cost?
An uncoded 64-QAM would have diversity , so . For this requires — roughly dB worse than the coded system at the same spectral efficiency. The convolutional code's converts to a dB diversity-gain advantage. This is the entire value proposition of BICM on fading.
Common Mistake: The Slope Only Shows at High SNR
Mistake:
A common trap in reading simulation curves is to look at BER and try to read off the slope — then conclude "the code has diversity order 3" because that matches the slope locally.
Correction:
Thm. 3 predicts the ASYMPTOTIC slope: as . At BERs around the curve is still in the "waterfall" region where many error patterns contribute non-negligibly and the union bound is loose. The asymptotic slope becomes visible only in the "error floor" region, typically below for practical codes. Simulation curves plotted in log–dB coordinates should be checked at several SNR values to confirm the slope has settled.
Why This Matters: From Convolutional Codes to Modern LDPC: Diversity Still Rules
The diversity theorem in this section uses minimum Hamming distance — which was the right invariant for convolutional codes in the 1990s, where or was the design target. Modern wireless standards use LDPC codes (LTE, 5G NR, Wi-Fi) and polar codes (5G NR control channels), where "minimum distance" is replaced by the smaller-weight codewords in the weight enumerator . The same formula still applies operationally: the asymptotic diversity slope is governed by the smallest with , which for a well-designed LDPC code is typically –. The product stays the right invariant, and the fact that for Gray QAM is why LDPC-BICM achieves essentially the LDPC's own diversity regardless of QAM order. This modularity — "plug any good binary code into the Gray-QAM BICM frontend" — is the engineering payoff of the diversity theorem.
Quick Check
For Gray-labelled 64-QAM BICM with a rate-, constraint-length- convolutional code () on fully-interleaved Rayleigh fading, what is the diversity order ?
By Thm. 3, on Gray-labelled 64-QAM (the fact that for any square QAM is the reason QAM order doesn't matter for diversity).
Diversity Order
The high-SNR slope of a fading-channel error-probability curve: . For BICM with labelling and binary code of minimum Hamming distance on fully-interleaved Rayleigh fading, .
Related: Coding Gain, BICM Codeword Pair and Hamming Distance, Rayleigh Fading
Coding Gain
At fixed diversity order , the horizontal (SNR-axis) offset between the BER curve of a coded scheme and an uncoded reference with the same diversity , both plotted in dB–dB coordinates. Measured in dB. For BICM, coding gain depends on labelling through and through the multiplicities of the binary code's weight spectrum.
Related: Diversity Order, Average Squared Intra-Subset Euclidean Distance , Weight Enumerator and Input-Weight Multiplicities