Union-Bound Analysis of Coded BICM
From a Single PEP to the Bit Error Probability
Theorems 1 and 3 bounded the pairwise error probability for two BICM codewords at Hamming distance . A real receiver, however, faces ALL competing codewords at once. The passage from "one PEP" to "codeword error probability" is the union bound, and from "codeword error probability" to "bit error probability" is a weight-enumerator weighted sum. In this section we assemble those two bounds into the standard BICM union-bound BER formula, discuss where it is tight (error floor) and where it is loose (waterfall), and flag tighter alternatives (Poltyrev, sphere packing) that will be developed in Ch. 7.
Definition: Weight Enumerator and Input-Weight Multiplicities
Weight Enumerator and Input-Weight Multiplicities
For a binary linear code of block length and dimension , the weight enumerator is For a convolutional code, the relevant quantity for BER analysis is the input-weight enumerator where is the number of information bits in error per weight- error event. We write for β the total input-weight contribution of weight- error events.
For a rate-, constraint-length- convolutional code (generators ), the transfer-function analysis of Viterbi (1971) gives and . For LDPC codes, can be estimated by saddlepoint methods or density evolution β see [?mackay-2003].
Theorem: Union-Bound Bit Error Probability for BICM
Consider a BICM system with binary code of rate and weight enumerator (or input-weight multiplicities for convolutional codes). Under maximum-likelihood decoding on fully-interleaved Rayleigh fading with Gray labelling, the bit error probability is bounded by where is the diversity- PEP given by Thm. 3 of s03. At high SNR this simplifies to
Each pair at Hamming distance contributes , weighted by the number of such pairs () and the expected number of information-bit errors they cause ( per source bit on average). Summing over gives the union bound. At high SNR only the smallest- term matters; at low SNR every term contributes comparably and the bound becomes loose.
Start from the codeword error probability bound and convert the double sum into a sum over pairs at each Hamming distance.
A pair at Hamming distance contributes information-bit errors on average (by the assumption that the all-zero codeword is transmitted, valid for linear codes).
Group terms: in the block-code case, or in the convolutional-code case.
Divide by to convert from codeword-error bits to information-bit error rate.
Step 1: From pairwise to codeword error
For a linear code, the all-zero assumption is WLOG. The union bound over nonzero codewords gives Group by Hamming weight:
Step 2: From codeword error to bit error
Each codeword at distance contains some number of information-bit errors out of the information bits. By the linearity of the code, depends only on the weight and the code's structure, and its expectation over weight- codewords is . Therefore which is the theorem. (In the convolutional-code literature is absorbed into ; the formula there is directly.)
Step 3: High-SNR leading term
By Thm. 3 of s03, at high SNR. The smallest dominates, and The coefficient is the bit-error multiplicity at the error floor; its inverse (in dB) is the coding gain relative to an uncoded reference with the same diversity order.
Example: Union Bound for a Rate-1/2, Convolutional BICM
The rate-, constraint-length- convolutional code has input-weight multiplicities (from the transfer function): Compute the union-bound BER for this code with Gray-16-QAM BICM on fully-interleaved Rayleigh fading at and compare with the leading-term approximation.
Per-$d$ PEP on Rayleigh
Under the MGF-based PEP, the Rayleigh PEP at diversity order with per-channel SNR is (the standard high-SNR Rayleigh result for diversity , [?proakis-2008] Β§14.4). At (linear), .
Evaluate first few terms
- : .
- : .
- : .
- : contribute proportionally smaller amounts.
Union-bound sum (convolutional form, $k = 1$)
\approx 1.4 \times 10^{-11}d = 7$ term.
Leading-term approximation
The leading-term approximation is Within 5% of the full union bound at this SNR. The leading term becomes increasingly accurate as SNR grows, because each higher- contribution decays a full extra power of faster.
Example: Where the Union Bound Goes Loose
A binary code has codewords at minimum distance . On Rayleigh fading with diversity order , compute the union-bound PEP at the SNR where the true codeword error probability is . What does this tell us about the union bound?
Leading-term union bound
Under at high SNR, and ignoring the binomial prefactor for simplicity, At the true operating point, the union bound is not operating in the "high-SNR" regime β numerical evaluation easily gives , which is trivially true but completely uninformative.
Interpretation: waterfall is not where the UB lives
The union bound is exponentially tight at the error floor where the leading-PEP-term dominates. In the waterfall region (where is in the range to ), many codewords contribute comparably and the union-bound addition over-counts by several orders of magnitude. The reader should not trust the union bound to within an order of magnitude until the true BER is below ~.
What to use instead for the waterfall
Tighter bounds β Poltyrev (1994), Divsalar (1999), tangential-sphere bound, or exact density-evolution for LDPC codes β become necessary in the waterfall region. These are developed for BICM in Ch. 7 (error exponents and cutoff rate). For engineering BER targets at and below, the leading-term union bound is adequate.
Common Mistake: The Union Bound Is NOT a BER Estimate at the Waterfall
Mistake:
A very common mistake in BICM performance studies is to plot the leading- PEP-term union bound and read it as an accurate prediction of simulated BER, especially in the waterfall region around to . At these BERs the union bound can exceed (i.e., be trivial) even when the true BER is .
Correction:
Reserve the union bound for the error floor region, typically below BER, where the leading PEP term dominates and the bound becomes exponentially tight. For the waterfall region, use either (a) Monte Carlo simulation, (b) the tighter bounds of Ch. 7, or (c) density evolution / EXIT analysis for specific code families. Never report the union bound as a BER at β it is not a predictor there, only an upper bound (and often a laughably loose one).
Common Mistake: Upper Bound vs Actual: PEP Is , Not BER
Mistake:
When computing the PEP for a specific , students sometimes read this directly as "the BER at that diversity order."
Correction:
is the probability of confusing ONE specific pair of codewords at distance . The BER aggregates over ALL competing codewords (the weighting in Thm. 4) and over information-bit counts. The BER is strictly smaller than a single PEP times the number of codewords only because the union over disjoint events can over-count; it is strictly larger than a single PEP because many pairs contribute. The union bound gives a valid UPPER bound on BER in terms of the PEP.
Union Bound vs Tighter BER Bounds
Compare the leading-term union bound against (a) the full union bound summing for , and (b) a tighter Poltyrev-style bound that is valid in the waterfall region. Adjust and the multiplicity to see how the gap evolves. Key observation: the bounds converge below ~ BER but diverge sharply in the waterfall β which is exactly the regime where simulation is essential.
Parameters
Union-Bound BER Computation for Convolutional BICM
Complexity: β essentially free after the transfer function is computedThe truncation order is chosen so that the tail is a negligible fraction of the dominant term . For Rayleigh fading at moderate SNR (), is safely more than enough. At low SNR (waterfall), one may need (the entire block length), at which point the bound is anyway loose by orders of magnitude and simulation is preferred.
Why This Matters: LDPC Codes and the Union Bound: When It Breaks
For LDPC codes (used in LTE, 5G NR, DVB-S2, Wi-Fi 6), the weight enumerator is effectively random over the ensemble and is typically small (5β15 for practical block lengths). The union bound predicts an error floor at BER , but modern short-block LDPC codes achieve BER at SNRs where the union bound is loose by β dB. The gap between union-bound prediction and simulated BER in the waterfall is closed by density-evolution analysis, which tracks the message distribution iteratively through the Tanner graph. This is the right analytical tool for modern coded systems; the union bound of this section is the foundational framework but not the final tool.
Quick Check
At very high SNR on fully-interleaved Rayleigh fading, the BICM union bound is dominated by which term?
The term at the BLOCK LENGTH
The term at (minimum Hamming distance)
The term at (information dimension)
The term at
, so the smallest gives the slowest decay. At high SNR, .
5G NR: Does the Union Bound Match Simulation?
5G NR uses a base-graph-2 LDPC code of effective rate between and after rate matching. For a representative 1024-bit code block at rate , the simulated BER at dB on Rayleigh fading is , while the union-bound prediction (leading term, for this ensemble) is β off by an order of magnitude. At , simulated BER is and the union bound predicts β within a factor of 2. 3GPP system-level simulators use the union bound for SNR targets in the error floor and Monte Carlo (plus extrapolation from short-block BER tables) in the waterfall. The practical lesson is: use the bound where it is tight, simulate where it is loose.
- β’
Base graph: BG2 for bits, BG1 for larger blocks.
- β’
Rate matching via repetition and puncturing; effective rates to .
- β’
Block length: 40 to 8448 information bits after rate matching.