Union-Bound Analysis of Coded BICM

From a Single PEP to the Bit Error Probability

Theorems 1 and 3 bounded the pairwise error probability P(d)P(d) for two BICM codewords at Hamming distance dd. A real receiver, however, faces ALL competing codewords at once. The passage from "one PEP" to "codeword error probability" is the union bound, and from "codeword error probability" to "bit error probability" is a weight-enumerator weighted sum. In this section we assemble those two bounds into the standard BICM union-bound BER formula, discuss where it is tight (error floor) and where it is loose (waterfall), and flag tighter alternatives (Poltyrev, sphere packing) that will be developed in Ch. 7.

Definition:

Weight Enumerator and Input-Weight Multiplicities

For a binary linear code C\mathcal{C} of block length nn and dimension kk, the weight enumerator is W(z)=βˆ‘d=0nWdzd,Wd=∣{c∈C:wt(c)=d}∣.W(z) = \sum_{d = 0}^n W_d z^d, \qquad W_d = |\{\mathbf{c} \in \mathcal{C} : \text{wt}(\mathbf{c}) = d\}|. For a convolutional code, the relevant quantity for BER analysis is the input-weight enumerator T(z,I)=βˆ‘dcd(I)zd,T(z, I) = \sum_{d} c_d(I) z^d, where cdc_d is the number of information bits in error per weight-dd error event. We write cdc_d for βˆ‘IIβ‹…cd(I)\sum_I I \cdot c_d(I) β€” the total input-weight contribution of weight-dd error events.

For a rate-1/21/2, constraint-length-55 convolutional code (generators (23,35)oct(23, 35)_{\rm oct}), the transfer-function analysis of Viterbi (1971) gives dfree=7d_{\rm free} = 7 and c7=4c_7 = 4. For LDPC codes, WdW_d can be estimated by saddlepoint methods or density evolution β€” see [?mackay-2003].

,

Theorem: Union-Bound Bit Error Probability for BICM

Consider a BICM system with binary code C\mathcal{C} of rate R=k/nR = k/n and weight enumerator WdW_d (or input-weight multiplicities cdc_d for convolutional codes). Under maximum-likelihood decoding on fully-interleaved Rayleigh fading with Gray labelling, the bit error probability is bounded by β€…β€ŠPb≀1kβˆ‘d=dHnWdβ‹…cdβ‹…P(d),β€…β€Š\boxed{\; P_b \le \frac{1}{k} \sum_{d = d_H}^{n} W_d \cdot c_d \cdot P(d), \;} where P(d)P(d) is the diversity-dd PEP given by Thm. 3 of s03. At high SNR this simplifies to Pbβ‰ˆWdHβ‹…cdHkβ‹…P(dH)∝SNRβˆ’dHβ‹…Lmin⁑(ΞΌ).P_b \approx \frac{W_{d_H} \cdot c_{d_H}}{k} \cdot P(d_H) \propto \text{SNR}^{-d_H \cdot L_{\min}(\mu)}.

Each pair at Hamming distance dd contributes P(d)P(d), weighted by the number of such pairs (WdW_d) and the expected number of information-bit errors they cause (cd/kc_d / k per source bit on average). Summing over dd gives the union bound. At high SNR only the smallest-dd term matters; at low SNR every term contributes comparably and the bound becomes loose.

,

Example: Union Bound for a Rate-1/2, dH,free=7d_{H,{\rm free}} = 7 Convolutional BICM

The rate-1/21/2, constraint-length-55 convolutional code has input-weight multiplicities (from the transfer function): c7=4,β€…β€Šc8=12,β€…β€Šc9=20,β€…β€Šc10=72,β€…β€Šc11=225,…c_7 = 4, \; c_8 = 12, \; c_9 = 20, \; c_{10} = 72, \; c_{11} = 225, \ldots Compute the union-bound BER for this code with Gray-16-QAM BICM on fully-interleaved Rayleigh fading at Es/N0=15Β dBE_s/N_0 = 15 \text{ dB} and compare with the leading-term approximation.

Example: Where the Union Bound Goes Loose

A binary code has WdH=106W_{d_H} = 10^6 codewords at minimum distance dH=20d_H = 20. On Rayleigh fading with diversity order dBICM=20d_{\rm BICM} = 20, compute the union-bound PEP at the SNR where the true codeword error probability is 0.50.5. What does this tell us about the union bound?

Common Mistake: The Union Bound Is NOT a BER Estimate at the Waterfall

Mistake:

A very common mistake in BICM performance studies is to plot the leading- PEP-term union bound and read it as an accurate prediction of simulated BER, especially in the waterfall region around 10βˆ’210^{-2} to 10βˆ’410^{-4}. At these BERs the union bound can exceed 11 (i.e., be trivial) even when the true BER is 10βˆ’210^{-2}.

Correction:

Reserve the union bound for the error floor region, typically below 10βˆ’510^{-5} BER, where the leading PEP term dominates and the bound becomes exponentially tight. For the waterfall region, use either (a) Monte Carlo simulation, (b) the tighter bounds of Ch. 7, or (c) density evolution / EXIT analysis for specific code families. Never report the union bound as a BER at 10βˆ’210^{-2} β€” it is not a predictor there, only an upper bound (and often a laughably loose one).

Common Mistake: Upper Bound vs Actual: PEP Is PpairwiseP_{\rm pairwise}, Not BER

Mistake:

When computing the PEP P(d)P(d) for a specific dd, students sometimes read this directly as "the BER at that diversity order."

Correction:

P(d)P(d) is the probability of confusing ONE specific pair of codewords at distance dd. The BER aggregates over ALL competing codewords (the Wdcd/kW_d c_d / k weighting in Thm. 4) and over information-bit counts. The BER is strictly smaller than a single PEP times the number of codewords only because the union over disjoint events can over-count; it is strictly larger than a single PEP because many pairs contribute. The union bound gives a valid UPPER bound on BER in terms of the PEP.

Union Bound vs Tighter BER Bounds

Compare the leading-term union bound Pbβ‰ˆcdHk(4SNR)βˆ’dHP_b \approx \frac{c_{d_H}}{k} (4\text{SNR})^{-d_H} against (a) the full union bound summing cdc_d for d=dH,dH+1,…d = d_H, d_H + 1, \ldots, and (b) a tighter Poltyrev-style bound that is valid in the waterfall region. Adjust dHd_H and the multiplicity cdc_d to see how the gap evolves. Key observation: the bounds converge below ~10βˆ’510^{-5} BER but diverge sharply in the waterfall β€” which is exactly the regime where simulation is essential.

Parameters
10
1

Union-Bound BER Computation for Convolutional BICM

Complexity: O(βˆ£Ξ³βˆ£β‹…D)O(|\gamma| \cdot D) β€” essentially free after the transfer function is computed
Input: Convolutional code transfer function T(z,I)=βˆ‘cd(I)zdT(z, I) = \sum c_d(I) z^d,
diversity order ddiv=dHLmin⁑(μ)d_{\rm div} = d_H L_{\min}(\mu), SNR grid {γi}\{\gamma_i\},
labelling ΞΌ\mu, truncation order DD.
Output: Union-bound BER curve Pb(Ξ³)P_b(\gamma).
1. For each Ξ³\gamma in the grid:
2. Pb←0\quad P_b \leftarrow 0
3. \quad for d=dH,dH+1,…,Dd = d_H, d_H + 1, \ldots, D do
4. \qquad Compute diversity-dd PEP: P(d)←(2dβˆ’1d)(4Ξ³)βˆ’dLmin⁑(ΞΌ)P(d) \leftarrow \binom{2d-1}{d} (4\gamma)^{-d L_{\min}(\mu)}
5. \qquad Lookup cdc_d from transfer function
6. Pb←Pb+cdβ‹…P(d)/k\qquad P_b \leftarrow P_b + c_d \cdot P(d) / k
7. \quad end for
8. \quad Record (Ξ³,Pb)(\gamma, P_b)
9. Return: curve {(Ξ³i,Pb,i)}\{(\gamma_i, P_{b,i})\}

The truncation order DD is chosen so that the tail βˆ‘d>DcdP(d)\sum_{d > D} c_d P(d) is a negligible fraction of the dominant term cdHP(dH)c_{d_H} P(d_H). For Rayleigh fading at moderate SNR (>10Β dB> 10 \text{ dB}), D=dH+10D = d_H + 10 is safely more than enough. At low SNR (waterfall), one may need D=nD = n (the entire block length), at which point the bound is anyway loose by orders of magnitude and simulation is preferred.

Why This Matters: LDPC Codes and the Union Bound: When It Breaks

For LDPC codes (used in LTE, 5G NR, DVB-S2, Wi-Fi 6), the weight enumerator WdW_d is effectively random over the ensemble and dHd_H is typically small (5–15 for practical block lengths). The union bound predicts an error floor at BER ∼10βˆ’4\sim 10^{-4}, but modern short-block LDPC codes achieve 10βˆ’510^{-5} BER at SNRs where the union bound is loose by 33–55 dB. The gap between union-bound prediction and simulated BER in the waterfall is closed by density-evolution analysis, which tracks the message distribution iteratively through the Tanner graph. This is the right analytical tool for modern coded systems; the union bound of this section is the foundational framework but not the final tool.

Quick Check

At very high SNR on fully-interleaved Rayleigh fading, the BICM union bound Pb≀1kβˆ‘dWdcdP(d)P_b \le \frac{1}{k} \sum_d W_d c_d P(d) is dominated by which term?

The term at the BLOCK LENGTH d=nd = n

The term at d=dHd = d_H (minimum Hamming distance)

The term at d=kd = k (information dimension)

The term at d=Lmin⁑(μ)d = L_{\min}(\mu)

⚠️Engineering Note

5G NR: Does the Union Bound Match Simulation?

5G NR uses a base-graph-2 LDPC code of effective rate between 0.30.3 and 0.950.95 after rate matching. For a representative 1024-bit code block at rate 1/21/2, the simulated BER at Es/N0=3E_s/N_0 = 3 dB on Rayleigh fading is ∼10βˆ’2\sim 10^{-2}, while the union-bound prediction (leading term, dHβ‰ˆ10d_H \approx 10 for this ensemble) is ∼10βˆ’1\sim 10^{-1} β€” off by an order of magnitude. At 6Β dB6 \text{ dB}, simulated BER is ∼10βˆ’5\sim 10^{-5} and the union bound predicts ∼2Γ—10βˆ’5\sim 2 \times 10^{-5} β€” within a factor of 2. 3GPP system-level simulators use the union bound for SNR targets in the error floor and Monte Carlo (plus extrapolation from short-block BER tables) in the waterfall. The practical lesson is: use the bound where it is tight, simulate where it is loose.

Practical Constraints
  • β€’

    Base graph: BG2 for k≀3840k \le 3840 bits, BG1 for larger blocks.

  • β€’

    Rate matching via repetition and puncturing; effective rates 0.330.33 to 0.920.92.

  • β€’

    Block length: 40 to 8448 information bits after rate matching.

πŸ“‹ Ref: 3GPP TS 38.212, Β§5.3.2