Practical Complexity and Historical Arc

The Arc from 1977 to DVB-S2

Having laid out the capacity-theoretic picture, we close the chapter by taking stock of MLC as an engineering proposition. The theory says MLC/MSD achieves the constellation capacity CCMC_{\rm CM}; Gray-BICM achieves almost as much at a fraction of the complexity. We now survey what it actually takes to deploy MLC, why standards bodies consistently chose BICM, and in which niches MLC still wins. The section ends with a brief historical arc: how the idea travelled from Imai and Hirasawa's 1977 paper to the capacity-approaching LDPC designs of the 2000s.

, ,
⚠️Engineering Note

Designing the LL Binary Codes for MLC

Each of the LL binary codes in MLC must be designed for a different binary sub-channel. The top-level channel is the worst (low effective SNR, low capacity C0C_0), so level 0 gets a low-rate, powerful code. The bottom-level channel is the best, so level L1L - 1 gets a near-rate-1 code. Practical code design proceeds as follows:

  1. Compute CiC_i for each level at the target operating SNR (the capacity rule of s02).
  2. For each level, select an LDPC or polar code whose design threshold matches CiC_i. For LDPC, this usually means varying the degree distribution; for polar codes, the bit-channel polarisation table.
  3. Optimise the interleaver between levels (if any) and the bit-mapping to the constellation, to control error propagation.
  4. Verify error-floor behaviour below the target (10910^{-9} or better in commercial systems), since MSD error propagation is especially sensitive to low-weight codewords at the top levels.

The result is LL distinct codes that must be stored, loaded, and decoded per modulation format. Multiply by the 10\sim 10 modulations in a modern standard and one ends up with 40\sim 40 codes — a substantial codebook compared with BICM's 10\sim 10.

Practical Constraints
  • Code-rate granularity usually 1/501/50 or finer to match CiC_i closely

  • Decoder throughput budget split LL ways for sequential MSD

  • Error-floor target: 10910^{-9} or better for commercial broadcast; requires low-weight codeword control at every level

,
🚨Critical Engineering Note

Why BICM Dominates Modern Wireless Standards

Every wireless standard since DVB-S2 (2004) has chosen BICM over MLC: DVB-S2, DVB-T2, LTE, 5G NR, Wi-Fi 6/7. The technical reasons:

  1. A single code covers every modulation. In 5G NR, one LDPC base graph handles QPSK, 16-QAM, 64-QAM, and 256-QAM simply by puncturing to the required rate. With MLC one would need a separate code (or separate rates within a code) for each of 2,4,6,82, 4, 6, 8 levels — i.e.
    roughly 2+4+6+8=202 + 4 + 6 + 8 = 20 distinct code configurations instead of one.
  2. Rate adaptation is trivial. Adaptive modulation and coding (AMC) in 5G picks a new MCS every slot (0.5 ms). BICM adapts by adjusting a single code rate. MLC would require adjusting LL rates simultaneously, with the capacity rule re-run for every new SNR point.
  3. Iterative decoding closes the BICM gap. BICM-ID (iterative demapping + decoding) iterates between the demapper and the binary decoder, and recovers most of the residual CM-vs-BICM capacity gap. With this post-hoc improvement there is essentially no capacity reason left to pay the MLC complexity.
  4. Interleaver depth controls fading. In wireless channels with frequency-selective fading, a deep bit interleaver maps the LL label positions to independent fading realisations — this is the original motivation of BICM (Zehavi 1992) and it is incompatible with row-column MSD structure.

MLC still wins for satellite links with known-fixed channel conditions and stringent throughput targets (where the small capacity gain is worth the complexity), and for lattice-based schemes in Ch. 4 where partition-based labelling is natural.

Practical Constraints
  • 5G NR LDPC: base graph 1 (long packets) or base graph 2 (short) — single structure adapted by rate matching

  • Rate matching grid: 1/5 to 8/9 in ~20 steps

  • DVB-S2: 28 MODCOD points, all single-LDPC-code BICM

📋 Ref: 3GPP TS 38.212 §5.3.2 (LDPC base graphs); ETSI EN 302 307 §5 (DVB-S2 MODCOD)
, ,

Historical Note: Imai and Hirasawa (1977): The Original MLC Paper

1977

Hideki Imai and Shuji Hirasawa of Yokohama National University published "A new multilevel coding method using error-correcting codes" in the IEEE Transactions on Information Theory in 1977 — five years before Ungerboeck's TCM paper. The paper introduced the idea of using separate binary codes per level of a set-partitioning chain, combined with a partition-based labelling. Imai and Hirasawa analysed the scheme for small M-PSK constellations and demonstrated explicit coding gains of 3344 dB over uncoded reference constellations.

Historically, the paper received far less attention than Ungerboeck's TCM five years later. Two reasons. First, in 1977 the binary coding toolchain was limited — convolutional codes with Viterbi decoding were just becoming practical, turbo codes were twenty years away. Without strong binary codes the asymptotic capacity-approaching promise of MLC could not be demonstrated in hardware. Second, TCM offered a single trellis code that was easier to standardise and decode with the same Viterbi machinery then used for voiceband modems.

Imai and Hirasawa's insight had to wait for Wachsmann, Fischer, and Huber in 1999 to be connected back to the chain-rule capacity argument and to modern binary codes. By then, the industry was already moving toward BICM.

Historical Note: Wachsmann, Fischer, and Huber (1999): The Capacity Rule, Formalised

1999

Udo Wachsmann, Robert F.\ H.\ Fischer, and Johannes B.\ Huber of the Erlangen–Nürnberg school published "Multilevel codes: theoretical concepts and practical design rules" in IEEE Trans.\ Information Theory in 1999. The paper is the definitive reference for MLC/MSD. Its main contributions:

  • Formal proof of the capacity rule C=iCiC = \sum_i C_i via the chain rule, with rigorous converses.
  • Design tables showing for each constellation (8-PSK, 16-QAM, 32-PSK, 64-QAM) and target η\eta the capacity-rule rate allocation and the required binary code rates.
  • Quantitative comparison with BICM, establishing the <0.5< 0.5 bit gap under Gray labelling that this chapter summarises.
  • Error-propagation analysis, showing that at the capacity operating point propagation is negligible (our Thm thm-msd-error-propagation is a simplified form of their bound).
  • Practical code-design recipes for combining MLC with then-new turbo codes and early LDPC constructions.

The paper is a model of how to take a 20-year-old idea, connect it to the contemporary coding toolbox, and convert information-theoretic truths into engineering practice. Reading it alongside this chapter is strongly recommended.

Decoder Complexity Summary: MLC/MSD vs BICM

ResourceMLC/MSDBICM (single code)BICM-ID (iterative)
Number of binary decodersL=log2ML = \log_2 M11
Number of code designs storedLL per modulation1 per modulation1 per modulation
Sequential vs parallelSequential (latency-limited)Parallel-friendlySequential (but few outer iterations)
Demapper per passSum over 2Li2^{L-i} pointsSum over 2L12^{L-1} points (per bit)Sum over 2L12^{L-1} points (per bit)
Adaptation to new SNRReselect LL ratesReselect 1 rateReselect 1 rate
Capacity achievedCCMC_{\rm CM}CBICMC_{\rm BICM} (Gray) CCM\le C_{\rm CM}Closes most of the gap to CCMC_{\rm CM}
Error propagationYes, across stagesNone (independent bits)Controlled by iterations

Why This Matters: When MLC Still Wins: APSK and Lattices

Two niches preserve MLC's relevance in modern systems:

  1. Amplitude–phase shift keying (APSK). DVB-S2X and higher-SNR satellite links use APSK constellations (e.g.\ 16-APSK, 32-APSK) that do not admit a good Gray labelling — the ring structure creates nearest-neighbour conflicts across amplitude levels. On these constellations, MLC's partition-based labelling recovers the sub-bit-per-dim that BICM leaves on the table.
  2. Lattice coded modulation. Chapter 4 treats coset codes over lattices, where the partition chain is naturally tied to a sub-lattice sequence Λ0Λ1\Lambda_0 \supset \Lambda_1 \supset \cdots. MLC is the native framework here: each level codes for a cosets-of- Λi\Lambda_i-in-Λi1\Lambda_{i-1} index. The capacity rule of this chapter is what we apply in the lattice setting, where the CiC_i's are binary capacities of the quotient channels.

In both niches the extra complexity of MLC pays off. In the QAM workhorse regime, BICM remains the practical winner.

,

Common Mistake: "MLC is an obsolete idea" — no

Mistake:

Concluding, from the absence of MLC in wireless standards, that the construction is obsolete.

Correction:

MLC is not obsolete; it is specialised. It remains the right framework for (i) lattice-coded modulation (Ch. 4), (ii) APSK on satellite links, (iii) low-latency scenarios where the LL-way pipelining of MSD is beneficial, and (iv) as the conceptual scaffolding for understanding why BICM works as well as it does (the capacity rule is the yardstick). In textbook terms: TCM and MLC together teach us how coded modulation can reach capacity; BICM then teaches us the price to pay for giving up some of that capacity in exchange for a simpler decoder. All three deserve chapters.

Quick Check

Which of the following is not a legitimate reason that modern wireless standards choose BICM over MLC/MSD?

A single binary code can cover every modulation size, simplifying the codebook

Gray-BICM is within a fraction of a dB of CM capacity for QAM

MLC cannot achieve the CM capacity even in the limit of large block lengths

Iterative demapping (BICM-ID) closes most of the residual gap

Key Takeaway

MLC is theoretically optimal, operationally niche. The capacity rule of this chapter establishes that MLC/MSD can reach CCMC_{\rm CM} exactly — a clean theoretical result. In practice, Gray-BICM captures almost all of that capacity at a fraction of the complexity, and wins in every major wireless standard since DVB-S2. MLC remains the natural choice for lattices and APSK, and is indispensable as the conceptual yardstick against which BICM's simplicity is measured. The golden thread of this chapter is that the chain rule of mutual information both enables MLC and, once we drop the conditioning, quantifies BICM's price.