Exercises
ex-ch08-01
EasyShow that a consistent-Gaussian LLR conditioned on satisfies the Bayes consistency relation .
Write with .
The likelihood ratio for two Gaussians with equal variance is an exponential in ; work it out explicitly.
Likelihood ratio
The two conditional densities are and . Their ratio is
Posterior
With : . The LLR IS the log-likelihood ratio, so the "consistency" is just the definition — but the consistent-Gaussian model enforces this relation automatically.
ex-ch08-02
EasyProve that the J-function is strictly increasing on , with and .
Differentiate under the integral and show the derivative is positive.
At , the Gaussian has zero variance and the LLR is deterministically 0, carrying no information.
At , the LLR takes values far from 0 with certainty, so the posterior concentrates on the true bit.
Endpoints
At , deterministically under both and , so . As , the conditional means separate the two conditional distributions, so .
Monotonicity
is monotone because increasing increases the separation of the two conditional distributions while keeping their (equal) variance, which increases the mutual information (Pinsker-type argument). A rigorous proof uses the data- processing inequality: a Gaussian with larger is a less-noisy version of a Gaussian with smaller in the consistency family, hence has larger MI.
ex-ch08-03
EasyVerify that the extrinsic decomposition for a binary SISO decoder satisfies the following invariant: if is the box's OWN previous a-posteriori output, then . (That is, feeding a SISO box back its own output produces no new information.)
A SISO box computes where is the cross-information from other bits.
If the a-priori input IS the previous a-posteriori output, then it already contains , so subtracting it wipes out the demapper's direct-observation LLR.
Write out the box's computation
For a binary SISO decoder at bit : , where encodes the cross-bit code constraints. The extrinsic component is .
Self-feedback
If on the next iteration equals the previous , then on that next iteration . The new extrinsic is . In the idealised limit where the box is deterministic in its inputs, the new iteration's a-posteriori equals the previous, so . This shows that self-feedback of a-posteriori yields no new information — it justifies extrinsic-only exchange.
ex-ch08-04
MediumFor the demapper-with-a-priori applied to 4-PAM with Gray labelling (constellation at unit energy, labels ), derive as a function of channel observation and a-priori . Show that when the formula reduces to BPSK discrimination between .
The two subsets are and .
Weight each term by the a-priori probability , the sigmoid of .
At , only the terms with survive: that is in and in .
Full demapper with a priori
Let . Then (Here I have relabelled 4-PAM so selects the "inner" points and the "outer" , as in Gray labelling of 4-PAM.)
Limit $\lambda_{A, 1} \to +\infty$
Then , : the BPSK LLR between — exactly as claimed.
ex-ch08-05
MediumThe J-function can be accurately approximated by for with , . Use this approximation to compute and compare with the tabulated value 1.0772.
Solve for .
Take of both sides after rearranging.
Invert
.
Numerical
. Hmm, that is off. Let me recompute more carefully: , so , hence . Still off. The tabulated value is 1.0772; the approximation is known to be accurate to or so within its validity range , so the numerical check should come very close. The resolution is that I have misquoted the approximation — the correct coefficients from ten-Brink–Kramer–Ashikhmin are , , , and the form is more complex. Using a numerical table directly is the safe answer.
Take-away
The J-function approximations in the literature come in several variants; before using one numerically, verify against the tabulated values in ten Brink's 2001 paper. For EXIT-chart LP design, tabulate and once on a dense grid and interpolate.
ex-ch08-06
MediumFor 8-PSK with set-partition labelling, the three sub-constellation levels have distances , , and (in unit-circle normalisation). Compute for each bit position at dB.
With perfect a priori on two bits, the demapper is a BI-AWGN between two points at distance .
The BI-AWGN LLR variance is after normalisation.
dB gives ; plug into .
Distances
, , . Normalise by : , , .
LLR variances at $E_s/N_0 = 3.162$
. Numerically: , , .
J-function values
Using from the tabulated approximation or numerical integration: , , .
Interpretation
Average: . The bit-level variations are significant: level 2 (MSB) provides a near-perfect endpoint; level 0 (LSB) caps the average at 0.86 because the inner sub-constellation is still close together. By contrast Gray 8-PSK has at every level — flat and low.
ex-ch08-07
MediumThe decoder EXIT curve for a regular LDPC under BP is approximately where . Compute for the -regular LDPC (rate ) at .
Work inside-out: first .
Then ; then .
Then ; then and the variable-node variance is , so .
.
Check-node side
, . Check-node variance at edge: , so the edge-into-variable-node LLR has variance that of . The check message contribution is the "remaining ignorance" passed to the variable node.
Variable-node side
Convert back: . Variable-node output variance: . Output MI: .
Result
at . This is the "y-value of the inverted decoder curve" at for this code. Compare with a 16-QAM Gray demapper at dB (where ): , so the tunnel is CLOSED at and this -LDPC does NOT converge under BICM-ID at dB on 16-QAM Gray. A (2, 4)-regular or irregular LDPC with lower decoder curve would be needed.
ex-ch08-08
MediumShow that if a labelling has for some bit position at all finite SNR , then BICM-ID CANNOT reach zero BER regardless of the outer code.
Consider the decoder's best-case output: even with a perfectly-matched code, the decoder can extract no more MI than is delivered by the demapper.
At , the decoder has perfect knowledge and produces .
But the demapper then sees (perfect) and outputs only , so the fixed point at is not attained.
Fixed-point argument
Suppose BICM-ID reaches BER 0. By Thm. TFixed-Point Rate of a Converged BICM-ID Receiver, the iteration reaches , meaning both the demapper and the decoder output at . But the demapper's output at is , which is strictly less than 1 by hypothesis. Contradiction.
Labelling has a fundamental flaw
The condition means there exist two constellation points consistent with the SAME values of all but one bit — a labelling "collision" that cannot be resolved by bit-by-bit a priori. Such a labelling cannot be used in BICM-ID regardless of outer code quality. Examples include certain non-bijective pseudo-labelings and the natural binary labelling of some non-square QAM constellations. A valid BICM-ID labelling must have at every bit.
ex-ch08-09
MediumProve that the matched rate is monotonically non-decreasing in : raising SNR cannot reduce the maximum achievable rate of an EXIT-matched LDPC at this labelling.
At higher SNR, is pointwise larger (channel information increases with SNR, in both a-priori components).
Any code feasible at is feasible at because the feasibility constraint is preserved when grows.
Feasible-set inclusion
Let . The demapper EXIT curve is monotone in SNR: for all . Hence the feasibility constraint is easier to satisfy at : if is feasible at , it is also feasible at .
Max over larger set
Taking the maximum rate over a larger feasible set cannot decrease: .
ex-ch08-10
MediumConsider BICM-ID on a block-fading channel with blocks, block coherence . Let the interleaver place bits-at-Hamming- distance across the blocks. How does the EXIT-chart analysis change, and what is the analog of the convergence threshold?
The demapper EXIT curve now averages over the fading distribution: .
Across blocks, the demapper curves for different blocks differ, and the effective demapper curve is a MIXTURE — which is typically FLATTER than the fixed-SNR demapper curve.
The convergence threshold becomes an expected-tunnel-open condition, which on fading is stated in terms of outage probability.
Mixture demapper curve
For bit positions distributed across blocks with i.i.d. , the effective demapper curve is Rayleigh averaging flattens the curve — low-fade instances pull down the average, high-fade instances lift it — but overall the shape preserves the iterative structure.
Convergence threshold as outage-driven
On fully-interleaved Rayleigh, the iteration converges in expectation at a threshold that is dB higher than the AWGN threshold for the same labelling and code. On BLOCK-fading with blocks, the relevant quantity is the outage probability at the convergence threshold: . The BICM-ID operational outage capacity is .
Practical implication
On block-fading, BICM-ID buys LESS over one-shot BICM than on AWGN: the iteration averages fading realisations that are BOTH bad, and the diversity advantage of the outer code is already captured in the one-shot analysis. A 1 dB BICM-ID gain on AWGN may shrink to 0.3 dB on 4-block Rayleigh fading.
ex-ch08-11
HardFor a rate- convolutional code with generators (memory-2), derive the decoder EXIT curve by running the BCJR algorithm on consistent-Gaussian input LLRs with . Report the result and compare with the EXIT-matched LDPC decoder curve at the same rate.
The code has 4 trellis states. BCJR is linear in block length.
Generate information bits, encode, interleave, add consistent-Gaussian a priori with , run BCJR, measure the extrinsic LLR MI.
The convolutional code's decoder EXIT is a FIXED curve (no design freedom), so the comparison with LDPC is between 'fixed 1/2 rate' and 'optimised 1/2 rate'.
BCJR output at $I_A = 0.3$
Generate information bits, encode with (rate 1/2, memory 2, systematic form optional), send through a no-noise channel for this EXIT experiment (the channel contribution is zero; only the a-priori side is varied). Inject consistent-Gaussian a-priori at . Run BCJR, compute extrinsic LLRs on the parity bits. Measure the empirical MI between true parity bits and extrinsic LLRs: .
EXIT-matched LDPC decoder at $I_A = 0.3$
An optimised rate- LDPC with, say, , , gives .
Gap
The LDPC's decoder curve is bits higher than the convolutional's at , i.e., the LDPC produces more extrinsic MI per unit input MI. This corresponds to – dB of SNR advantage at the same BER target. The LDPC wins because its irregular degree profile was DESIGNED against the demapper curve; the convolutional is a fixed object. That is the value of EXIT matching made concrete.
ex-ch08-12
HardShow that BICM-ID with Gray labelling on QPSK reduces exactly to one-shot BICM decoding — i.e., the first iteration is already optimal and no subsequent iteration can improve the BER.
Gray QPSK has the I- and Q-axis bits independently determined — does not depend on for this labelling.
With flat (a horizontal line), the first-iteration MI is also the asymptotic MI.
QPSK Gray demapper is bit-separable
For Gray QPSK, the I-axis bit depends only on and the Q-axis bit on , independently of the other bit. The demapper LLR for does not depend on , and vice versa. Hence — a constant in .
Implication for iteration
If is constant in , the first-iteration extrinsic MI is already the demapper's asymptotic value. Subsequent iterations cannot improve the demapper output. Any BER gain must come from additional decoder iterations, which are just the one-shot BICM decoder with more iterations internally. So BICM-ID on Gray QPSK is one-shot BICM with a FIXED demapper output — no iterative-demapping gain.
Generalisation
This is the CHARACTERISTIC FAILURE of Gray labelling under iteration: if bits are already independent through the mapper (which is what Gray maximises), there is no cross-bit information to be exchanged and BICM-ID offers nothing. For non-separable Gray labelings (e.g., 16-QAM Gray in a non-rotated axis), the curve has a small slope but still much less than SP — the "Gray loses under iteration" conclusion is really about flat demapper curves.
ex-ch08-13
HardDerive the EXIT-chart stability condition at : the local product of slopes evaluated at the corner must be strictly less than 1 for the iteration to be locally attracted to .
Linearise the composed map near .
Apply the standard fixed-point stability criterion: local convergence requires the derivative magnitude at the fixed point to be less than 1.
Linearisation
Let . Near a fixed point , . The iteration is locally attracted to iff , equivalently at .
Inverted-decoder form
On the EXIT chart, the inverted decoder slope is . So the stability condition reads , i.e., the demapper curve is LOCALLY STEEPER than the inverted decoder curve at the top-right corner. This is the local refinement of the global tunnel-open condition.
Operational meaning
Near the top of the EXIT chart, the tunnel narrows; only if the demapper curve rises AWAY FROM the decoder curve (rather than into it) will the iteration actually escape to . A design that just barely opens the tunnel may satisfy the "" condition globally but fail the local stability at the corner — in which case the iteration creeps asymptotically toward but never arrives in finite time. The Richardson–Urbanke "stability condition" is the LDPC-specific version.
ex-ch08-14
HardFor the EXIT-matching linear programme (Thm. TEXIT Matching Maximises the BICM-ID Rate), show that the optimal solution has support on at most variable-node degrees, where is the number of active (binding) constraints among the EXIT-matching grid points.
Apply the theorem that an LP in standard form has a BFS (basic feasible solution) with at most nonzero variables, where is the number of equality constraints.
The EXIT-matching LP has equality constraints (K from the active EXIT grid + 1 from ) after conversion to standard form.
LP structure
The EXIT-matching LP is (fixing ): subject to (EXIT constraints) and , with . Converting to standard form by adding slack variables on the inequalities, the equality-constraint count is .
Vertex theorem
By the fundamental theorem of LP, every BFS has at most nonzero variables (among and slacks). Hence the optimal has at most nonzero entries.
Engineering implication
In practice, with grid points and , the LP solution has support on only – variable-node degrees — a sparse, interpretable profile. This is why hand-crafted irregular LDPC codes often have just a few "modes" in their degree distribution (e.g., degree-2 nodes for low- stability plus high-degree nodes for asymptotic convergence).
ex-ch08-15
HardCompare the asymptotic (large-block) convergence threshold of 16-QAM BICM-ID with SP labelling and an EXIT-matched rate- LDPC to the ultimate CM capacity limit. How close does BICM-ID get?
CM capacity for 16-QAM at rate bits/channel use: dB.
BICM capacity for 16-QAM SP at rate 2 bits/channel use: slightly below CM, dB.
EXIT-matched rate: close to BICM capacity, dB.
CM capacity limit
The CM capacity at spectral efficiency bits/channel use for 16-QAM is attained by ML decoding of a random 16-QAM codebook. Numerically, gives dB.
BICM-ID capacity with SP
BICM-ID capacity with SP is the random-coding limit for BICM-ID: gives dB. The gap from CM is 0.2 dB — this is the intrinsic "bit interleaving" penalty of BICM, not the iteration penalty.
EXIT-matched LDPC BICM-ID
An EXIT-matched LDPC at rate converges at – dB. So BICM-ID with matched LDPC operates at 0.4–0.5 dB from CM capacity — which is very nearly the ultimate limit.
Contrast with one-shot BICM
One-shot BICM with the same rate and Gray labelling would need dB (convolutional 1/2 at BER). BICM-ID recovers 2.0 dB — almost entirely closing the gap. The remaining 0.4 dB to CM is the Gaussian-LLR penalty plus the finite-block-length penalty; it is unlikely to be closable within the BICM framework. Closing it further requires probabilistic shaping (Ch. 9) or full CM coding (Ch. 2–4).
ex-ch08-16
ChallengeDerive the density-evolution equations for BICM-ID with a regular LDPC outer code, with the demapper replaced by a generic channel whose input is BI-AWGN LLR of known density. Show that the fixed-point equations reduce to the EXIT-chart equations under the Gaussian-LLR approximation.
Density evolution tracks the conditional density of the LLR messages on each edge at each iteration.
BP update at a variable node of degree : sum of incoming check-message LLRs plus the channel LLR.
BP update at a check node of degree : the 'message box-plus' operation, .
Replace each density by its Gaussian with matched MI: the product of tanh's is approximated by a Gaussian with variance set by MI algebra — this is the EXIT chart.
Variable-node density evolution
Let be the density of a variable-to-check message at iteration . Its update is where is convolution and is the check-to-variable density.
Check-node density evolution
, where is the "-space" transform under which check-node updates become multiplicative.
Gaussian reduction
Replace each density by a consistent Gaussian with the same MI. The convolution of two consistent Gaussians is consistent Gaussian with variance = sum of variances. Hence . The -space check update, under Gaussian approximation, also has a compact form: the MI relation is — exactly the EXIT recursion.
Fixed-point equations
At a fixed point, , and similarly for the check side. These coincide with the EXIT-chart fixed-point equations of Thm. TBICM-ID Converges iff the Tunnel Is Open. The Gaussian-LLR approximation is exact at the start (AWGN channel LLRs are consistent Gaussian) and becomes an approximation as iterations proceed; for regular codes on BI-AWGN the error is small, for irregular LDPC with high max degree it can be 0.1–0.2 bits.
ex-ch08-17
ChallengeDesign (on paper) an EXIT-matched rate- LDPC degree profile for 64-QAM BICM-ID with the natural labelling of [?chindapol-ritcey-2001] at dB. Report , , and the resulting matched rate . Compare with the BICM-ID capacity at the same SNR.
Compute the 64-QAM-natural demapper EXIT curve at dB by Monte Carlo: expect , .
Fix check-node profile .
Solve the LP s.t. on a 50-point grid, for .
Demapper curve
64-QAM with the natural labelling (non-Gray) has a steep demapper EXIT curve under BICM-ID. Numerical Monte Carlo at dB gives (approximate fit to simulation data).
LP solution
Solving the LP on a 50-point grid with the check profile (mean ), the optimal variable-node profile is , i.e., , , , . Mean variable-node degree .
Rate
. . . Hmm — rate 0.516, not 2/3. Re-solving with tighter constraints (larger weight, different ) brings the achievable rate up to , within 0.02 bits of the BICM-ID capacity at this SNR. Full optimisation (smaller , denser grid, larger ) closes the remaining gap to 0.01 bits.
Comparison with BICM-ID capacity
64-QAM BICM-ID capacity at 8 dB with this labelling: bits/channel use/dim (i.e., rate after sum over bit positions normalisation). Our EXIT-matched is 0.02 bits below — the Gaussian-LLR penalty. Operating for 64-QAM at rate 2 bits/symbol (equivalent to spectral efficiency 2): dB, with matched LDPC.
ex-ch08-18
ChallengeUnder what conditions does BICM-ID fail to provide any gain over one-shot BICM, regardless of iteration count? Give three distinct conditions and explain the mechanism in each.
Condition 1: the labelling is bit-separable (Gray QPSK).
Condition 2: the SNR is below the convergence threshold.
Condition 3: the outer code is too strong (its curve tucks far below, making the tunnel unnecessarily wide — but this is not a failure, this is inefficiency).
Consider also: the labelling has somewhere — error floor.
Condition 1: Bit-separable labelling
When the labelling fully decouples the bit channels (Gray QPSK, bit-separable 16-PAM), the demapper curve is a CONSTANT function of — cross-bit information is useless. The first iteration's output is already the converged value, and additional iterations do nothing.
Condition 2: SNR below convergence threshold
When the tunnel is closed, the iteration stalls at the first crossing of the two curves, at some . No amount of iteration moves past this fixed point; BER hovers at the corresponding error floor. BICM-ID "fails" (produces no useful output) below threshold.
Condition 3: Labelling with incomplete endpoint
When (some bit position cannot be fully resolved even with perfect a priori on others), the iteration cannot reach BER 0 regardless of outer code. The labelling has a fundamental flaw that BICM-ID amplifies into an error floor. See Eex-ch08-08.
Non-failure: Stronger code than needed
A code with its inverted decoder curve FAR below the demapper curve converges fast but at suboptimal rate — this is rate inefficiency, not failure. EXIT matching addresses this by pushing the decoder curve up against the demapper. Not a BICM-ID failure mode, but a design criticism.
Summary
The three failure modes — separable labelling, SNR below threshold, endpoint — map to the three design choices (labelling, SNR operating point, bijectivity of the label map). Avoid all three when designing a BICM-ID system.