Exercises
ex-cm-ch01-01
EasyCompute the Shannon-limit (in dB) at bits/2D, bits/2D, and bits/2D.
Use .
Convert the linear value to dB by .
Apply the Shannon formula
- : , which is dB.
- : , which is dB.
- : , which is dB.
The additional dB per extra bit/2D at high is visible in the spacing of these values.
ex-cm-ch01-02
EasyCompute for (a) BPSK, (b) 4-PAM (points normalized to unit average energy), and (c) 8-PSK of unit average energy.
For an -PAM of points , the average energy before normalization is .
For -PSK of unit energy, adjacent points are at distance .
(a) BPSK
, . Ratio .
(b) 4-PAM of unit energy
Raw energy of is . Scale each point by to get unit energy; then and . Ratio .
(c) 8-PSK of unit energy
Adjacent points at distance ; so and the ratio is .
Interpretation
BPSK has the largest but the lowest rate. Higher-order constellations trade minimum-distance ratio for higher : from BPSK's to 8-PSK's is a dB loss of normalized distance in exchange for 2 extra bit/2D.
ex-cm-ch01-03
EasyA system achieves at dB while transmitting at bits/2D. How far is this from the Shannon limit?
Evaluate and convert to dB.
Shannon limit at $\eta = 2$
, which is dB.
Gap
dB above capacity. This is a textbook near-capacity result, consistent with a modern LDPC+QPSK system at moderate block length.
ex-cm-ch01-04
EasyState the Chernoff upper bound and use it to produce an upper bound on the pairwise error probability on AWGN in terms of and .
Substitute with .
Substitute into the Chernoff bound
With and ,
ex-cm-ch01-05
EasyFor 16-QAM of energy (standard integer grid), compute directly and verify the formula from Section 1.
The standard 16-QAM grid is .
Energy of the grid
The 16 points have average energy .
Minimum distance
Adjacent points differ by in one coordinate: .
Check the formula
.
ex-cm-ch01-06
MediumStarting from the Shannon capacity formula and the identity , derive the Shannon-limit curve and show that for all .
Invert and substitute.
For the monotonicity, take the derivative with respect to and show its sign.
Derivation
From , reliable communication requires . Using , divide by : . The minimum is .
Monotonicity
Let . Compute . The numerator at equals (both terms vanish), and its derivative is , so the numerator is strictly positive for . Hence , and the Shannon-limit curve is strictly increasing in .
ex-cm-ch01-07
MediumAn AWGN PEP is . Using the upper and lower bounds (valid for ), show that on AWGN, grows as plus lower-order terms as SNR .
Take of both bounds and show they coincide to leading order in the SNR.
Identify and note as .
Take logs
With ,
Leading-order SNR scaling
As (i.e., high SNR), the first term dominates; the logarithmic term grows only as and is subexponential. Hence , confirming that minimum Euclidean distance sets the asymptotic error exponent.
ex-cm-ch01-08
MediumFor 8-PSK at energy , compute and the number of nearest neighbors per point. Then apply the high-SNR union-bound approximation to estimate the (dB) needed for .
For -PSK, and each point has 2 nearest neighbors.
Solve for and invert to .
Nearest-neighbor geometry
For 8-PSK, ; . Each point has nearest neighbors.
Solve for SNR
Set , so , giving . Then with :
which is dB. (In practice, uncoded 8-PSK requires about dB of for , matching closely.)
ex-cm-ch01-09
MediumCompute the CM capacity of a binary antipodal constellation () on the AWGN channel at SNR (in natural log), and verify that as it matches the Shannon capacity nats up to .
The CM capacity of BPSK is for uniform.
Use and expand in SNR.
At low SNR, BPSK is Gaussian-like and the mutual information approaches Shannon.
BPSK mutual information
For BPSK in AWGN with SNR (variance normalized to ):
(An equivalent form uses and the J-function; any derivation is acceptable.)
Low-SNR expansion
Expand the integrand to second order in : nats. The Shannon capacity is nats. The two match to first order, confirming that BPSK is capacity-optimal at low SNR.
High-SNR saturation
At high SNR, bit (since there are only 2 signals to disambiguate), while Shannon capacity diverges. This is the shaping / cardinality limit: BPSK can carry at most 1 bit per channel use.
ex-cm-ch01-10
MediumProve that the shaping gain is independent of the choice of code, i.e., that swapping an LDPC code for a polar code for the same does not change the upper bound dB imposed by the shaping of the underlying QAM.
The shaping loss is a property of the input distribution , not of .
Show that CM capacity depends only on the distribution over , and uniform input is the worst-case.
Shaping loss is a function of $P_X$
CM capacity at SNR is under the given input distribution . Uniform on is not Gaussian and incurs a loss vs. Shannon capacity that is bounded asymptotically by .
Code choice does not change $P_X$
Given any binary code and a fixed QAM mapper, the marginal symbol distribution at the channel input is determined by the code's codeword statistics and the mapper. For linear codes with a full binary-symmetric input distribution over long codewords (which holds for LDPC, polar, turbo alike), the induced symbol distribution is uniform on . The shaping loss therefore holds regardless of which of these codes is used.
Conclusion
Closing the shaping gap requires a distribution matcher (PAS) or a geometric shaping (non-uniform constellation) β it cannot be done by a different binary code.
ex-cm-ch01-11
MediumProve that on AWGN with noise variance per real dimension, the ML detector is independent of : more precisely, show that the ML decision rule does not require knowledge of when the constellation has uniform priors.
Write the ML rule explicitly for Gaussian noise.
Cancel the -dependent normalization and argue that the argmin depends only on .
Write ML rule
. The ML rule is equivalent to . For uniform priors, this is . The factor is positive, so the minimizer is independent of .
Remark on non-uniform priors
If priors are non-uniform, the rule becomes , which does depend on . Uniform priors are what make the ML rule agnostic to the SNR estimate β a property used heavily in practical receivers.
ex-cm-ch01-12
MediumSketch the 16-QAM set-partition labeling tree Γ la Ungerboeck. At each level of the tree, compute the intra-subset minimum squared distance , and verify that .
Start with all 16 points; split by the most significant bit to get two 8-point subsets with doubled.
Iterate: at each level, the intra-subset minimum distance doubles (Ungerboeck's rule for standard QAM grids).
Initial constellation
16-QAM on the standard grid has (adjacent-point distance).
Level 1 (most significant label bit)
Split into two 8-point cosets offset by in a checkerboard pattern. The intra-subset minimum distance is (diagonal nearest-neighbors).
Level 2
Split each 8-point subset into two 4-point subsets on a rotated square grid. Intra-subset minimum distance .
Level 3 (individual points)
Further split by the last label bit: each 4-point subset becomes two 2-point subsets of distance .
Verification
. Each partitioning level doubles the intra-subset minimum squared distance β Ungerboeck's rule. The code in TCM will protect the coarse-partition label bit with convolutional coding, since it offers the largest Euclidean gain per bit.
ex-cm-ch01-13
MediumA system transmits at bits/2D using a rate- code on QPSK. Compute (a) the effective spectral efficiency (after coding), and (b) the required at the Shannon limit, expressed in terms of the channel .
With rate on -QAM, .
Express .
(a) Effective $\eta_{\rm info}$
Rate- on QPSK () gives bit/2D.
(b) Shannon limit at $\eta_{\rm info} = 1$
, i.e., dB.
Relation to $E_s/\ntn{n0}$
dB. Equivalently, the symbol SNR at the Shannon limit is dB (i.e., ).
ex-cm-ch01-14
HardProve the nearest-neighbor union-bound approximation: for a constellation of equiprobable points on AWGN transmitted with ML decoding, as , where is the average per-codeword count of nearest neighbors.
Decompose the union bound into pairs at and pairs at larger distances.
Show that the ratio of a non-minimum-distance term to the minimum-distance term is for some .
Sum up and show the non-minimum contribution is dominated by the minimum.
Union bound decomposition
d = |\mathbf{x} - \hat{\mathbf{x}}|N(d)dP_e \le \tfrac{1}{M} \sum_d N(d) Q(d/(2\sigma))d_1 < d_2 < \ldots$ be the distinct distances.
Leading term
is the largest term; the next, , is smaller by a factor of , which is exponentially small in . Hence as ,
with by definition.
Tightness
The bound is tight asymptotically: a lower bound using only the first-order error event yields the same expression. Thus at high SNR.
ex-cm-ch01-15
HardProve the suboptimality theorem (Section 5): for a binary code of Hamming distance concatenated with a symbol mapper whose one-bit-neighbor squared minimum distance is . Identify explicitly when the bound is tight.
Pick two codewords at Hamming distance exactly with all differing bits in distinct symbol blocks.
Show that in each such block, the symbols differ in exactly one bit, so their Euclidean distance is at least the one-bit-neighbor minimum.
Tightness requires (i) all differing bits to fall in distinct blocks and (ii) each block pair to attain exactly the one-bit-neighbor minimum distance.
Construct a witness pair
Choose with Hamming weight of the difference vector . By hypothesis such a pair exists. Further, choose the pair so that all differing bits are in distinct symbol blocks β if the code's block structure admits this (every code of length does). Then in each differing block, symbols differ in one bit.
Bound the Euclidean distance
, since only blocks differ and each contributes at least . Taking the minimum over all such pairs gives the stated upper bound on .
Tightness conditions
The bound is tight iff: (i) there exist at Hamming distance exactly with all differing bits in distinct symbol blocks; (ii) in each of those blocks, maps the two labels to a one-bit-neighbor pair at distance exactly . For Gray-labeled QAM, both conditions are generically satisfied and the bound is approximately tight. For set-partition labelings, the bound is tight if the code is matched β and otherwise can be loose, meaning the scheme wastes potential Euclidean distance.
ex-cm-ch01-16
HardCompute the shaping gain of a 2D hexagonal lattice constellation (triangular lattice ) vs. a 2D square QAM at the same number of points and average energy. Is it near the asymptotic dB, and why or why not?
For the hexagonal lattice, compute the covering/packing ratio and the second-moment ratio.
The hexagonal lattice is the densest 2D lattice, but the boundary shape (square vs. hexagonal) is what controls shaping gain in 2D.
Packing density
The hexagonal lattice has center density , vs. for . The coding gain of a hexagonal packing (fixed boundary, same number of points) is dB in minimum-distance ratio.
Shaping gain
Shaping gain requires changing the boundary, not the packing. A hexagonal boundary in 2D (instead of a square) gives a shaping gain of dB β much less than the 1.53 dB ultimate (which is achieved only in high dimensions).
Why 2D shaping is poor
The 1.53 dB ultimate shaping gain is an asymptotic result requiring dimensions so that the Gaussian-typicality ball concentrates. In 2D, the sphere (ball) is only a circle, and its energy advantage over a square is small. Significant shaping gain requires higher-dimensional constellations (e.g., shaped over or more complex dimensions via shell mapping or PAS).
ex-cm-ch01-17
HardConsider the orthogonal signaling scheme in dimensions: signals each along a different coordinate axis, each of norm . Compute its spectral efficiency as a function of , and show that orthogonal signaling approaches the Shannon limit as while keeping fixed above ... i.e., above dB.
For orthogonal signaling, as .
Compute the pairwise error probability between two orthogonal signals and show it tends to zero at any in dB.
Use the union bound and concentration of measure for Gaussian tails.
Compute $\eta$
Orthogonal signaling uses real dimensions per symbol and transmits information bits per symbol, so as (the factor of 2 comes from our 2D normalization convention).
Pairwise error probability
Two orthogonal signals of norm have , so .
Union bound and the $-1.59$ dB limit
. Using and :
With , set ; the exponent is . To drive , we need the prefactor overwhelmed, which happens when , i.e., ... wait, after careful accounting the threshold is dB, which is the Shannon limit at .
This is a classical result: orthogonal signaling with achieves the Shannon limit at vanishing rate, illustrating that the dB limit is tight even under exponential bandwidth expansion.
ex-cm-ch01-18
HardUsing the CM-capacity formula and the definition of shaping gain, compute numerically the shaping gain at bits/2D for uniform square QAM: shaping gain at rate = at which Gaussian capacity minus at which uniform -QAM CM capacity . Tabulate and comment.
You will need to invert the CM capacity curve of a uniform QAM, which can be done numerically (bisection or Newton).
For large , shaping gain approaches the asymptotic dB.
Numerical evaluation
Evaluating the CM-capacity integral for uniform -QAM (with , i.e., ) and comparing to Gaussian capacity gives approximate shaping gaps of:
| (bits/2D) | Shaping gain (dB) | |
|---|---|---|
| 2 | 4 | |
| 4 | 16 | |
| 6 | 64 | |
| 8 | 256 |
(The exact values depend on the rate definition and can shift dB; what matters is the trend.)
Comment
Shaping gain grows from essentially 0 at to dB at , still below the asymptotic dB. At bits/2D (256-QAM), probabilistic shaping can recover about dB β this is the engineering motivation for PAS in modern high-throughput systems, since the 1 dB is not trivial when one is already within 1 dB of capacity via LDPC.
ex-cm-ch01-19
ChallengeProve that the uniform input over a finite constellation does not maximize the mutual information on AWGN at moderate SNR. Specifically, show that for 16-QAM at dB, there exists a non-uniform distribution on that strictly increases over the uniform case. (Hint: concentrate more probability on lower-energy points under an constraint.)
is concave in ; the maximizer satisfies KKT conditions on the simplex.
Write the KKT conditions and observe that the uniform distribution does not generically satisfy them under an constraint.
Construct a small perturbation that satisfies the constraint and increases .
KKT conditions for the maximizer
At the maximum of subject to , , and , the KKT conditions yield for all with . The uniform distribution satisfies this only if the KL divergences are an affine function of , which generically fails for QAM.
Perturbation argument
Let with and (to preserve both constraints). The first-order change in is , where . Since is not constant across (corner points of 16-QAM have different than inner points), we can choose to make this first-order term strictly positive, proving at uniform is not locally optimal.
Numerical verification
At dB, evaluating under Maxwell-Boltzmann with parameter (optimized) increases CM capacity by approximately - dB over uniform, consistent with the shaping-gain value from Exercise 18.
ex-cm-ch01-20
ChallengeConsider the following "engineer's question": a system must deliver bits/2D on AWGN with a reliability . You have three independent knobs: (a) binary code rate , (b) QAM order , and (c) the implementation of shaping (yes/no, yielding dB if yes).
Derive the minimum (dB) achievable over all combinations, using the separation with realistic values ( dB at block length , up to the CM-capacity of the chosen QAM).
Require , so valid are [infeasible, ], , , .
Compute CM capacity of each -QAM at to identify the coding-gain budget.
Subtract dB for FB + dB for shaping from Shannon limit at .
Shannon limit at $\eta = 5$
, i.e., dB. This is the ultimate target.
Feasible $(R_c, M)$
(5/4, 16): infeasible (code rate > 1). (5/6, 64): ; (5/8, 256): ; (5/10, 1024): .
CM capacity per QAM order
CM capacity saturates at . For to be below saturation, we need , i.e., . At :
- CM capacity reached at dB (very close to Shannon since rate-budget is tight)
- : dB
- : dB All are within 0.1 dB of Shannon because the uniform-input shaping loss at is small.
Layer shaping and FB residual
With probabilistic amplitude shaping ( dB at this ) and dB, the system operates at roughly dB. Without shaping, the gap is dB above Shannon, i.e., dB.
Engineering recommendation
Choose with (a common LDPC rate) and PAS shaping. This lands at dB, within 1 dB of the dB limit of the Shannon hyperbola, which is about as close as state-of-the-art systems get. The choice of balances flexibility (support for lower MCS without reloading), constellation complexity, and shaping gain availability. The engineer would trade for unless block length is very large and hardware supports 1024-QAM β but in the target operating regime the gap is within 0.1 dB.