Ferkans — Interactive Telecom Tutor

ex-cm-ch01-01

Easy

Compute the Shannon-limit $E_b/N_0$ (in dB) at $\eta = 3$ bits/2D, $\eta = 5$ bits/2D, and $\eta = 8$ bits/2D.

Show Hint

Use $(E_b/N_0)_{\min}(\eta) = (2^\eta - 1)/\eta$ .

Convert the linear value to dB by $10 \log_{10}(\cdot)$ .

Solution

Apply the Shannon formula

$\eta = 3$ : $(2^3 - 1)/3 = 7/3 \approx 2.333$ , which is $3.68$ dB.
$\eta = 5$ : $(2^5 - 1)/5 = 31/5 = 6.2$ , which is $7.92$ dB.
$\eta = 8$ : $(2^8 - 1)/8 = 255/8 = 31.875$ , which is $15.03$ dB.

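The spacing of these values shows the trend: about $2.1$ dB per extra bit/2D between $\eta = 3$ and $\eta = 5$ , and about $2.4$ dB per extra bit/2D between $\eta = 5$ and $\eta = 8$ , approaching the asymptotic $\approx 3$ dB per extra bit/2D at high $\eta$ .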
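The three values can be cross-checked numerically. The snippet below is a minimal sketch; the function name `shannon_limit_db` is an illustrative choice, not something defined in the exercise.

```python
import math

def shannon_limit_db(eta: float) -> float:
    """Minimum Eb/N0 in dB at spectral efficiency eta (bits/2D)."""
    linear = (2.0 ** eta - 1.0) / eta      # (2^eta - 1) / eta
    return 10.0 * math.log10(linear)       # convert to dB

for eta in (3, 5, 8):
    print(f"eta = {eta}: {shannon_limit_db(eta):.2f} dB")
```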

ex-cm-ch01-02

Easy

Compute $d_{\rm E, \min}^2 / E_s$ for (a) BPSK, (b) 4-PAM (points $\pm 1, \pm 3$ normalized to unit average energy), and (c) 8-PSK of unit average energy.

Show Hint

For an $M$ -PAM of points $\pm 1, \pm 3, \ldots$ , the average energy before normalization is $(M^2 - 1)/3$ .

For $M$ -PSK of unit energy, adjacent points are at distance $2 \sin(\pi/M)$ .

Solution

(a) BPSK

$E_s = 1$ , $d_{\rm E, \min}^2 = (1 - (-1))^2 = 4$ . Ratio $= 4$ .

(b) 4-PAM of unit energy

Raw energy of $\pm 1, \pm 3$ is $(1 + 9)/2 = 5$ . Scale each point by $1/\sqrt{5}$ to get unit energy; then $d_{\rm E, \min} = 2/\sqrt{5}$ and $d_{\rm E, \min}^2 = 4/5 = 0.8$ . Ratio $= 0.8$ .

(c) 8-PSK of unit energy

Adjacent points at distance $2 \sin(\pi/8) \approx 0.765$ ; so $d_{\rm E, \min}^2 \approx 0.586$ and the ratio is $0.586$ .

Interpretation

BPSK has the largest $d_{\rm E, \min}^2 / E_s$ but the lowest rate. Higher-order constellations trade minimum-distance ratio for higher $\eta$ : from BPSK's $4$ to 8-PSK's $0.586$ is a $10 \log_{10}(4/0.586) \approx 8.3$ dB loss of normalized distance in exchange for 2 extra bits per symbol ( $1 \to 3$ ).
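The three ratios can also be obtained by brute force over all point pairs. This is a small sketch, assuming each constellation is represented as a list of complex (or real) points; the helper name `min_dist_ratio` is hypothetical.

```python
import cmath
import math
from itertools import combinations

def min_dist_ratio(points):
    """Return d_min^2 / E_s for a list of constellation points."""
    es = sum(abs(p) ** 2 for p in points) / len(points)            # average energy
    d2 = min(abs(a - b) ** 2 for a, b in combinations(points, 2))  # min squared distance
    return d2 / es

bpsk = [-1, 1]
pam4 = [-3, -1, 1, 3]                                   # ratio is scale-invariant
psk8 = [cmath.exp(2j * math.pi * k / 8) for k in range(8)]

for name, pts in (("BPSK", bpsk), ("4-PAM", pam4), ("8-PSK", psk8)):
    print(f"{name}: {min_dist_ratio(pts):.3f}")
```

Because the ratio is invariant to scaling, the 4-PAM points do not need to be normalized before the computation.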

ex-cm-ch01-03

Easy

A system achieves $P_b = 10^{-5}$ at $E_b/N_0 = 2.5$ dB while transmitting at $\eta = 2$ bits/2D. How far is this from the Shannon limit?

Show Hint

Evaluate $(2^2 - 1)/2$ and convert to dB.

Solution

Shannon limit at $\eta = 2$

$(2^2 - 1)/2 = 3/2 = 1.5$ , which is $1.76$ dB.

Gap

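$2.5 - 1.76 = 0.74$ dB above capacity. This is a near-capacity result, consistent with a modern LDPC-coded system on a constellation larger than QPSK (for example, a rate- $1/2$ code on 16-QAM); coded QPSK carries fewer than 2 bits/2D, so it cannot operate at $\eta = 2$ .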

ex-cm-ch01-04

Easy

State the Chernoff upper bound $Q(x) \le \tfrac{1}{2} e^{-x^2/2}$ and use it to produce an upper bound on the pairwise error probability $P(\mathbf{x} \to \hat{\mathbf{x}})$ on AWGN in terms of $\|\boldsymbol{\Delta}\|^2$ and $N_0$ .

Show Hint

Substitute $x = \|\boldsymbol{\Delta}\| / (2\sigma)$ with $\sigma^2 = N_0/2$ .

Solution

Substitute into the Chernoff bound

With $x = \|\boldsymbol{\Delta}\|/(2\sigma)$ and $\sigma^2 = N_0/2$ ,

$Q(x) \le \tfrac{1}{2} e^{-x^2/2} = \tfrac{1}{2} \exp\!\left(-\frac{\|\boldsymbol{\Delta}\|^2}{8 \sigma^2}\right) = \tfrac{1}{2} \exp\!\left(-\frac{\|\boldsymbol{\Delta}\|^2}{4 N_0}\right).$
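A quick numerical comparison shows how the bound sits above the exact pairwise error probability. This is a sketch only; `pep_exact` and `pep_chernoff` are illustrative names, and $Q(x)$ is evaluated through the standard identity $Q(x) = \tfrac{1}{2}\,\mathrm{erfc}(x/\sqrt{2})$.

```python
import math

def q_func(x: float) -> float:
    """Gaussian tail function Q(x) = 0.5 * erfc(x / sqrt(2))."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def pep_exact(delta_sq: float, n0: float) -> float:
    """Exact AWGN PEP: Q(||Delta|| / (2 sigma)) with sigma^2 = N0 / 2."""
    sigma = math.sqrt(n0 / 2.0)
    return q_func(math.sqrt(delta_sq) / (2.0 * sigma))

def pep_chernoff(delta_sq: float, n0: float) -> float:
    """Chernoff-type bound: 0.5 * exp(-||Delta||^2 / (4 N0))."""
    return 0.5 * math.exp(-delta_sq / (4.0 * n0))

for n0 in (2.0, 1.0, 0.5, 0.1):
    print(n0, pep_exact(4.0, n0), pep_chernoff(4.0, n0))
```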

ex-cm-ch01-05

Easy

For 16-QAM of energy $E_s = 10$ (standard integer grid), compute $d_{\rm E, \min}^2$ directly and verify the formula $d_{\rm E, \min}^2 = 6 E_s / (M - 1)$ from Section 1.

Show Hint

The standard 16-QAM grid is $\{-3, -1, 1, 3\}^2$ .

Solution

Energy of the grid

The 16 points $(\pm 1, \pm 1), (\pm 1, \pm 3), (\pm 3, \pm 1), (\pm 3, \pm 3)$ have average energy $\tfrac{1}{16} \sum_i \|\mathbf{x}_i\|^2 = 10$ .

Minimum distance

Adjacent points differ by $2$ in one coordinate and $0$ in the other: $d_{\rm E, \min}^2 = 2^2 + 0^2 = 4$ .

Check the formula

$6 \cdot 10 / (16 - 1) = 60 / 15 = 4$ . $\checkmark$
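The same check can be done by enumerating the grid. A minimal sketch, with variable names chosen here for illustration:

```python
from itertools import combinations, product

levels = (-3, -1, 1, 3)
points = list(product(levels, repeat=2))          # the 16 grid points

# Average energy over the 16 points
es = sum(x * x + y * y for x, y in points) / len(points)

# Minimum squared Euclidean distance over all unordered pairs
d2_min = min((a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2
             for a, b in combinations(points, 2))

# Closed-form expression 6 E_s / (M - 1)
formula = 6 * es / (len(points) - 1)

print(es, d2_min, formula)
```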

ex-cm-ch01-06

Medium

Starting from the Shannon capacity formula $C = \log_2(1 + \text{SNR})$ and the identity $E_s = \eta E_b$ , derive the Shannon-limit curve $(E_b/N_0)_{\min}(\eta) = (2^\eta - 1)/\eta$ and show that $\frac{d}{d\eta} (E_b/N_0)_{\min}(\eta) > 0$ for all $\eta > 0$ .

Show Hint

Invert $C = \log_2(1 + \text{SNR})$ and substitute.

For the monotonicity, take the derivative with respect to $\eta$ and show its sign.

Solution

Derivation

From $\eta < \log_2(1 + \text{SNR})$ , reliable communication requires $\text{SNR} > 2^\eta - 1$ . Using $\text{SNR} = E_s/N_0 = \eta E_b / N_0$ , divide by $\eta$ : $E_b/N_0 > (2^\eta - 1)/\eta$ . The minimum is $(2^\eta - 1)/\eta$ .

Monotonicity

Let $f(\eta) = (2^\eta - 1)/\eta$ . Compute $f'(\eta) = \frac{2^\eta \ln 2 \cdot \eta - (2^\eta - 1)}{\eta^2}$ . The numerator at $\eta = 0^+$ equals $0$ (both terms vanish), and its derivative is $2^\eta (\ln 2)^2 \eta > 0$ , so the numerator is strictly positive for $\eta > 0$ . Hence $f'(\eta) > 0$ , and the Shannon-limit curve is strictly increasing in $\eta$ .
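The monotonicity claim and the $\eta \to 0$ limit can be sanity-checked on a grid. This is a numerical illustration only, not a substitute for the proof; `math.expm1` is used to avoid cancellation in $2^\eta - 1$ for small $\eta$.

```python
import math

def shannon_limit(eta: float) -> float:
    """(2^eta - 1) / eta, computed as expm1(eta ln 2) / eta."""
    return math.expm1(eta * math.log(2.0)) / eta

etas = [0.01 * k for k in range(1, 1001)]         # 0.01 ... 10 bits/2D
values = [shannon_limit(e) for e in etas]

# Strictly increasing along the grid
is_increasing = all(a < b for a, b in zip(values, values[1:]))
print(is_increasing, shannon_limit(1e-6), math.log(2.0))
```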

ex-cm-ch01-07

Medium

An AWGN PEP is $P(\mathbf{x} \to \hat{\mathbf{x}}) = Q(\|\boldsymbol{\Delta}\| / (2\sigma))$ . Using the upper and lower bounds $\tfrac{1}{\sqrt{2\pi} x} (1 - 1/x^2) e^{-x^2/2} \le Q(x) \le \tfrac{1}{\sqrt{2\pi} x} e^{-x^2/2}$ (valid for $x > 0$ ), show that on AWGN, $-\log P(\mathbf{x} \to \hat{\mathbf{x}})$ grows as $\|\boldsymbol{\Delta}\|^2 / (8\sigma^2)$ plus lower-order terms as SNR $\to \infty$ .

Show Hint

Take $-\log$ of both bounds and show they coincide to leading order in the SNR.

Identify $x = \|\boldsymbol{\Delta}\|/(2\sigma)$ and note $x \to \infty$ as $\sigma \to 0$ .

Solution

Take logs

With $x = \|\boldsymbol{\Delta}\|/(2\sigma)$ , $-\log Q(x) = \tfrac{x^2}{2} + \log(\sqrt{2\pi} x) + O(x^{-2}) = \tfrac{\|\boldsymbol{\Delta}\|^2}{8\sigma^2} + \log \tfrac{\|\boldsymbol{\Delta}\| \sqrt{2\pi}}{2\sigma} + o(1).$

Leading-order SNR scaling

As $\sigma \to 0$ (i.e., high SNR), the first term $\|\boldsymbol{\Delta}\|^2 / (8\sigma^2)$ dominates; the logarithmic term grows only as $\log(1/\sigma)$ and is subexponential. Hence $-\log P_{\rm PEP} \sim \|\boldsymbol{\Delta}\|^2 / (8\sigma^2)$ , confirming that minimum Euclidean distance sets the asymptotic error exponent.
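The sandwich and the leading-order behaviour can be observed numerically. The sketch below assumes $Q(x)$ is evaluated via `math.erfc`; the function names are illustrative.

```python
import math

def q_func(x: float) -> float:
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def q_upper(x: float) -> float:
    """Upper bound exp(-x^2/2) / (sqrt(2 pi) x)."""
    return math.exp(-x * x / 2.0) / (math.sqrt(2.0 * math.pi) * x)

def q_lower(x: float) -> float:
    """Lower bound (1 - 1/x^2) * exp(-x^2/2) / (sqrt(2 pi) x)."""
    return (1.0 - 1.0 / (x * x)) * q_upper(x)

def exponent_ratio(x: float) -> float:
    """-ln Q(x) divided by the leading term x^2 / 2; tends to 1 as x grows."""
    return -math.log(q_func(x)) / (x * x / 2.0)

for x in (2.0, 5.0, 10.0, 30.0):
    print(x, q_lower(x) <= q_func(x) <= q_upper(x), exponent_ratio(x))
```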

ex-cm-ch01-08

Medium

For 8-PSK at energy $E_s = 1$ , compute $d_{\rm E, \min}^2$ and the number of nearest neighbors $K_{\min}$ per point. Then apply the high-SNR union-bound approximation $P_e \approx K_{\min} Q(d_{\rm E, \min} / (2\sigma))$ to estimate the $E_s/N_0$ (dB) needed for $P_s = 10^{-5}$ .

Show Hint

For $M$ -PSK, $d_{\rm E, \min} = 2 \sin(\pi/M)$ and each point has 2 nearest neighbors.

Solve $Q(x) = 10^{-5}/K_{\min}$ for $x$ and invert to $E_s/N_0$ .

Solution

Nearest-neighbor geometry

For 8-PSK, $d_{\rm E, \min} = 2 \sin(\pi/8) \approx 0.7654$ ; $d_{\rm E, \min}^2 \approx 0.586$ . Each point has $K_{\min} = 2$ nearest neighbors.

Solve for SNR

Set $2 Q(x) \le 10^{-5}$ , so $Q(x) \le 5 \times 10^{-6}$ , giving $x \ge 4.42$ . Then $d_{\rm E, \min}/(2\sigma) = 4.42$ with $\sigma^2 = N_0/2$ :

$\frac{E_s}{N_0} = \frac{1}{2 \sigma^2} = \frac{(2 x)^2}{2\, d_{\rm E, \min}^2} = \frac{(2 \cdot 4.42)^2}{2 \cdot 0.586} \approx 66.7,$

which is $18.2$ dB. (In practice, uncoded 8-PSK requires about $18$ dB of $E_s/N_0$ for $P_s = 10^{-5}$ , matching closely.)
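The inversion of $Q(x) = 5 \times 10^{-6}$ can be done by bisection. This is a minimal sketch under the same nearest-neighbor approximation; `q_inv` is a hypothetical helper written here, not a library function.

```python
import math

def q_func(x: float) -> float:
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def q_inv(p: float, lo: float = 0.0, hi: float = 10.0) -> float:
    """Solve Q(x) = p by bisection (Q is decreasing on [lo, hi])."""
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if q_func(mid) > p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

k_min = 2
d2_min = (2.0 * math.sin(math.pi / 8.0)) ** 2     # 8-PSK, E_s = 1
x = q_inv(1e-5 / k_min)

# E_s/N0 = 1/(2 sigma^2) with sigma = d_min / (2 x)  =>  2 x^2 / d_min^2
es_n0 = 2.0 * x * x / d2_min
es_n0_db = 10.0 * math.log10(es_n0)
print(f"x = {x:.3f}, Es/N0 = {es_n0:.1f} ({es_n0_db:.1f} dB)")
```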

ex-cm-ch01-09

Medium

Compute the CM capacity of a binary antipodal constellation ( $\pm 1$ ) on the real AWGN channel as a function of the signal-to-noise ratio $\text{SNR}$ , and verify that as $\text{SNR} \to 0$ it matches the Shannon capacity $\tfrac{1}{2} \ln(1 + \text{SNR})$ nats up to $O(\text{SNR}^{2})$ .

Show Hint

The CM capacity of BPSK is $I(X;Y)$ for $X \in \{\pm 1\}$ uniform.

Use $I(X;Y) = h(Y) - h(Y|X) = h(Y) - h(W)$ and expand $h(Y)$ in SNR.

At low SNR, BPSK is Gaussian-like and the mutual information approaches Shannon.

Solution

BPSK mutual information

For BPSK in AWGN at signal-to-noise ratio $\text{SNR}$ (noise variance $1/\text{SNR}$ , with $E_s = 1$ ), the mutual information in bits is:

$I_{\rm BPSK}(\text{SNR}) = \int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi/\text{SNR}}} \exp\!\left(-\tfrac{\text{SNR}}{2} (y-1)^2\right) \log_2 \frac{2}{1 + e^{-2\text{SNR} y}}\, dy.$

(An equivalent form uses $\cosh$ and the J-function; any derivation is acceptable.)

Low-SNR expansion

Expand the integrand in powers of $\text{SNR}$ : $I_{\rm BPSK}(\text{SNR}) = \tfrac{1}{2}\, \text{SNR} + O(\text{SNR}^{2})$ nats, equivalently $\tfrac{1}{2 \ln 2}\, \text{SNR} + O(\text{SNR}^{2})$ bits (the integral with $\log_2$ is in bits; multiply by $\ln 2$ for nats). The Shannon capacity is $C(\text{SNR}) = \tfrac{1}{2} \ln(1 + \text{SNR}) = \tfrac{1}{2}\, \text{SNR} + O(\text{SNR}^{2})$ nats. The two match to first order, confirming that BPSK is capacity-optimal at low SNR.

High-SNR saturation

At high SNR, $I_{\rm BPSK}(\text{SNR}) \to 1$ bit (since there are only 2 signals to disambiguate), while Shannon capacity diverges. This is the shaping / cardinality limit: BPSK can carry at most 1 bit per channel use.
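Both regimes can be observed by evaluating the integral numerically. The sketch below uses a plain trapezoid rule over $y \in [1 - 10\sigma, 1 + 10\sigma]$ with $\sigma^2 = 1/\text{SNR}$; the step count and integration span are assumptions chosen for illustration, and the result is in bits.

```python
import math

def softplus(z: float) -> float:
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def bpsk_mi_bits(snr: float, steps: int = 4000, span: float = 10.0) -> float:
    """BPSK mutual information (bits) on real AWGN with noise variance 1/snr."""
    sigma = 1.0 / math.sqrt(snr)
    h = 2.0 * span * sigma / steps
    total = 0.0
    for k in range(steps + 1):
        y = 1.0 - span * sigma + k * h
        density = math.sqrt(snr / (2.0 * math.pi)) * math.exp(-0.5 * snr * (y - 1.0) ** 2)
        # log2(2 / (1 + exp(-2 snr y))) = 1 - softplus(-2 snr y) / ln 2
        info = 1.0 - softplus(-2.0 * snr * y) / math.log(2.0)
        weight = 0.5 if k in (0, steps) else 1.0      # trapezoid end points
        total += weight * density * info * h
    return total

for snr in (0.01, 0.1, 1.0, 10.0, 100.0):
    print(snr, bpsk_mi_bits(snr), 0.5 * math.log2(1.0 + snr))
```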

ex-cm-ch01-10

Medium

Prove that the shaping gain $\gamma_s$ is independent of the choice of code, i.e., that swapping an LDPC code for a polar code for the same $\eta$ does not change the upper bound $\gamma_s \le \pi e / 6 \approx 1.53$ dB imposed by the shaping of the underlying QAM.

Show Hint

The shaping loss is a property of the input distribution $P_X$ , not of $\mathcal{C}$ .

Show that CM capacity depends only on the distribution over $\mathcal{X}$ , and uniform input is the worst-case.

Solution

Shaping loss is a function of $P_X$

CM capacity at SNR $\text{SNR}$ is $I(X; Y)$ under the given input distribution $P_X$ . Uniform $P_X$ on $\mathcal{X}$ is not Gaussian and incurs a loss vs. Shannon capacity that is bounded asymptotically by $\pi e/6$ .

Code choice does not change $P_X$

Given any binary code and a fixed QAM mapper, the marginal symbol distribution at the channel input is determined by the code's codeword statistics and the mapper. For linear codes with equiprobable codewords (LDPC, polar, and turbo codes alike), the coded bits are marginally uniform, so over long codewords the induced symbol distribution is, to a very good approximation, uniform on $\mathcal{X}$ . The shaping loss bound $\gamma_s \le \pi e / 6$ therefore holds regardless of which of these codes is used.

Conclusion

Closing the shaping gap requires a distribution matcher (PAS) or geometric shaping (a non-uniform constellation); it cannot be done by swapping in a different binary code. $\blacksquare$

ex-cm-ch01-11

Medium

Prove that on AWGN with noise variance $\sigma^2$ per real dimension, the ML detector is independent of $E_s$ : more precisely, show that the ML decision rule $\hat{\mathbf{x}}_{\rm ML} = \arg\min_{\mathbf{x} \in \mathcal{X}} \|\mathbf{y} - \mathbf{x}\|^2$ does not require knowledge of $\sigma^2$ when the constellation has uniform priors.

Show Hint

Write the ML rule $\arg\max_x p(\mathbf{y}|\mathbf{x})$ explicitly for Gaussian noise.

Cancel the $\sigma$ -dependent normalization and argue that the argmin depends only on $\|\mathbf{y} - \mathbf{x}\|^2$ .

Solution

Write ML rule

$p(\mathbf{y}|\mathbf{x}) = (2\pi \sigma^2)^{-N/2} \exp(-\|\mathbf{y} - \mathbf{x}\|^2 / (2\sigma^2))$ . The prefactor does not depend on $\mathbf{x}$ , so the ML rule $\arg\max_{\mathbf{x}} p(\mathbf{y}|\mathbf{x})$ is equivalent to $\arg\max_{\mathbf{x}} -\|\mathbf{y} - \mathbf{x}\|^2 / (2\sigma^2)$ . The factor $1/(2\sigma^2)$ is positive, so this is $\arg\min_{\mathbf{x}} \|\mathbf{y} - \mathbf{x}\|^2$ and the minimizer is independent of $\sigma^2$ . For uniform priors, this ML rule coincides with the MAP rule and is therefore also the minimum-error-probability detector.

Remark on non-uniform priors

If priors are non-uniform, the MAP rule becomes $\arg\min_{\mathbf{x}} [\|\mathbf{y} - \mathbf{x}\|^2 - 2\sigma^2 \ln p(\mathbf{x})]$ , which does depend on $\sigma^2$ . Uniform priors are what make the optimal detector agnostic to the SNR estimate, a property used heavily in practical receivers.
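The independence from $\sigma^2$ can be illustrated by comparing a likelihood-based detector with a plain minimum-distance detector at several noise levels. This is a sketch on the 16-QAM grid with $N = 2$ real dimensions; the received point and the function names are arbitrary choices for illustration.

```python
import math
from itertools import product

POINTS = list(product((-3, -1, 1, 3), repeat=2))   # 16-QAM grid

def sq_dist(y, x):
    return (y[0] - x[0]) ** 2 + (y[1] - x[1]) ** 2

def log_likelihood(y, x, sigma):
    """ln p(y|x) for N = 2 real dimensions with per-dimension variance sigma^2."""
    return -math.log(2.0 * math.pi * sigma ** 2) - sq_dist(y, x) / (2.0 * sigma ** 2)

def ml_detect(y, sigma):
    """Maximize the Gaussian log-likelihood over the constellation."""
    return max(POINTS, key=lambda x: log_likelihood(y, x, sigma))

def min_distance_detect(y):
    """Minimize the squared Euclidean distance; uses no noise statistics."""
    return min(POINTS, key=lambda x: sq_dist(y, x))

y = (0.7, -2.2)
for sigma in (0.1, 1.0, 10.0):
    print(sigma, ml_detect(y, sigma), min_distance_detect(y))
```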

ex-cm-ch01-12

Medium

Sketch the 16-QAM set-partition labeling tree à la Ungerboeck. At each level of the tree, compute the intra-subset minimum squared distance $\Delta_i^2$ , and verify that $\Delta_0^2 < \Delta_1^2 < \Delta_2^2 < \Delta_3^2$ .

Show Hint

Start with all 16 points; split by the most significant bit to get two 8-point subsets with $\Delta_0^2$ doubled.

Iterate: at each level, the intra-subset minimum distance doubles (Ungerboeck's rule for standard QAM grids).

Solution

Initial constellation

16-QAM on the standard grid $\{-3, -1, 1, 3\}^2$ has $\Delta_0^2 = 4$ (adjacent-point distance).

Level 1 (most significant label bit)

Split into two 8-point cosets offset by $(\pm 1, \pm 1)$ in a checkerboard pattern. The intra-subset minimum distance is $\Delta_1^2 = 8$ (diagonal nearest-neighbors).

Level 2

Split each 8-point subset into two 4-point subsets, each lying on an axis-aligned square grid of spacing $4$ . Intra-subset minimum distance $\Delta_2^2 = 4^2 = 16$ .

Level 3 (two-point subsets)

Further split by the last label bit: each 4-point subset becomes two 2-point subsets formed by its diagonal pairs, at squared distance $\Delta_3^2 = 4^2 + 4^2 = 32$ .

Verification

$\Delta_0^2 = 4 < \Delta_1^2 = 8 < \Delta_2^2 = 16 < \Delta_3^2 = 32$ . Each partitioning level doubles the intra-subset minimum squared distance, which is Ungerboeck's rule. In TCM, the convolutional code protects the label bits of the first (coarse) partition levels, where the available distance is smallest; the remaining bits can be left uncoded because they are already protected by the large intra-subset distance.
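The distance chain can be verified by brute force. The sketch below assumes one particular assignment of partition bits (checkerboard parity, then column parity, then diagonal pairing); other labelings that follow Ungerboeck's rule give the same distances. The function names are illustrative.

```python
from collections import defaultdict
from itertools import combinations, product

POINTS = list(product((-3, -1, 1, 3), repeat=2))

def partition_bits(point):
    """Three partition bits for a 16-QAM point (one possible labeling)."""
    x, y = point
    i, j = (x + 3) // 2, (y + 3) // 2               # grid indices 0..3
    return ((i + j) % 2,                            # level 1: checkerboard
            i % 2,                                  # level 2: spacing-4 grid
            (i // 2 + j // 2) % 2)                  # level 3: diagonal pairs

def min_intra_subset_d2(level):
    """Smallest squared distance inside any subset after `level` splits."""
    subsets = defaultdict(list)
    for p in POINTS:
        subsets[partition_bits(p)[:level]].append(p)
    return min((a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2
               for pts in subsets.values()
               for a, b in combinations(pts, 2))

chain = [min_intra_subset_d2(level) for level in range(4)]
print(chain)
```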

ex-cm-ch01-13

Medium

A system transmits 2 coded bits/2D using a rate- $1/2$ code on QPSK. Compute (a) the effective spectral efficiency $\eta_{\rm info}$ (after coding), and (b) the required $E_b/N_0$ at the Shannon limit, expressed in terms of the channel $E_s/N_0$ .

Show Hint

With rate $R_c$ on $M$ -QAM, $\eta_{\rm info} = R_c \log_2 M$ .

Express $E_b = E_s / \eta_{\rm info}$ .

Solution

(a) Effective $\eta_{\rm info}$

Rate- $1/2$ on QPSK ( $\log_2 4 = 2$ ) gives $\eta_{\rm info} = 0.5 \cdot 2 = 1$ bit/2D.

(b) Shannon limit at $\eta_{\rm info} = 1$

$(E_b/N_0)_{\min} = (2^1 - 1)/1 = 1$ , i.e., $0$ dB.

Relation to $E_s/N_0$

In linear units, $E_s/N_0 = \eta_{\rm info} \cdot E_b/N_0 = 1 \cdot 1 = 1$ , i.e., $0$ dB. Equivalently, the symbol SNR at the Shannon limit is $0$ dB (i.e., $\text{SNR} = 1$ ).

ex-cm-ch01-14

Hard

Prove the nearest-neighbor union-bound approximation: for a constellation $\mathcal{X}$ of $M$ equiprobable points on AWGN transmitted with ML decoding, $P_e \le K_{\min} Q(d_{\rm E, \min}/(2\sigma)) \cdot (1 + o(1))$ as $\text{SNR} \to \infty$ , where $K_{\min}$ is the average per-codeword count of nearest neighbors.

Show Hint

Decompose the union bound into pairs at $d_{\rm E, \min}$ and pairs at larger distances.

Show that the ratio of a non-minimum-distance $Q$ term to the minimum-distance $Q$ term is $O(e^{-c/\sigma^2})$ for some $c > 0$ .

Sum up and show the non-minimum contribution is dominated by the minimum.

Solution

Union bound decomposition

$P_e \le \frac{1}{M} \sum_{\mathbf{x}} \sum_{\hat{\mathbf{x}} \ne \mathbf{x}} Q\!\left(\frac{\|\mathbf{x} - \hat{\mathbf{x}}\|}{2\sigma}\right).$

Group by distance $d = \|\mathbf{x} - \hat{\mathbf{x}}\|$ ; let $N(d)$ be the total number of ordered pairs at distance exactly $d$ . Then $P_e \le \tfrac{1}{M} \sum_d N(d) Q(d/(2\sigma))$ . Let $d_1 < d_2 < \ldots$ be the distinct distances.

Leading term

$Q(d_1/(2\sigma))$ is the largest term; the next, $Q(d_2/(2\sigma))$ , is smaller by a factor of $\exp(-(d_2^2 - d_1^2)/(8\sigma^2))$ , which is exponentially small in $1/\sigma^2$ . Hence as $\sigma \to 0$ ,

$P_e \le \frac{N(d_1)}{M} Q\!\!\left(\frac{d_1}{2\sigma}\right) (1 + o(1)),$

with $K_{\min} = N(d_1)/M$ by definition.

Tightness

The bound is tight asymptotically: a lower bound using only the first-order error event yields the same expression. Thus $P_e = K_{\min} Q(d_{\rm E, \min}/(2\sigma))(1 + o(1))$ at high SNR. $\blacksquare$
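As a concrete illustration, the distance spectrum $N(d)$ of 16-QAM can be enumerated and the full union bound compared with its nearest-neighbor term. This is a sketch on the standard integer grid; the value $\sigma = 0.3$ is an arbitrary high-SNR test point.

```python
import math
from collections import Counter
from itertools import permutations, product

POINTS = list(product((-3, -1, 1, 3), repeat=2))
M = len(POINTS)

def q_func(x: float) -> float:
    return 0.5 * math.erfc(x / math.sqrt(2.0))

# N(d): number of ordered pairs at each squared distance
spectrum = Counter((a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2
                   for a, b in permutations(POINTS, 2))
d2_min = min(spectrum)
k_min = spectrum[d2_min] / M                       # average nearest-neighbor count

def union_bound(sigma: float) -> float:
    return sum(n * q_func(math.sqrt(d2) / (2.0 * sigma))
               for d2, n in spectrum.items()) / M

def nearest_neighbor_term(sigma: float) -> float:
    return k_min * q_func(math.sqrt(d2_min) / (2.0 * sigma))

ratio = union_bound(0.3) / nearest_neighbor_term(0.3)
print(d2_min, k_min, ratio)
```

The ratio approaches 1 as $\sigma$ decreases, which is the $(1 + o(1))$ factor in the statement.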

ex-cm-ch01-15

Hard

Prove the suboptimality theorem (Section 5): $d_{\rm E, \min}^2(\mathcal{C}, \mu) \le d_H \cdot d_{\rm E, \min}^2(\mathcal{X}, \mu)_{1\text{-bit}}$ for a binary code $\mathcal{C}$ of Hamming distance $d_H$ concatenated with a symbol mapper $\mu$ whose one-bit-neighbor squared minimum distance is $d_{\rm E, \min}^2(\mathcal{X}, \mu)_{1\text{-bit}}$ . Identify explicitly when the bound is tight.

Show Hint

Pick two codewords $\mathbf{c}, \hat{\mathbf{c}}$ at Hamming distance exactly $d_H$ with all differing bits in distinct symbol blocks.

Show that in each such block, the symbols differ in exactly one bit, so their Euclidean distance is at least the one-bit-neighbor minimum.

Tightness requires (i) all $d_H$ differing bits to fall in distinct blocks and (ii) each block pair to attain exactly the one-bit-neighbor minimum distance.

Solution

Construct a witness pair

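Choose $\mathbf{c}, \hat{\mathbf{c}} \in \mathcal{C}$ whose difference vector has Hamming weight exactly $d_H$ ; such a pair exists by definition of $d_H$ . Assume further that the pair can be chosen so that all $d_H$ differing bits fall in distinct symbol blocks. This is a condition on the code and the bit-to-symbol assignment (typically met by long, bit-interleaved codes), not an automatic property. Then in each differing block, the two symbols differ in exactly one label bit.

Bound the Euclidean distance

$\|\mu(\mathbf{c}) - \mu(\hat{\mathbf{c}})\|^2 = \sum_{\text{blocks}} \|\mu(\mathbf{c}_b) - \mu(\hat{\mathbf{c}}_b)\|^2$ , where only the $d_H$ differing blocks contribute. Each of them contributes the squared distance of a one-bit-neighbor pair, which is at least $d_{\rm E, \min}^2(\mathcal{X}, \mu)_{1\text{-bit}}$ , with equality when the pair is a closest one-bit-neighbor pair. For a witness pair that attains equality in every block, the sum equals $d_H \cdot d_{\rm E, \min}^2(\mathcal{X}, \mu)_{1\text{-bit}}$ . Since the minimum distance of the coded scheme cannot exceed the distance of any particular codeword pair, $d_{\rm E, \min}^2(\mathcal{C}, \mu) \le d_H \cdot d_{\rm E, \min}^2(\mathcal{X}, \mu)_{1\text{-bit}}$ .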

Tightness conditions

The bound is tight iff: (i) there exist $\mathbf{c}, \hat{\mathbf{c}}$ at Hamming distance exactly $d_H$ with all differing bits in distinct symbol blocks; (ii) in each of those blocks, $\mu$ maps the two labels to a one-bit-neighbor pair at squared distance exactly $d_{\rm E, \min}^2(\mathcal{X}, \mu)_{1\text{-bit}}$ . For Gray-labeled QAM, both conditions are generically satisfied and the bound is approximately tight. For set-partition labelings, the bound is tight if the code is matched to the labeling; otherwise it can be loose, meaning the scheme wastes potential Euclidean distance. $\blacksquare$

ex-cm-ch01-16

Hard

Compute the shaping gain of a 2D hexagonal lattice constellation (triangular lattice $A_2$ ) vs. a 2D square QAM at the same number of points and average energy. Is it near the asymptotic $1.53$ dB, and why or why not?

Show Hint

For the hexagonal lattice, compute the covering/packing ratio and the second-moment ratio.

The hexagonal lattice is the densest 2D lattice, but the boundary shape (square vs. hexagonal) is what controls shaping gain in 2D.

Solution

Packing density

The hexagonal lattice $A_2$ has center density $\tfrac{1}{2\sqrt{3}} \approx 0.2887$ , vs. $\tfrac{1}{4}$ for $\mathbb{Z}^2$ . The coding gain of a hexagonal packing (fixed boundary, same number of points) is the ratio of the two, $\tfrac{2}{\sqrt{3}} \approx 1.155$ , i.e., $10 \log_{10}(2/\sqrt{3}) \approx 0.625$ dB in minimum-distance ratio.

Shaping gain

Shaping gain requires changing the boundary, not the packing. A hexagonal boundary in 2D (instead of a square) gives a shaping gain of $\approx 0.166$ dB — much less than the 1.53 dB ultimate (which is achieved only in high dimensions).

Why 2D shaping is poor

The 1.53 dB ultimate shaping gain is an asymptotic result requiring $N \to \infty$ dimensions so that the Gaussian-typicality ball concentrates. In 2D, the sphere (ball) is only a circle, and its energy advantage over a square is small: a factor of $\pi/3$ , about $0.20$ dB. Significant shaping gain requires higher-dimensional constellations (e.g., shaped over $N = 64$ or more complex dimensions via shell mapping or PAS).
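The numbers in this solution follow from normalized second moments. The sketch below assumes the standard closed-form values $G = 1/12$ for the square, $G = 5/(36\sqrt{3})$ for the regular hexagon, and $G = 1/(4\pi)$ for the disk, and computes the shaping gain of a region as $(1/12)/G$; the dictionary and function names are illustrative.

```python
import math

# Normalized second moments (per dimension) of 2D regions
G = {
    "square": 1.0 / 12.0,
    "hexagon": 5.0 / (36.0 * math.sqrt(3.0)),
    "disk": 1.0 / (4.0 * math.pi),
}

def shaping_gain_db(region: str) -> float:
    """Shaping gain over a square boundary, in dB."""
    return 10.0 * math.log10(G["square"] / G[region])

ultimate_db = 10.0 * math.log10(math.pi * math.e / 6.0)       # N -> infinity limit
a2_coding_gain_db = 10.0 * math.log10(2.0 / math.sqrt(3.0))   # A2 vs Z^2 packing

for region in G:
    print(f"{region}: {shaping_gain_db(region):.3f} dB")
print(f"ultimate: {ultimate_db:.3f} dB, A2 coding gain: {a2_coding_gain_db:.3f} dB")
```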

ex-cm-ch01-17

Hard

Consider the orthogonal signaling scheme in $N$ dimensions: $M = N$ signals each along a different coordinate axis, each of norm $\sqrt{E_s}$ . Compute its spectral efficiency $\eta$ as a function of $N$ , and show that orthogonal signaling approaches the Shannon limit as $N \to \infty$ while keeping $E_b/N_0$ fixed above $\ln 2 \approx 0.693$ , i.e., above $-1.59$ dB.

Show Hint

For orthogonal signaling, $\eta = \log_2 N / N \cdot 2 \to 0$ as $N \to \infty$ .

Compute the pairwise error probability between two orthogonal signals and show that the error probability tends to zero at any $E_b/N_0 > \ln 2$ (i.e., above $-1.59$ dB).

Use the union bound and concentration of measure for Gaussian tails.

Solution

Compute $\eta$

Orthogonal signaling uses $N$ real dimensions per symbol and transmits $\log_2 N$ information bits per symbol, so $\eta = \log_2 N / N \cdot 2 \to 0$ as $N \to \infty$ (the factor of 2 comes from our 2D normalization convention).

Pairwise error probability

Two orthogonal signals $\mathbf{x}_i, \mathbf{x}_j$ of norm $\sqrt{E_s}$ have $\|\boldsymbol{\Delta}\|^2 = 2 E_s$ , so $P(\mathbf{x}_i \to \mathbf{x}_j) = Q(\sqrt{E_s/N_0})$ .

Union bound and the $-1.59$ dB limit

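$P_e \le (M - 1) Q(\sqrt{E_s/N_0}) = (N-1) Q(\sqrt{E_s/N_0})$ . Each symbol carries $\log_2 N$ bits, so $E_s = E_b \log_2 N$ (here $E_s$ is the energy of the whole $N$ -dimensional symbol, not the energy per 2D). Using $Q(x) \le e^{-x^2/2}/2$ :

$P_e \le \tfrac{1}{2} (N-1) \exp\!\left(-\tfrac{1}{2} \log_2 N \cdot E_b/N_0\right).$

Set $\lambda = E_b/N_0$ and use $N - 1 < e^{\ln N}$ and $\log_2 N = \ln N / \ln 2$ : the bound becomes $\tfrac{1}{2} \exp\!\left(-\ln N \, (\lambda/(2 \ln 2) - 1)\right)$ , which tends to $0$ as $N \to \infty$ whenever $\lambda > 2 \ln 2 \approx 1.42$ dB. The plain union bound is therefore $3$ dB short of the target. The classical refinement conditions on the correct correlator output, applies the union bound only where it is below $1$ , and bounds the conditional error probability by $1$ otherwise; this tightens the threshold to $E_b/N_0 > \ln 2 \approx -1.59$ dB, which is the Shannon limit at $\eta = 0$ .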

This is a classical result: orthogonal signaling with $N \to \infty$ achieves the Shannon limit at vanishing rate, illustrating that the $-1.59$ dB limit is attainable, at the cost of exponential bandwidth expansion.

ex-cm-ch01-18

Hard

Using the CM-capacity formula and the definition of shaping gain, compute numerically the shaping gain at $\eta = 2, 4, 6, 8$ bits/2D for uniform square QAM: shaping gain at rate $\eta$ = $(E_s/N_0)$ at which Gaussian capacity $= \eta$ minus $(E_s/N_0)$ at which uniform $M$ -QAM CM capacity $= \eta$ . Tabulate and comment.

Show Hint

You will need to invert the CM capacity curve of a uniform QAM, which can be done numerically (bisection or Newton).

For large $\eta$ , shaping gain approaches the asymptotic $\pi e / 6 \approx 1.53$ dB.

Solution

Numerical evaluation

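Evaluating the CM-capacity integral for uniform QAM and comparing to Gaussian capacity gives the approximate shaping gaps tabulated below, where $M = 2^\eta$ (i.e., $M = 4, 16, 64, 256$ ) labels the nominal constellation size for each rate. Note that the CM capacity of $M$ -QAM reaches $\log_2 M$ only as $\text{SNR} \to \infty$ , so the inversion at rate $\eta$ is well-posed only for a constellation with $\log_2 M > \eta$ (a coded scheme on a larger QAM):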

$\eta$ (bits/2D)	$M$	Shaping gain (dB)
2	4	$\approx 0.02$
4	16	$\approx 0.20$
6	64	$\approx 0.65$
8	256	$\approx 1.05$

(The exact values depend on the rate definition and can shift $\pm 0.1$ dB; what matters is the trend.)

Comment

Shaping gain grows from essentially 0 at $\eta = 2$ to $\approx 1.05$ dB at $\eta = 8$ , still below the asymptotic $1.53$ dB. At $\eta = 8$ bits/2D (256-QAM), probabilistic shaping can recover about $1$ dB — this is the engineering motivation for PAS in modern high-throughput systems, since the 1 dB is not trivial when one is already within 1 dB of capacity via LDPC.
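The inversion described in the hint can be sketched as follows. The code makes several assumptions that are not in the exercise: square QAM is treated as two independent PAM dimensions (so the QAM capacity is twice the PAM capacity at the same SNR), the noise integral uses a trapezoid rule over $\pm 8\sigma$, and the constellation is chosen with $\log_2 M > \eta$ (here 8-PAM per dimension, i.e., 64-QAM, at $\eta = 4$) so that the target rate is reachable at finite SNR. Function names are illustrative, and no particular table entry is claimed to be reproduced.

```python
import math

def pam_points(m: int):
    """Equally spaced M-PAM points scaled to unit average energy."""
    raw = [2 * k - (m - 1) for k in range(m)]
    scale = math.sqrt(sum(p * p for p in raw) / m)
    return [p / scale for p in raw]

def pam_mi_bits(m: int, snr: float, steps: int = 200, span: float = 8.0) -> float:
    """I(X;Y) in bits per real dimension for uniform M-PAM, noise variance 1/snr."""
    pts = pam_points(m)
    var = 1.0 / snr
    sigma = math.sqrt(var)
    h = 2.0 * span * sigma / steps
    penalty = 0.0                                   # estimate of H(X|Y) in bits
    for k in range(steps + 1):
        n = -span * sigma + k * h
        weight = h * math.exp(-n * n / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)
        if k in (0, steps):
            weight *= 0.5                           # trapezoid end points
        for x in pts:
            # sum over x' of p(y|x') / p(y|x) with y = x + n
            s = sum(math.exp(-((x - xp) ** 2 + 2.0 * (x - xp) * n) / (2.0 * var))
                    for xp in pts)
            penalty += weight * math.log2(s) / m
    return math.log2(m) - penalty

def shaping_gap_db(eta: float, m_pam: int, lo_db: float = -10.0, hi_db: float = 50.0) -> float:
    """SNR (dB) where uniform QAM reaches eta bits/2D minus the Gaussian-capacity SNR."""
    target = eta / 2.0                              # bits per real dimension
    for _ in range(40):                             # bisection on SNR in dB
        mid = 0.5 * (lo_db + hi_db)
        if pam_mi_bits(m_pam, 10.0 ** (mid / 10.0)) < target:
            lo_db = mid
        else:
            hi_db = mid
    return 0.5 * (lo_db + hi_db) - 10.0 * math.log10(2.0 ** eta - 1.0)

print(f"gap at eta = 4 with 64-QAM: {shaping_gap_db(4, 8):.2f} dB")
```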

ex-cm-ch01-19

Challenge

Prove that the uniform input over a finite constellation $\mathcal{X}$ does not maximize the mutual information $I(X; Y)$ on AWGN at moderate SNR. Specifically, show that for 16-QAM at $\text{SNR} = 10$ dB, there exists a non-uniform distribution on $\mathcal{X}$ that strictly increases $I(X; Y)$ over the uniform case. (Hint: concentrate more probability on lower-energy points under an $E_s$ constraint.)

Show Hint

$I(X;Y)$ is concave in $P_X$ ; the maximizer satisfies KKT conditions on the simplex.

Write the KKT conditions and observe that the uniform distribution does not generically satisfy them under an $E_s$ constraint.

Construct a small perturbation that satisfies the constraint and increases $I$ .

Solution

KKT conditions for the maximizer

At the maximum of $I(X;Y)$ subject to $\sum_x p(x) = 1$ , $\sum_x p(x) \|x\|^2 \le E_s$ , and $p(x) \ge 0$ , the KKT conditions yield $D(p_{Y|x} \| p_Y) = \nu_1 + \nu_2 \|x\|^2$ for all $x$ with $p(x) > 0$ . The uniform distribution satisfies this only if the KL divergences are an affine function of $\|x\|^2$ , which generically fails for QAM.

Perturbation argument

Let $p_\epsilon(x) = p_{\rm uniform}(x) + \epsilon \delta(x)$ with $\sum_x \delta(x) = 0$ and $\sum_x \delta(x) \|x\|^2 = 0$ (to preserve both constraints). The first-order change in $I$ is $\sum_x \delta(x) D(p_{Y|x} \| p_Y) = \sum_x \delta(x) (D_x - \bar D)$ , where $D_x = D(p_{Y|x} \| p_Y)$ and $\bar D$ is its average over $\mathcal{X}$ . Since $D_x$ is not an affine function of $\|x\|^2$ across $\mathcal{X}$ (16-QAM has three energy classes, inner, edge, and corner points, whose $D_x$ values are generically not collinear in $\|x\|^2$ ), we can choose a $\delta$ satisfying both constraints that makes this first-order term strictly positive, proving that the uniform input is not locally optimal.

Numerical verification

At $\text{SNR} = 10$ dB, evaluating $I$ under a Maxwell-Boltzmann distribution with (optimized) parameter $\lambda$ gives an improvement over uniform worth approximately $0.15$ to $0.20$ dB of SNR, consistent with the $\eta \approx 4$ shaping-gain value from Exercise 18. $\blacksquare$

ex-cm-ch01-20

Challenge

Consider the following "engineer's question": a system must deliver $\eta = 5$ bits/2D on AWGN with a reliability $P_b = 10^{-6}$ . You have three independent knobs: (a) binary code rate $R_c \in (0, 1)$ , (b) QAM order $M \in \{16, 64, 256, 1024\}$ , and (c) the implementation of shaping (yes/no, yielding $\gamma_s \approx 0.6$ dB if yes).

Derive the minimum $E_b/N_0$ (dB) achievable over all combinations, using the separation $\gamma_{\rm total} = \gamma_{\rm coding} + \gamma_{\rm shaping} + \gamma_{\rm finite-block}$ with realistic values ( $\gamma_{\rm finite-block} \approx 0.5$ dB at block length $10^4$ , $\gamma_{\rm coding}$ up to the CM-capacity of the chosen QAM).

Show Hint

Require $R_c \log_2 M = \eta = 5$ , so valid $(R_c, M)$ are $(5/4, 16)$ [infeasible, $R_c > 1$ ], $(5/6, 64)$ , $(5/8, 256)$ , $(5/10, 1024)$ .

Compute CM capacity of each $M$ -QAM at $\eta = 5$ to identify the coding-gain budget.

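Add the $0.5$ dB finite-block penalty to the Shannon limit at $\eta = 5$ ; shaping can at most recover the uniform-input shaping loss, so it cannot bring the operating point below the Shannon limit.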

Solution

Shannon limit at $\eta = 5$

$(2^5 - 1)/5 = 6.2$ , i.e., $7.92$ dB. This is the ultimate target.

Feasible $(R_c, M)$

(5/4, 16): infeasible (code rate > 1). (5/6, 64): $R_c = 0.833$ ; (5/8, 256): $R_c = 0.625$ ; (5/10, 1024): $R_c = 0.5$ .
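The feasibility check is a one-line enumeration. A minimal sketch using exact fractions; the names `required_code_rate` and `feasible` are illustrative.

```python
from fractions import Fraction

ETA = 5                                   # target bits/2D
QAM_ORDERS = (16, 64, 256, 1024)

def required_code_rate(m: int, eta: int = ETA) -> Fraction:
    """Code rate R_c such that R_c * log2(M) = eta."""
    bits_per_symbol = m.bit_length() - 1  # exact log2 for powers of two
    return Fraction(eta, bits_per_symbol)

# Keep only combinations with R_c < 1
feasible = [(m, required_code_rate(m)) for m in QAM_ORDERS
            if required_code_rate(m) < 1]

for m, rate in feasible:
    print(f"M = {m}: R_c = {rate} = {float(rate):.3f}")
```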

CM capacity per QAM order

CM capacity saturates at $\log_2 M$ . For $\eta = 5$ to be below saturation, we need $\log_2 M > 5$ , i.e., $M \ge 64$ . At $\eta = 5$ :

$M = 64$ : CM capacity reached at $\approx 8.0$ dB (very close to Shannon since rate-budget is tight)
$M = 256$ : $\approx 7.95$ dB
$M = 1024$ : $\approx 7.93$ dB

All are within 0.1 dB of Shannon because the uniform-input shaping loss at $\eta = 5$ is small.

Layer shaping and FB residual

Shaping gain can only recover the shaping loss of the uniform constellation; it cannot move the operating point below the Shannon limit. With probabilistic amplitude shaping the shaping loss is (ideally) removed, so with $\gamma_{\rm FB} \approx 0.5$ dB the system operates at roughly $7.92 + 0.5 = 8.42$ dB. Without shaping, the uniform-input loss adds to the finite-block penalty: with a $0.2$ dB loss the gap is $0.5 + 0.2 = 0.7$ dB above Shannon, i.e., $\approx 8.6$ dB; with the $\gamma_s \approx 0.6$ dB quoted in the problem statement it is $\approx 9.0$ dB.

Engineering recommendation

Choose $M = 256$ with $R_c = 5/8$ (a common LDPC rate) and PAS shaping. This lands at $\approx 8.4$ dB, about $0.5$ dB above the $7.92$ dB Shannon limit at $\eta = 5$ , which is about as close as state-of-the-art systems get. The choice of $M = 256$ balances flexibility (support for lower MCS without reloading), constellation complexity, and shaping gain availability. $M = 1024$ is worth considering only if the block length is very large and the hardware supports 1024-QAM; in the target operating regime the difference between the two is within 0.1 dB.
