Exercises
ex-ch26-01
Easy. Compute the capacity and dispersion of the BEC with erasure probability $\epsilon$. Determine the blocklength $n$ needed to support rate $R$ bits/use at target error probability $\epsilon_{\rm err}$.
BEC capacity: $C = 1-\epsilon$. BEC dispersion: $V = \epsilon(1-\epsilon)$.
Use $R \approx C - \sqrt{V/n}\,Q^{-1}(\epsilon_{\rm err})$ and solve for $n$.
Capacity and dispersion
$C = 1-\epsilon$ bits/use. $V = \epsilon(1-\epsilon)$ bits$^2$: the information density equals $1$ bit with probability $1-\epsilon$ (no erasure) and $0$ with probability $\epsilon$, a Bernoulli variance.
Required blocklength
$R \approx C - \sqrt{V/n}\,Q^{-1}(\epsilon_{\rm err})$. Solving for $n$: $n \approx \dfrac{V\,\big[Q^{-1}(\epsilon_{\rm err})\big]^2}{(C-R)^2}$.
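The solution above can be checked numerically. A minimal sketch; the erasure probability, rate, and target error probability below are assumed example values, not taken from the exercise:

```python
import math
from statistics import NormalDist

# Normal-approximation blocklength for the BEC.
# eps_erasure, R, eps_err are assumed example values.
eps_erasure, R, eps_err = 0.5, 0.4, 1e-3

C = 1 - eps_erasure                       # capacity, bits/use
V = eps_erasure * (1 - eps_erasure)       # dispersion, bits^2
Qinv = NormalDist().inv_cdf(1 - eps_err)  # Q^{-1}(eps_err)

# Solve R = C - sqrt(V/n) * Q^{-1}(eps_err) for n
n = V * Qinv**2 / (C - R) ** 2
print(f"C = {C:.2f} bits/use, V = {V:.2f} bits^2, n = {math.ceil(n)} channel uses")
```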
ex-ch26-02
Easy. Show that the information density of the BSC with crossover probability $\delta$ at the capacity-achieving uniform input is (in bits, with $\bar\delta = 1-\delta$): $i(x;y) = 1 + \log_2\bar\delta$ if $y = x$, and $i(x;y) = 1 + \log_2\delta$ if $y \ne x$.
$i(x;y) = \log_2\frac{P_{Y|X}(y|x)}{P_Y(y)}$, where $P_{Y|X}(y|x) = \bar\delta$ when $y = x$ and $\delta$ when $y \ne x$.
The output distribution under uniform input is $P_Y(y) = \tfrac12$ for all $y$.
Output distribution
With $P_X$ uniform: $P_Y(0) = \tfrac12\bar\delta + \tfrac12\delta = \tfrac12$. Similarly $P_Y(1) = \tfrac12$.
Information density
When $y = x$: $i = \log_2\frac{\bar\delta}{1/2} = 1 + \log_2\bar\delta$. When $y \ne x$: $i = \log_2\frac{\delta}{1/2} = 1 + \log_2\delta$.
Verification
$\mathbb E[i] = \bar\delta\,(1+\log_2\bar\delta) + \delta\,(1+\log_2\delta) = 1 - h(\delta) = C$. Correct.
ex-ch26-03
Easy. For the AWGN channel at SNR $= 20$ dB, compute the maximum coding rate $R^*(n,\epsilon)$ at blocklength $n$ and error probability $\epsilon$. How close is this to the Shannon capacity?
At SNR = 20 dB (= 100 linear): $C = \tfrac12\log_2(1+100)$ bits/use.
Dispersion: $V = \frac{P(P+2)}{2(P+1)^2}$ in nats$^2$.
Capacity
$C = \tfrac12\log_2(1+100) = 3.33$ bits/use.
Dispersion
$V = \frac{100\cdot 102}{2\cdot 101^2} = 0.49995$ nats$^2$, i.e. $V\,(\log_2 e)^2 = 1.04$ bits$^2$.
Normal approximation
$R^*(n,\epsilon) \approx C - \sqrt{V/n}\,Q^{-1}(\epsilon)$ bits/use (with $V$ in bits$^2$). The gap to capacity is $\sqrt{V/n}\,Q^{-1}(\epsilon)$; at high SNR this gap is modest relative to $C$, since $V$ saturates while $C$ keeps growing with SNR.
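A quick numerical check of the computation above; the blocklength and error probability are assumed example values:

```python
import math
from statistics import NormalDist

# Normal approximation for the real AWGN channel at 20 dB.
# Blocklength n and error probability eps are assumed example values.
snr_db, n, eps = 20.0, 1000, 1e-3
P = 10 ** (snr_db / 10)                    # 100 in linear scale

C_bits = 0.5 * math.log2(1 + P)            # capacity, bits/use
V_nats = P * (P + 2) / (2 * (P + 1) ** 2)  # dispersion, nats^2
V_bits = V_nats * math.log2(math.e) ** 2   # convert to bits^2

R = C_bits - math.sqrt(V_bits / n) * NormalDist().inv_cdf(1 - eps)
print(f"C = {C_bits:.3f} bits/use, R*(n={n}, eps={eps}) = {R:.3f} bits/use")
```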
ex-ch26-04
Easy. Verify that the AWGN dispersion $V(P) = \frac{P(P+2)}{2(P+1)^2}$ satisfies $V(P) \approx P$ as $P \to 0$ and $V(P) \to \tfrac12$ as $P \to \infty$ (in nats$^2$).
Take limits directly.
Low SNR limit
As $P \to 0$: $V(P) = \frac{P(P+2)}{2(P+1)^2} \approx \frac{2P}{2} = P \to 0$.
High SNR limit
As $P \to \infty$: $V(P) = \frac{1+2/P}{2(1+1/P)^2} \to \tfrac12$.
Interpretation
At low SNR, the channel output is almost all noise and very little signal, so the information density has small variance (and small mean). At high SNR, the variance saturates at $\tfrac12$, which is the variance of $\tfrac12(1-Z^2)$ when $Z \sim \mathcal N(0,1)$.
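The two limits can also be verified numerically in a few lines:

```python
# Numeric check of the low- and high-SNR limits of V(P) = P(P+2) / (2(P+1)^2).
def V(P):
    return P * (P + 2) / (2 * (P + 1) ** 2)

for P in (1e-4, 1e-6):
    print(f"P={P:g}: V/P = {V(P) / P:.4f}")  # ratio -> 1 as P -> 0
for P in (1e3, 1e6):
    print(f"P={P:g}: V   = {V(P):.5f}")      # -> 0.5 as P -> infinity
```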
ex-ch26-05
Easy. A binary communication system uses a BSC with $\delta = 0.05$. How many channel uses are needed to transmit $k$ information bits with error probability $\epsilon$?
BSC(0.05): $C = 1 - h(0.05) = 0.714$ bits/use, $V = \delta(1-\delta)\log_2^2\frac{1-\delta}{\delta} = 0.857$ bits$^2$.
Need $nC - \sqrt{nV}\,Q^{-1}(\epsilon) \ge k$. Iterate on $n$ or solve approximately.
Channel parameters
$C = 1 - h(0.05) = 0.714$ bits/use. $V = 0.05\cdot 0.95\cdot\log_2^2(19) = 0.0475\cdot(4.248)^2 = 0.857$ bits$^2$.
Required blocklength
Need the smallest $n$ with $nC - \sqrt{nV}\,Q^{-1}(\epsilon) \ge k$.
Treating the constraint as a quadratic in $\sqrt{n}$ gives $\sqrt{n} = \dfrac{\sqrt{V}\,Q^{-1}(\epsilon) + \sqrt{V\,[Q^{-1}(\epsilon)]^2 + 4Ck}}{2C}$.
Round up and verify by direct evaluation: if $nC - \sqrt{nV}\,Q^{-1}(\epsilon) < k$, increment $n$. The smallest $n$ passing the check is the required number of channel uses.
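A sketch of this computation; the crossover probability $0.05$ is from the exercise, while the payload $k$ and target $\epsilon$ are assumed example values:

```python
import math
from statistics import NormalDist

# Blocklength for BSC(0.05) via the normal approximation.
# k and eps are assumed example values.
delta, k, eps = 0.05, 1000, 1e-3

C = 1 - (-delta * math.log2(delta) - (1 - delta) * math.log2(1 - delta))
V = delta * (1 - delta) * math.log2((1 - delta) / delta) ** 2
Qinv = NormalDist().inv_cdf(1 - eps)

# Smallest n with n*C - sqrt(n*V)*Qinv >= k: quadratic in sqrt(n)
sqrt_n = (math.sqrt(V) * Qinv + math.sqrt(V * Qinv**2 + 4 * C * k)) / (2 * C)
n = math.ceil(sqrt_n**2)
achieved = n * C - math.sqrt(n * V) * Qinv   # payload supported at this n
print(f"C={C:.4f}, V={V:.4f}, n={n}, bits supported = {achieved:.1f}")
```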
ex-ch26-06
Medium. Derive the dispersion of the BSC from the information density. Specifically, show that $V = \delta(1-\delta)\log^2\frac{1-\delta}{\delta}$ (in the base matching the logarithm used for capacity).
The information density takes two values: $1+\log_2(1-\delta)$ (no error) and $1+\log_2\delta$ (error).
Use $\mathrm{Var}[i] = \mathbb E[i^2] - (\mathbb E[i])^2$, or the two-point variance formula.
Information density values
With base-2 logarithm and uniform input: $i = 1 + \log_2(1-\delta)$ with probability $1-\delta$, and $i = 1 + \log_2\delta$ with probability $\delta$.
Variance
A two-point distribution taking value $a$ with probability $1-\delta$ and $b$ with probability $\delta$ has variance $\delta(1-\delta)(a-b)^2$. Here $a = 1+\log_2(1-\delta)$ and $b = 1+\log_2\delta$, so $a - b = \log_2\frac{1-\delta}{\delta}$.
Therefore $V = \delta(1-\delta)\log_2^2\frac{1-\delta}{\delta}$.
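A direct numerical confirmation of the two-point-variance derivation, for an arbitrary assumed $\delta$:

```python
import math

# Check that Var[i(X;Y)] for the BSC equals delta(1-delta)*log2^2((1-delta)/delta).
delta = 0.11   # assumed crossover probability, for illustration

a = 1 + math.log2(1 - delta)   # i when y = x, probability 1-delta
b = 1 + math.log2(delta)       # i when y != x, probability delta

mean = (1 - delta) * a + delta * b   # equals C = 1 - h(delta)
var_direct = (1 - delta) * (a - mean) ** 2 + delta * (b - mean) ** 2
var_formula = delta * (1 - delta) * math.log2((1 - delta) / delta) ** 2

print(f"E[i]={mean:.4f} (=C), Var direct={var_direct:.4f}, formula={var_formula:.4f}")
```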
ex-ch26-07
Medium. Show that the RCU bound for the AWGN channel with a Gaussian codebook gives:
$\epsilon \le e^{-nE_r(R) + o(n)}$, where $E_r(R)$ coincides with the sphere-packing exponent $E_{sp}(R)$ for rates above the critical rate, in the limit of large $n$. Explain why this recovers the error exponent at rates below capacity.
The confusion probability is related to the probability that a random codeword falls inside a sphere around the received signal.
For Gaussian codes, this involves the chi-squared distribution.
Confusion probability
For a Gaussian codebook and ML decoding, the probability that a random codeword $\bar X^n$ has higher likelihood than the true codeword given output $y^n$ is $\Pr\big[\|\bar X^n - y^n\|^2 \le \|x^n - y^n\|^2\big]$.
Since $\bar X^n \sim \mathcal N(0, P I_n)$ independent of $y^n$, both squared norms are (scaled, possibly non-central) chi-squared variables with $n$ degrees of freedom.
Large-$n$ asymptotics
For large $n$, by the law of large numbers and Cramér's theorem, the confusion probability decays exponentially with an exponent given by the large-deviation rate function of these chi-squared variables. Substituting into the RCU bound $\epsilon \le \mathbb E\big[\min\{1,\,(M-1)\Pr[\text{confusion}\mid X^n,Y^n]\}\big]$ and optimizing yields $\epsilon \le e^{-nE_r(R)+o(n)}$, where $E_r(R)$ is the random-coding exponent. For rates between the critical rate and capacity, $E_r(R) = E_{sp}(R)$, so the RCU bound achieves the optimal error exponent in that range.
ex-ch26-08
Medium. For the AWGN channel, compute the SNR penalty (in dB) of operating at finite blocklength $(n,\epsilon)$ versus at capacity, for SNR between 0 and 20 dB. Plot the penalty as a function of SNR.
At each SNR, compute $R^*(n,\epsilon)$, then find the higher SNR at which $R^*(n,\epsilon)$ equals the original capacity.
The penalty is the ratio of the two SNRs, expressed in dB.
At SNR = 0 dB
$C = 1.00$ bit/use, $R^*(n,\epsilon) = 0.67$ bits/use. The SNR at which $R^*$ reaches $1.00$ bit/use is $1.8$ dB, so the penalty is $1.8$ dB.
At SNR = 10 dB
$C = 3.46$ bits/use, $R^*(n,\epsilon) = 3.08$ bits/use. The SNR at which $R^*$ reaches $3.46$ bits/use is $18.5$ dB, so the penalty is $8.5$ dB. The penalty is large here because the slope of $C$ versus SNR is small at high SNR, so recovering even a modest rate gap costs many dB.
Summary table
| SNR (dB) | $C$ (bits/use) | $R^*(n,\epsilon)$ (bits/use) | Penalty (dB) |
|---|---|---|---|
| 0 | 1.00 | 0.67 | 1.8 |
| 5 | 1.46 | 1.08 | 2.7 |
| 10 | 3.46 | 3.08 | 8.5 |
| 20 | 6.66 | 6.21 | large |
The penalty increases with SNR because $\log_2(1+\mathrm{SNR})$ flattens out.
ex-ch26-09
Medium. Prove that the Neyman-Pearson $\beta$ function satisfies $-\frac1n\log\beta_\alpha(P^{\otimes n}, Q^{\otimes n}) \to D(P\|Q)$ as $n \to \infty$, provided $\alpha \in (0,1)$ is fixed.
This is Stein's lemma. Use the fact that the normalized log-likelihood ratio concentrates around $D(P\|Q)$ under $P$.
Optimal test
The Neyman-Pearson test compares $T_n = \sum_{i=1}^n \log\frac{P(X_i)}{Q(X_i)}$ to a threshold $\gamma_n$. Under $P$: $\frac1n T_n \to D(P\|Q)$ a.s. by the SLLN.
Type-II error analysis
Under $P$, keeping the power $\Pr_P[T_n \ge \gamma_n] \ge \alpha$ with $\alpha$ fixed forces the threshold to satisfy $\gamma_n = nD(P\|Q) + O(\sqrt n)$ (CLT correction).
Exponential decay
Under $Q$: by change of measure, $\beta = \Pr_Q[T_n \ge \gamma_n] = \mathbb E_P\big[e^{-T_n}\mathbf 1\{T_n \ge \gamma_n\}\big] \le e^{-\gamma_n}$, and a matching exponential lower bound follows from Cramér's theorem. Therefore $-\frac1n\log\beta_\alpha \to D(P\|Q)$.
ex-ch26-10
Medium. For a Rayleigh fading channel where $H$ is known at the receiver, $h = |H|^2 \sim \mathrm{Exp}(1)$, and average SNR $\rho$:
(a) Compute the ergodic capacity $C = \mathbb E_h[\log(1+\rho h)]$.
(b) Show that the dispersion includes a fading term: $V = \mathbb E_h[V_{\rm awgn}(\rho h)] + \mathrm{Var}_h[\log(1+\rho h)]$.
The channel is conditionally AWGN given $h$. Use the law of total variance.
$\mathrm{Var}_h[\log(1+\rho h)]$ involves the second moment of the logarithm of an exponential variable.
Capacity
$C = \mathbb E[\ln(1+\rho h)] = \int_0^\infty \ln(1+\rho x)\,e^{-x}\,dx = e^{1/\rho}E_1(1/\rho)$ (in nats), where $E_1$ is the exponential integral.
Dispersion decomposition
By the law of total variance: $V = \mathbb E_h[\mathrm{Var}(i \mid h)] + \mathrm{Var}_h[\mathbb E(i \mid h)] = \mathbb E_h[V_{\rm awgn}(\rho h)] + \mathrm{Var}_h[\log(1+\rho h)]$.
The first term is the average AWGN dispersion across fading realizations. The second term is the fading dispersion: the randomness of the conditional capacity across fading states. For Rayleigh fading, this second term dominates at high SNR, where $\mathrm{Var}_h[\ln(1+\rho h)] \to \mathrm{Var}[\ln h] = \pi^2/6$ while the conditional AWGN term stays bounded.
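The closed-form capacity and the variance decomposition can be sanity-checked by Monte Carlo. The SNR $\rho = 10$ and the complex-channel convention (capacity $\ln(1+P)$, dispersion $1-(1+P)^{-2}$ nats$^2$) are assumptions for illustration:

```python
import math, random, statistics

# Monte Carlo check: E[ln(1 + rho*h)] = e^{1/rho} E1(1/rho) for h ~ Exp(1),
# plus the two terms of the law-of-total-variance split of the dispersion.
random.seed(1)
rho, N = 10.0, 200_000   # assumed SNR and sample count

def expint_e1(x):
    """E1(x) via the convergent series -gamma - ln(x) + sum (-1)^{k+1} x^k / (k k!)."""
    gamma = 0.5772156649015329
    s, term = 0.0, 1.0
    for k in range(1, 40):
        term *= -x / k            # term = (-x)^k / k!
        s += -term / k            # adds (-1)^{k+1} x^k / (k * k!)
    return -gamma - math.log(x) + s

h = [random.expovariate(1.0) for _ in range(N)]   # |H|^2 ~ Exp(1)
cond_cap = [math.log(1 + rho * x) for x in h]     # conditional capacity, nats

C_mc = statistics.fmean(cond_cap)
C_closed = math.exp(1 / rho) * expint_e1(1 / rho)

awgn_term = statistics.fmean(1 - 1 / (1 + rho * x) ** 2 for x in h)  # E[V_awgn]
fading_term = statistics.pvariance(cond_cap)                          # Var[ln(1+rho h)]
print(f"C: MC {C_mc:.3f} vs closed form {C_closed:.3f} nats")
print(f"V = {awgn_term:.3f} (AWGN part) + {fading_term:.3f} (fading part)")
```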
ex-ch26-11
Medium. Consider the Z-channel with $P(0\mid 0) = 1$, $P(0\mid 1) = \delta$, $P(1\mid 1) = 1-\delta$. Compute the capacity-achieving input distribution, the capacity, and the dispersion as functions of $\delta$.
The capacity-achieving input is $P_X(1) = p^\star$, where $p^\star$ satisfies a transcendental equation.
The information density has three possible values depending on $(x,y)$.
Capacity-achieving input
Maximizing $I(X;Y)$ over $p = P_X(1)$ gives a transcendental stationarity condition for the optimal $p^\star$; the resulting capacity has the closed form $C = \log_2\!\big(1 + (1-\delta)\,\delta^{\delta/(1-\delta)}\big)$ bits/use.
Dispersion computation
The information density takes three values depending on $(x,y)$, with $P_Y(1) = p^\star(1-\delta)$ and $P_Y(0) = 1 - p^\star(1-\delta)$: $(0,0)$: $i = \log_2\frac{1}{P_Y(0)}$ with probability $1-p^\star$; $(1,0)$: $i = \log_2\frac{\delta}{P_Y(0)}$ with probability $p^\star\delta$; $(1,1)$: $i = \log_2\frac{1-\delta}{P_Y(1)}$ with probability $p^\star(1-\delta)$.
$V = \mathrm{Var}[i(X;Y)]$ over this three-point distribution; evaluate numerically for the given $\delta$.
ex-ch26-12
Medium. A URLLC system operates over a quasi-static Rayleigh fading channel (one fading realization per codeword). Show that the outage probability lower-bounds the error probability at any blocklength:
$\epsilon(n,R) \gtrsim P_{\rm out}(R) = \Pr[\log(1+\rho|H|^2) < R]$, and that no finite-blocklength code can beat the outage probability in this regime.
In quasi-static fading, the channel realization is fixed for the entire codeword.
Conditioned on $H = h$, the channel is AWGN with SNR $\rho|h|^2$.
Conditional analysis
Given $H = h$, the channel is AWGN with capacity $C(h) = \log(1+\rho|h|^2)$. If $C(h) < R$, then by the strong converse for the AWGN channel, no code of rate $R$ can achieve small error probability on this realization: the conditional error probability tends to 1 as $n$ grows.
Unconditional bound
$\epsilon(n,R) = \mathbb E_H\big[\epsilon(n,R \mid H)\big] \ge \Pr[C(H) < R]\cdot\min_{h:\,C(h)<R}\epsilon(n,R \mid h)$, which tends to $P_{\rm out}(R)$.
A tighter argument using the meta-converse shows $\epsilon(n,R) \ge P_{\rm out}(R) - O(1/\sqrt n)$, confirming that outage dominates at all finite blocklengths.
Implications
In quasi-static fading, diversity is essential: without it, the error probability is fundamentally limited by the outage probability, regardless of code length. The normal approximation is not useful here because the fading contribution to the "dispersion" does not average out: the variance of $\frac1n\sum_i i(X_i;Y_i)$ does not shrink with $n$ when $H$ is constant over the block.
ex-ch26-13
Hard. Prove the achievability part of the normal approximation using the Berry-Esseen theorem. Specifically, show that for a DMC with capacity $C$, dispersion $V > 0$, and finite third absolute moment $T$ of the information density:
$\log M^*(n,\epsilon) \ge nC - \sqrt{nV}\,Q^{-1}(\epsilon) - c\log n$ for a universal constant $c$.
Start with the RCU bound and bound the confusion probability using the CLT + Berry-Esseen.
The Berry-Esseen theorem gives $\sup_t\big|\Pr\big[\tfrac{S_n - n\mu}{\sigma\sqrt n} \le t\big] - \Phi(t)\big| \le \frac{B\,T}{\sigma^3\sqrt n}$ for a universal constant $B$.
RCU bound simplification
From the DT/RCU bound: there exists a code with error probability at most $\Pr\big[\sum_{i=1}^n i(X_i;Y_i) \le \log M + \gamma\big] + 2^{-\gamma}$, where $\gamma$ is chosen to control the second term.
Berry-Esseen application
Set $\log M = nC - \sqrt{nV}\,Q^{-1}(\epsilon_n) - \gamma$ with $\epsilon_n = \epsilon - \frac{BT}{V^{3/2}\sqrt n} - 2^{-\gamma}$ and $\gamma = \log_2 n$. By Berry-Esseen, $\Pr\big[\sum_i i \le \log M + \gamma\big] \le \epsilon_n + \frac{BT}{V^{3/2}\sqrt n}$, so the total error is at most $\epsilon$.
Rate bound
Thus $\log M \ge nC - \sqrt{nV}\,Q^{-1}(\epsilon_n) - \log_2 n$, and $Q^{-1}(\epsilon_n) = Q^{-1}(\epsilon) + O(1/\sqrt n)$ by Taylor expansion. Dividing by $n$: $\frac{\log M}{n} \ge C - \sqrt{V/n}\,Q^{-1}(\epsilon) - O\big(\tfrac{\log n}{n}\big)$. The precise third-order term involves $\tfrac12\log n$.
ex-ch26-14
Hard. Derive the meta-converse bound for the BSC at blocklength $n$ and error probability $\epsilon$. Compute the resulting bound on the maximum code size by evaluating $\beta_{1-\epsilon}$ numerically.
For the BSC with uniform $Q_Y$, the $\beta$ function can be computed exactly using the binomial distribution.
The likelihood ratio between $P_{Y|X=x^n}$ and $Q_Y$ is a function of the Hamming distance.
Likelihood ratio
For BSC($\delta$) with transmitted $x^n$, received $y^n$, and $Q_Y$ uniform: $\log_2\frac{P_{Y|X}(y^n\mid x^n)}{Q_Y(y^n)} = n + d\log_2\delta + (n-d)\log_2(1-\delta)$, where $d = d_H(x^n,y^n)$ is the Hamming distance. For $\delta < \tfrac12$ this is decreasing in $d$.
Neyman-Pearson test
Under $P_{Y|X=x^n}$: $d \sim \mathrm{Bin}(n,\delta)$. Under $Q_Y$: $d \sim \mathrm{Bin}(n,\tfrac12)$. The optimal test accepts when $d \le t$ for some threshold $t$ (randomizing at the boundary if needed). Set $t$ so that $\Pr[\mathrm{Bin}(n,\delta) \le t] \ge 1-\epsilon$.
Meta-converse bound
$\log_2 M^*(n,\epsilon) \le -\log_2\beta_{1-\epsilon}\big(P_{Y|X=x^n}, Q_Y\big)$.
For given $n$, $\delta$, $\epsilon$: find the smallest $t$ such that $\Pr[\mathrm{Bin}(n,\delta)\le t] \ge 1-\epsilon$, then evaluate $\beta = 2^{-n}\sum_{d=0}^{t}\binom nd$.
The bound is $\log_2 M^* \le -\log_2\beta$ bits; randomizing at the threshold tightens it slightly.
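A sketch of the numerical evaluation; the parameters $n$, $\delta$, $\epsilon$ below are assumed example values:

```python
import math

# Meta-converse for the BSC via the Neyman-Pearson beta function with Q_Y uniform.
# n, delta, eps are assumed example values, not from the exercise.
n, delta, eps = 500, 0.11, 1e-3

def binom_cdf(n, p, t):
    """P[Bin(n, p) <= t], computed exactly with math.comb."""
    return sum(math.comb(n, d) * p**d * (1 - p)**(n - d) for d in range(t + 1))

# Smallest threshold t with P[Bin(n, delta) <= t] >= 1 - eps
t = 0
while binom_cdf(n, delta, t) < 1 - eps:
    t += 1

# beta_{1-eps} = P[Bin(n, 1/2) <= t]; the code-size bound is -log2(beta)
beta = sum(math.comb(n, d) for d in range(t + 1)) / 2**n
bound_bits = -math.log2(beta)
print(f"t = {t}, meta-converse bound: log2 M* <= {bound_bits:.1f} bits")
```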
ex-ch26-15
Hard. Show that for a channel with zero dispersion ($V = 0$), the finite-blocklength rate converges to capacity at rate $O\big(\tfrac{\log n}{n}\big)$ instead of $O(1/\sqrt n)$. Give an example of such a channel.
A channel has $V = 0$ iff the information density is deterministic: $i(X;Y) = C$ with probability 1.
The noiseless binary channel has $V = 0$.
Deterministic information density
If $V = 0$, then $i(X;Y) = C$ with probability 1 for each symbol. The cumulative information density equals $nC$ deterministically. There is no CLT correction; the convergence to capacity is determined by the $O(\log n)$ term in the finite-blocklength expansion.
Example: noiseless channel
The noiseless binary channel ($Y = X$) has $C = 1$ bit/use and $i = 1$ bit deterministically under uniform input, so $V = 0$. At any finite $n$: $\log_2 M^*(n,\epsilon) \ge n$ for any $\epsilon \ge 0$, with equality at $\epsilon = 0$.
BEC as intermediate case
The BEC has $V = \epsilon(1-\epsilon) > 0$ (the erasures create randomness in $i$), but as $\epsilon \to 0$ it approaches zero-dispersion behavior. Geometrically, $V = 0$ means the typical set has extremely sharp boundaries, so random coding works almost immediately.
ex-ch26-16
Hard. For the two-user Gaussian MAC with powers $P_1$ and $P_2$, compute the full $3\times 3$ dispersion matrix $\mathbf V$ at the capacity-achieving Gaussian inputs. Verify that the $(3,3)$ entry equals the point-to-point dispersion at SNR $P_1 + P_2$.
The three information densities are: $i_1 = i(X_1; Y \mid X_2)$, $i_2 = i(X_2; Y \mid X_1)$, $i_3 = i(X_1, X_2; Y)$.
$X_1$ and $X_2$ are independent Gaussian inputs; condition on one to analyze the other.
Individual rate densities
$i_1$ is the AWGN information density at SNR $P_1$ (with $X_2$ known and subtracted). $V_{11} = V(P_1) = \frac{P_1(P_1+2)}{2(P_1+1)^2}$ nats$^2$.
Sum-rate density
$i_3$ is the AWGN information density at SNR $P_1 + P_2$. $V_{33} = V(P_1+P_2)$ nats$^2$.
Cross-correlations and matrix
The off-diagonal entries $V_{12}$, $V_{13}$, $V_{23}$ are computed from the joint distribution of $(X_1, X_2, Z)$. Since $i_3 = i(X_1;Y) + i_2$ by the chain rule, the correlations are non-zero.
The full matrix $\mathbf V$ has entries computable from the moments of $(X_1, X_2, Z)$. The $(3,3)$ entry is $V(P_1+P_2) = \frac{(P_1+P_2)(P_1+P_2+2)}{2(P_1+P_2+1)^2}$ nats$^2$, confirming that it equals the point-to-point dispersion at SNR $P_1+P_2$.
ex-ch26-17
Hard. Saddlepoint approximation. The normal approximation can be refined using the saddlepoint method. For the AWGN channel, the cumulant generating function of the information density is $\kappa(s) = \log\mathbb E\big[e^{s\,i(X;Y)}\big]$.
(a) Compute $\kappa(s)$ for the real AWGN channel with Gaussian input.
(b) Show that the saddlepoint approximation gives $\Pr\big[\sum_i i(X_i;Y_i) \le n\gamma\big] \approx \frac{e^{n[\kappa(\hat s) - \hat s\,\kappa'(\hat s)]}}{|\hat s|\sqrt{2\pi n\,\kappa''(\hat s)}}$, where $\hat s$ is the saddlepoint satisfying $\kappa'(\hat s) = \gamma$.
For Gaussian variables, the cumulant generating function involves the moment generating function of chi-squared distributions.
The saddlepoint approximation is more accurate than the normal approximation at the tails.
CGF for AWGN
The information density for AWGN with Gaussian input is a quadratic form in Gaussian variables. Its CGF follows from the Gaussian MGF $\mathbb E[e^{aZ^2 + bZ}] = (1-2a)^{-1/2}\,e^{b^2/(2(1-2a))}$ for $a < \tfrac12$, yielding a closed form in $s$ and the SNR (up to constants depending on the SNR parametrization).
Saddlepoint equation
The saddlepoint $\hat s$ satisfies $\kappa'(\hat s) = \gamma$, where $\gamma$ is the threshold in the hypothesis test. This gives a more accurate approximation than the CLT because it accounts for the skewness and higher cumulants of the information density.
Practical impact
The saddlepoint approximation matches the exact RCU/MC bounds to within about 0.01 bits/use at moderate blocklengths, compared to 0.05-0.1 bits/use for the normal approximation. This makes it the tool of choice for precise finite-blocklength analysis in system design.
ex-ch26-18
Challenge. Research-level. Derive the second-order coding rate for the Gaussian MIMO channel with $H$ known at the receiver. Show that the dispersion is:
$V = \sum_{j=1}^{r}\frac{\rho_j(\rho_j+2)}{2(\rho_j+1)^2}$ (nats$^2$), where $\rho_j = \sigma_j^2 P_j$ are the per-stream SNRs after water-filling, $P_j$ are the water-filling powers, and $\sigma_j$ are the singular values of $H$.
After an SVD of $H$, the MIMO channel decomposes into parallel AWGN channels.
The total information density is the sum of per-stream information densities, which are independent.
The dispersion of a sum of independent variables is the sum of individual dispersions.
SVD decomposition
Write $H = U\Sigma V^\dagger$ and transform to parallel channels: $\tilde Y_j = \sigma_j\tilde X_j + \tilde Z_j$ for $j = 1,\dots,r$, with water-filling power $P_j = (\mu - 1/\sigma_j^2)^+$.
Per-stream dispersion
Each parallel channel has AWGN dispersion $V(\rho_j) = \frac{\rho_j(\rho_j+2)}{2(\rho_j+1)^2}$ nats$^2$, with $\rho_j = \sigma_j^2 P_j$.
Total dispersion
Since the streams are independent (after the unitary transformation): $V = \sum_{j=1}^r V(\rho_j)$. The normal approximation gives: $R^*(n,\epsilon) \approx \sum_{j=1}^r \tfrac12\log(1+\rho_j) - \sqrt{V/n}\,Q^{-1}(\epsilon)$.
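The water-filling allocation and the dispersion sum can be sketched in a few lines; the singular values and power budget are assumed examples:

```python
import math

# Water-filling over the parallel channels from the SVD of H, then the
# dispersion as a sum of per-stream real-AWGN dispersions (nats^2).
sigma = [2.0, 1.0, 0.5]   # singular values of H (assumed)
P_tot = 4.0               # total power budget (assumed)

def water_fill(sigma, P_tot, iters=100):
    """Bisection on the water level mu; returns per-stream powers."""
    inv = [1 / s**2 for s in sigma]
    lo, hi = 0.0, P_tot + max(inv)
    for _ in range(iters):
        mu = (lo + hi) / 2
        used = sum(max(mu - v, 0.0) for v in inv)
        if used > P_tot:
            hi = mu
        else:
            lo = mu
    return [max(mu - v, 0.0) for v in inv]

P = water_fill(sigma, P_tot)
rho = [s**2 * p for s, p in zip(sigma, P)]               # per-stream SNRs
C = sum(0.5 * math.log(1 + r) for r in rho)              # capacity, nats/use
V = sum(r * (r + 2) / (2 * (r + 1) ** 2) for r in rho)   # dispersion, nats^2
print(f"powers={['%.3f' % p for p in P]}  C={C:.3f} nats  V={V:.3f} nats^2")
```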
ex-ch26-19
Challenge. Open-problem flavor. The meta-converse for the fading MAC with $K$ users and blocklength $n$ is not fully characterized when $K$ grows with $n$ (the many-access regime). Formulate the problem for $K_n$ active users, each sending $k$ bits, and derive an achievability bound using random Gaussian codebooks.
In the many-access regime, the traditional MAC capacity region is not the right benchmark.
Use the per-user error probability (PUPE) as the performance metric.
The required energy-per-bit remains bounded as $K_n$ grows proportionally with $n$.
System model
$K_n$ active users, each with $k$ bits, blocklength $n$, over the AWGN MAC. Each user transmits $x_j^n = c_j(w_j)$, where the codebook of user $j$ consists of $2^k$ i.i.d. Gaussian codewords of power $P$.
Per-user achievability
Using an RCU-type bound for the Gaussian MAC with a union over the number $t$ of misdecoded users, the PUPE satisfies $P_e \le \sum_{t=1}^{K_n}\frac{t}{K_n}\min\big\{1,\ \binom{K_n}{t}(2^k-1)^t\,p_t\big\}$, where $p_t$ is the probability that a particular set of $t$ incorrect codewords is more likely than the transmitted ones.
Required energy-per-bit
Setting $P_e \le \epsilon$ and solving for the per-user power $P$ shows that $P \to 0$ as $n \to \infty$ with the user density $K_n/n$ fixed: each user's rate $k/n \to 0$, so vanishing power per user suffices.
However, the per-user energy $nP$ must stay bounded away from zero: the energy-per-bit $E_b/N_0 = nP/(2k)$ must exceed a threshold determined by $\epsilon$ and the user density, so the many-access limit is governed by a finite minimal energy-per-bit rather than by a capacity region.
ex-ch26-20
Challenge. Implementation project. Write a simulation that computes the exact RCU bound and meta-converse for the BSC with crossover probability $\delta$ over a grid of blocklengths $n$ and error probabilities $\epsilon$.
(a) Plot the exact bounds alongside the normal approximation.
(b) Quantify the gap between the normal approximation and the exact bounds.
(c) Overlay the performance of the best known codes (e.g., BCH, Reed-Muller, polar) from the literature.
The BSC allows exact computation via binomial sums.
For the meta-converse, optimize over the output distribution (or use the uniform output, which is often optimal for the BSC).
RCU computation
For the BSC with a uniform random codebook: $\epsilon_{\rm RCU} = \sum_{d=0}^n\binom nd\,\delta^d(1-\delta)^{n-d}\,\min\{1,\,(M-1)q_d\}$, where $q_d = 2^{-n}\sum_{j=0}^{d}\binom nj$ is the confusion probability at Hamming distance $d$.
Meta-converse computation
Meta-converse with $Q_Y$ uniform: find the smallest threshold $t$ such that $\Pr[\mathrm{Bin}(n,\delta)\le t] \ge 1-\epsilon$, then $\log_2 M \le -\log_2\Pr[\mathrm{Bin}(n,\tfrac12)\le t]$.
Expected results
The normal approximation is within 0.5 bits of the exact bounds at moderate-to-large blocklengths; at short blocklengths, the gap can be 1-2 bits. The best known codes (BCH for small $n$, polar for large $n$) are typically within 1-2 dB of the meta-converse.
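One piece of the project, the exact RCU bound for the BSC, can be sketched as follows; the parameters below are assumed examples:

```python
import math

# Exact RCU bound for the BSC with a uniform random codebook, using the
# binomial-sum expression above. delta, n, k are assumed example values.
delta, n, k = 0.11, 200, 60
M = 2 ** k

# q[d]: probability a fresh uniform codeword lands within Hamming distance d
# of the received word (cumulative Bin(n, 1/2) mass, computed with exact ints)
cum, q = 0, []
for d in range(n + 1):
    cum += math.comb(n, d)
    q.append(cum / 2 ** n)

# RCU: eps <= sum_d P[D = d] * min(1, (M - 1) * q_d), with D ~ Bin(n, delta)
eps_rcu = sum(
    math.comb(n, d) * delta**d * (1 - delta) ** (n - d) * min(1.0, (M - 1) * q[d])
    for d in range(n + 1)
)
print(f"RCU bound on error probability: {eps_rcu:.3e}")
```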