Ferkans — Interactive Telecom Tutor

ex-ch20-01

Easy

State the secrecy capacity of the degraded wiretap channel. What is the operational meaning of each term in the formula?

Show Hint

The formula involves the difference of two mutual informations.

Think about what each term means for the receiver and the eavesdropper.

Solution

Formula

$C_s = \max_{P_X}[I(X;Y) - I(X;Z)].$

Interpretation

$I(X;Y)$ is the rate at which the legitimate receiver can reliably decode. $I(X;Z)$ is the rate of information leakage to the eavesdropper. The difference represents the "secrecy advantage" — the rate at which we can communicate reliably to the receiver while keeping the eavesdropper ignorant. The maximization is over the input distribution, which controls the tradeoff.

ex-ch20-02

Easy

Compute the secrecy capacity of a BSC wiretap channel where the main channel has crossover probability $p = 0.05$ and the eavesdropper's channel has crossover probability $q = 0.2$ .

Show Hint

The BSC capacity is $1 - h(p)$ with uniform input.

For the degraded BSC wiretap, $C_s = h(q) - h(p)$ .

Solution

Compute

$C_s = h(q) - h(p) = h(0.2) - h(0.05).$

$C_s \approx 0.722 - 0.286 = 0.436 \text{ bits/use}.$

Interpretation

The main channel capacity is $1 - 0.286 = 0.714$ bits/use, and the secrecy capacity is $0.436$ bits/use — about 61% of the main channel capacity is available for secret communication.
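The arithmetic above can be checked numerically. The following is a minimal Python sketch; the helper names `binary_entropy` and `bsc_secrecy_capacity` are illustrative and do not come from the text.

```python
import math

def binary_entropy(p):
    """Binary entropy h(p) in bits, with h(0) = h(1) = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def bsc_secrecy_capacity(p, q):
    """C_s = h(q) - h(p) for a degraded BSC wiretap channel (p <= q <= 1/2)."""
    return max(binary_entropy(q) - binary_entropy(p), 0.0)

p, q = 0.05, 0.2
c_main = 1 - binary_entropy(p)          # main channel capacity, uniform input
c_s = bsc_secrecy_capacity(p, q)        # secrecy capacity
print(f"C_main = {c_main:.3f} bits/use")
print(f"C_s    = {c_s:.3f} bits/use")
print(f"C_s / C_main = {c_s / c_main:.2f}")
```

Changing `p` and `q` shows how quickly the secrecy capacity shrinks as the eavesdropper's channel approaches the quality of the main channel.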

ex-ch20-03

Easy

For the Gaussian wiretap channel with $P = 10$ W, $\sigma^2_{Y} = 1$ W, and $\sigma^2_{Z} = 4$ W, compute:

(a) The main channel capacity $C_{\text{main}}$

(b) The secrecy capacity $C_s$

(c) The secrecy capacity as a fraction of the main capacity

Show Hint

$C_{\text{main}} = \frac{1}{2}\log(1 + P/\sigma^2_{Y})$ .

$C_s = \frac{1}{2}\log\frac{1 + P/\sigma^2_{Y}}{1 + P/\sigma^2_{Z}}$ .

Solution

Main capacity

$C_{\text{main}} = \frac{1}{2}\log_2(1 + 10) = \frac{1}{2}\log_2(11) \approx 1.73$ bits/use.

Secrecy capacity

$C_s = \frac{1}{2}\log_2\frac{11}{1 + 10/4} = \frac{1}{2}\log_2\frac{11}{3.5} = \frac{1}{2}\log_2(3.14) \approx 0.83$ bits/use.

Fraction

$C_s / C_{\text{main}} \approx 0.83/1.73 \approx 48\%$ . About half the main channel capacity is available for secret communication.
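The three numbers in this solution can be reproduced with a few lines of Python. This is a sketch for checking the arithmetic; the function names are illustrative.

```python
import math

def gaussian_capacity(snr):
    """Capacity of a real AWGN channel, in bits per channel use."""
    return 0.5 * math.log2(1 + snr)

def gaussian_secrecy_capacity(power, noise_main, noise_eve):
    """Difference of the two AWGN capacities, clipped at zero."""
    diff = gaussian_capacity(power / noise_main) - gaussian_capacity(power / noise_eve)
    return max(diff, 0.0)

P, var_y, var_z = 10.0, 1.0, 4.0
c_main = gaussian_capacity(P / var_y)
c_s = gaussian_secrecy_capacity(P, var_y, var_z)
fraction = c_s / c_main
print(f"C_main = {c_main:.2f} bits/use")
print(f"C_s    = {c_s:.2f} bits/use")
print(f"C_s / C_main = {fraction:.0%}")
```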

ex-ch20-04

Easy

Explain the difference between weak secrecy and strong secrecy. Why does the distinction matter for practical security?

Show Hint

Weak secrecy: per-symbol leakage vanishes. Strong secrecy: total leakage vanishes.

Think about what happens as $n$ grows.

Solution

Definitions

Weak secrecy: $\frac{1}{n}I(M; Z^n) \to 0$ . The leakage rate (bits per channel use) vanishes.

Strong secrecy: $I(M; Z^n) \to 0$ . The total leakage (bits) vanishes.

Why it matters

Under weak secrecy, $I(M; Z^n)$ can grow as $o(n)$ — for example, $\sqrt{n}$ bits could leak. Over a long transmission ( $n = 10^6$ ), the eavesdropper could learn $\sqrt{10^6} = 1000$ bits, which might be enough to compromise a 128-bit encryption key.

Under strong secrecy, the total leakage goes to zero regardless of $n$ . This is the appropriate notion for practical security.

The good news: both notions yield the same secrecy capacity, so there is no rate penalty for requiring the stronger guarantee.
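To make the gap between the two notions concrete, the sketch below tabulates a hypothetical leakage profile $I(M; Z^n) = \sqrt{n}$ bits, the same illustrative profile used above. It is not a property of any particular code: the leakage rate vanishes, so weak secrecy holds, while the total leakage keeps growing, so strong secrecy fails.

```python
import math

def leakage_profile(n):
    """Hypothetical leakage I(M; Z^n) = sqrt(n) bits.

    Returns (total leakage in bits, leakage rate in bits per channel use).
    """
    total = math.sqrt(n)
    return total, total / n

for n in (10**2, 10**4, 10**6):
    total, rate = leakage_profile(n)
    print(f"n = {n:>7}: total = {total:6.0f} bits, rate = {rate:.4f} bits/use")
```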

ex-ch20-05

Easy

In a TDD system, Alice and Bob observe channel estimates $X = H + N_A$ and $Y = H + N_B$ where $H \sim \mathcal{CN}(0, 10)$ and $N_A, N_B \sim \mathcal{CN}(0, 1)$ independently. Eve's observation is independent of $(X, Y)$ . What is the secret key capacity?

Show Hint

Since Eve is independent, $I(X; Z) = 0$ .

Compute $I(X; Y)$ for jointly Gaussian $(X, Y)$ .

Solution

Secret key capacity

Since Eve is independent: $C_K = I(X; Y)$ .

$(X, Y)$ are jointly Gaussian with $\text{Var}(X) = \text{Var}(Y) = 11$ and $\text{Cov}(X, Y) = 10$ . Correlation: $\rho = 10/11$ .

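Because $X$ and $Y$ are circularly symmetric complex Gaussian, each observation carries two independent real dimensions, so the factor $\frac{1}{2}$ of the real Gaussian formula is absent:

$C_K = -\log_2(1 - |\rho|^2) = -\log_2\left(1 - \frac{100}{121}\right) = \log_2\left(\frac{121}{21}\right) \approx 2.53 \text{ bits per complex observation},$

or equivalently $\frac{1}{2}\log_2\left(\frac{121}{21}\right) \approx 1.26$ bits per real dimension.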
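The correlation and the resulting key rate can be recomputed directly from the variances. The sketch below reports the rate per real dimension and, as a separate quantity, twice that value for a complex observation with two independent real dimensions; the variable names are illustrative.

```python
import math

var_h, var_noise = 10.0, 1.0              # Var(H) and Var(N_A) = Var(N_B)
var_x = var_y = var_h + var_noise         # Var(X) = Var(Y) = 11
rho = var_h / math.sqrt(var_x * var_y)    # Cov(X, Y) = Var(H), so rho = 10/11

# Mutual information of a jointly Gaussian pair, counted per real dimension.
key_rate_per_real_dim = -0.5 * math.log2(1 - rho ** 2)
# A circularly symmetric complex sample carries two such dimensions.
key_rate_per_complex_obs = 2 * key_rate_per_real_dim

print(f"rho = {rho:.4f}")
print(f"per real dimension:      {key_rate_per_real_dim:.2f} bits")
print(f"per complex observation: {key_rate_per_complex_obs:.2f} bits")
```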

ex-ch20-06

Medium

For the Gaussian wiretap channel, show that the secrecy capacity saturates as $P \to \infty$ . Find the limiting value and interpret it.

Show Hint

Write $C_s(P) = \frac{1}{2}\log\frac{1 + P/\sigma^2_{Y}}{1 + P/\sigma^2_{Z}}$ and take the limit.

Solution

High-SNR limit

$\lim_{P \to \infty} C_s(P) = \lim_{P \to \infty} \frac{1}{2}\log\frac{1 + P/\sigma^2_{Y}}{1 + P/\sigma^2_{Z}} = \frac{1}{2}\log\frac{P/\sigma^2_{Y}}{P/\sigma^2_{Z}} = \frac{1}{2}\log\frac{\sigma^2_{Z}}{\sigma^2_{Y}}.$

Interpretation

The secrecy capacity saturates at $\frac{1}{2}\log(\sigma^2_{Z}/\sigma^2_{Y})$ , determined solely by the noise ratio. Increasing power benefits both channels equally, so the gap — which is what determines secrecy — converges to a constant.

This is fundamentally different from the main channel capacity $\frac{1}{2}\log(1 + P/\sigma^2_{Y}) \to \infty$ . The implication for system design: at high SNR, additional power is wasted for secrecy purposes. Better to invest in more antennas (for spatial secrecy) than more power.
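The saturation is easy to see numerically. The sketch below sweeps the transmit power and compares $C_s(P)$ with the limit; the noise variances $\sigma^2_{Y} = 1$ and $\sigma^2_{Z} = 4$ are assumed values borrowed from ex-ch20-03, not part of this exercise.

```python
import math

def secrecy_capacity(power, var_y, var_z):
    """C_s(P) of the Gaussian wiretap channel in bits per channel use."""
    return 0.5 * math.log2((1 + power / var_y) / (1 + power / var_z))

var_y, var_z = 1.0, 4.0                       # assumed noise variances
limit = 0.5 * math.log2(var_z / var_y)        # high-SNR ceiling

for power_db in (0, 10, 20, 30, 40):
    power = 10 ** (power_db / 10)
    c_s = secrecy_capacity(power, var_y, var_z)
    c_main = 0.5 * math.log2(1 + power / var_y)
    print(f"P = {power_db:2d} dB: C_s = {c_s:.4f}, C_main = {c_main:.2f}, "
          f"gap to limit = {limit - c_s:.4f}")
```

With these values the ceiling is 1 bit/use: the main capacity keeps growing with power, while the secrecy capacity approaches the ceiling from below.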

ex-ch20-07

Medium

Consider a BEC wiretap channel where the main channel is BEC( $\epsilon_1$ ) and the wiretap channel is BEC( $\epsilon_2$ ) with $\epsilon_2 > \epsilon_1$ .

(a) Verify that the channel is degraded.

(b) Compute the secrecy capacity.

(c) At what erasure probability $\epsilon_2$ does the secrecy capacity equal half the main channel capacity?

Show Hint

The BEC( $\epsilon_2$ ) is a degraded version of BEC( $\epsilon_1$ ) when $\epsilon_2 = \epsilon_1 + (1-\epsilon_1)\delta$ for some $\delta$ .

BEC capacity is $1 - \epsilon$ .

Solution

Part (a): Degradedness

The output of BEC( $\epsilon_1$ ) is either the input $X$ or an erasure $?$ . We can construct BEC( $\epsilon_2$ ) from BEC( $\epsilon_1$ ) by further erasing the non-erased outputs with probability $\delta = (\epsilon_2 - \epsilon_1)/(1-\epsilon_1)$ . This gives $\epsilon_2 = \epsilon_1 + (1-\epsilon_1)\delta$ , confirming $X \to Y \to Z$ is a Markov chain.

Part (b): Secrecy capacity

$I(X;Y) = 1 - \epsilon_1$ and $I(X;Z) = 1 - \epsilon_2$ with uniform input.

$C_s = (1-\epsilon_1) - (1-\epsilon_2) = \epsilon_2 - \epsilon_1.$

Part (c): Half-capacity point

$C_s = \frac{1}{2}C_{\text{main}} \Rightarrow \epsilon_2 - \epsilon_1 = \frac{1}{2}(1-\epsilon_1) \Rightarrow \epsilon_2 = \frac{1+\epsilon_1}{2}.$

For $\epsilon_1 = 0.1$ : $\epsilon_2 = 0.55$ .
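The three parts can be tied together in a short check: compute the half-capacity $\epsilon_2$, the extra erasure probability $\delta$ that degrades BEC( $\epsilon_1$ ) into BEC( $\epsilon_2$ ), and confirm that the cascade reproduces $\epsilon_2$. The function names are illustrative.

```python
def extra_erasure_probability(eps1, eps2):
    """delta such that erasing surviving symbols of BEC(eps1) w.p. delta gives BEC(eps2)."""
    return (eps2 - eps1) / (1 - eps1)

def bec_secrecy_capacity(eps1, eps2):
    """C_s = eps2 - eps1 for the degraded BEC wiretap channel."""
    return max(eps2 - eps1, 0.0)

def half_capacity_eps2(eps1):
    """eps2 at which C_s equals half the main capacity 1 - eps1."""
    return (1 + eps1) / 2

eps1 = 0.1
eps2 = half_capacity_eps2(eps1)
delta = extra_erasure_probability(eps1, eps2)
cascade = eps1 + (1 - eps1) * delta      # overall erasure probability seen by Eve
c_s = bec_secrecy_capacity(eps1, eps2)

print(f"eps2 = {eps2:.2f}, delta = {delta:.2f}, cascade = {cascade:.2f}")
print(f"C_s = {c_s:.2f}, C_main / 2 = {(1 - eps1) / 2:.2f}")
```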

ex-ch20-08

Medium

Sketch the achievability proof for the wiretap channel secrecy capacity. Specifically, describe:

(a) The codebook structure (how many codewords, how they are organized)

(b) The encoding rule (what the transmitter does)

(c) Why the legitimate receiver can decode

(d) Why the eavesdropper cannot determine the message

Show Hint

The codebook has $2^{nR}$ 'bins' with $2^{n\tilde{R}_l}$ codewords in each bin.

The randomization rate $\tilde{R}_l \approx I(X;Z)$ is what confuses the eavesdropper.

Solution

Part (a): Codebook

Generate $2^{n(R + \tilde{R}_l)}$ codewords i.i.d. $\sim P_X^*$ , organized into $2^{nR}$ bins (one per message), each containing $2^{n\tilde{R}_l}$ codewords. Set $R + \tilde{R}_l < I(X;Y)$ and $\tilde{R}_l > I(X;Z)$ .

Part (b): Encoding

To send message $m$ , the encoder uniformly selects one of the $2^{n\tilde{R}_l}$ codewords in bin $m$ and transmits it. The selection is random and independent of the message — this is the stochastic encoder.

Part (c): Reliability

The total rate $R + \tilde{R}_l < I(X;Y)$ , so the receiver can decode the specific codeword (both $m$ and the randomization index $l$ ) using joint typicality decoding. Reliability follows from the standard random coding argument.

Part (d): Secrecy

The eavesdropper sees $Z^n$ , which carries at most about $nI(X;Z)$ bits of information about the transmitted codeword. Each bin holds $2^{n\tilde{R}_l}$ codewords with $\tilde{R}_l > I(X;Z)$ , more than the eavesdropper's channel can resolve, so every bin contains codewords that are jointly typical with the observed $Z^n$ . The observation is therefore (almost) equally consistent with every message, and the message remains hidden. Formally, $\frac{1}{n}H(M|Z^n) \geq R - \epsilon_n$ .
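The codebook structure and the stochastic encoder of parts (a) and (b) can be sketched as a toy program. Everything concrete here is an assumption made for illustration: the block length, the bin sizes, and the use of i.i.d. uniform bits in place of $P_X^*$. Decoding and the secrecy analysis are not implemented.

```python
import random

def build_binned_codebook(n, num_messages, bin_size, rng):
    """Random codebook: codebook[m][l] is an n-bit codeword drawn i.i.d. uniform.

    Bin m holds the bin_size codewords that may be sent for message m.
    """
    return [
        [tuple(rng.randint(0, 1) for _ in range(n)) for _ in range(bin_size)]
        for _ in range(num_messages)
    ]

def stochastic_encode(codebook, message, rng):
    """Pick the randomization index l uniformly, independently of the message."""
    l = rng.randrange(len(codebook[message]))
    return l, codebook[message][l]

rng = random.Random(0)
n = 16                    # toy block length
num_messages = 2 ** 2     # 2^{nR} bins (toy value)
bin_size = 2 ** 6         # 2^{n R_l} codewords per bin (toy value)

codebook = build_binned_codebook(n, num_messages, bin_size, rng)
l, codeword = stochastic_encode(codebook, message=3, rng=rng)
print(f"message 3 -> randomization index {l}, codeword {codeword}")
```

Calling `stochastic_encode` twice with the same message will generally return different codewords; that randomness is what the eavesdropper must resolve before learning anything about the bin index.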

ex-ch20-09

Medium

Show that the secret key capacity with one-way communication equals the wiretap secrecy capacity for the degraded case. Specifically, if $X \to Y \to Z$ , show that $C_K^{\to} = I(X;Y) - I(X;Z) = C_s$ .

Show Hint

Use the chain rule to expand $I(X;Y) - I(X;Z)$ using the Markov chain.

The Markov chain gives $I(X;Z) \leq I(X;Y)$ .

Solution

Expand using chain rule

Expand $I(X; Y, Z)$ with the chain rule in two ways. First, $I(X; Y, Z) = I(X; Y) + I(X; Z|Y) = I(X;Y)$ , since the Markov chain $X \to Y \to Z$ gives $I(X; Z|Y) = 0$ .

Second, $I(X; Y, Z) = I(X; Z) + I(X; Y|Z)$ .

Therefore: $I(X; Y) = I(X; Z) + I(X; Y|Z)$ .

Key capacity

$C_K^{\to} = I(X;Y) - I(X;Z) = I(X; Y|Z).$

$C_s = \max_{P_X}[I(X;Y) - I(X;Z)] = \max_{P_X} I(X; Y|Z).$

The duality is exact: secret key generation with one-way communication achieves the same rate as direct wiretap coding. $\blacksquare$
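The identity $I(X;Y) - I(X;Z) = I(X;Y|Z)$ can be verified numerically on a small Markov chain. The sketch below assumes a binary example that is not part of the exercise: $Y$ is $X$ passed through a BSC(0.05), $Z$ is $Y$ passed through a BSC(0.2), and the input distribution is arbitrary. $I(X;Y|Z)$ is computed directly from its definition rather than from the identity being checked.

```python
import math
from itertools import product

def bsc(p):
    """Transition probabilities P(out | in) of a BSC with crossover p."""
    return {(a, b): (1 - p if a == b else p) for a, b in product((0, 1), repeat=2)}

def marginal(joint, keep):
    """Marginalize a joint pmf {(x, y, z): prob} onto the coordinates in keep."""
    out = {}
    for key, pr in joint.items():
        sub = tuple(key[i] for i in keep)
        out[sub] = out.get(sub, 0.0) + pr
    return out

def mutual_information(joint_ab):
    """I(A;B) in bits from a pmf {(a, b): prob}."""
    pa, pb = marginal(joint_ab, (0,)), marginal(joint_ab, (1,))
    return sum(pr * math.log2(pr / (pa[(a,)] * pb[(b,)]))
               for (a, b), pr in joint_ab.items() if pr > 0)

def conditional_mutual_information(joint_xyz):
    """I(X;Y|Z) in bits, from the definition."""
    pz = marginal(joint_xyz, (2,))
    pxz = marginal(joint_xyz, (0, 2))
    pyz = marginal(joint_xyz, (1, 2))
    return sum(pr * math.log2(pr * pz[(z,)] / (pxz[(x, z)] * pyz[(y, z)]))
               for (x, y, z), pr in joint_xyz.items() if pr > 0)

px = {0: 0.3, 1: 0.7}                    # arbitrary input distribution
main, extra = bsc(0.05), bsc(0.2)        # X -> Y -> Z by construction
joint_xyz = {(x, y, z): px[x] * main[(x, y)] * extra[(y, z)]
             for x, y, z in product((0, 1), repeat=3)}

i_xy = mutual_information(marginal(joint_xyz, (0, 1)))
i_xz = mutual_information(marginal(joint_xyz, (0, 2)))
i_xy_given_z = conditional_mutual_information(joint_xyz)
print(f"I(X;Y) - I(X;Z) = {i_xy - i_xz:.6f}")
print(f"I(X;Y|Z)        = {i_xy_given_z:.6f}")
```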

ex-ch20-10

Medium

In the MISO wiretap channel with $n_t = 4$ , $n_r = n_e = 1$ , compute the secrecy rate achieved by artificial noise as a function of the power split parameter $\alpha$ (fraction allocated to the message). Assume: $\mathbf{h}_B = [1, 0, 0, 0]^T$ , $\mathbf{h}_E = [0.5, 0.5, 0.5, 0.5]^T$ , $P = 20$ dB.

Show Hint

The beamforming vector is $\mathbf{v}_s = \mathbf{h}_B/\|\mathbf{h}_B\| = [1,0,0,0]^T$ .

The AN subspace is $\text{null}(\mathbf{h}_B^H) = \text{span}\{\mathbf{e}_2, \mathbf{e}_3, \mathbf{e}_4\}$ .

Solution

Bob's SNR

$P = 100$ (linear). $\mathbf{v}_s = [1,0,0,0]^T$ . $\text{SNR}_B = \alpha P |\mathbf{h}_B^H \mathbf{v}_s|^2 = 100\alpha \cdot 1 = 100\alpha$ .

Eve's SINR

$|\mathbf{h}_E^H \mathbf{v}_s|^2 = |0.5|^2 = 0.25$ .

AN power per dimension: $(1-\alpha)P/3 = 100(1-\alpha)/3$ . $\|\mathbf{h}_E^H \mathbf{V}_{\text{AN}}\|^2 = |0.5|^2 + |0.5|^2 + |0.5|^2 = 0.75$ .

$\text{SINR}_E = \frac{100\alpha \cdot 0.25}{1 + \frac{100(1-\alpha)}{3} \cdot 0.75} = \frac{25\alpha}{1 + 25(1-\alpha)}.$

Secrecy rate

$R_s(\alpha) = \left[\log_2(1 + 100\alpha) - \log_2\left(1 + \frac{25\alpha}{1 + 25(1-\alpha)}\right)\right]^+.$

At $\alpha = 0.5$ : $\text{SNR}_B = 50$ , $\text{SINR}_E = 12.5/(1+12.5) = 0.926$ . $R_s(0.5) = \log_2(51) - \log_2(1.926) \approx 5.67 - 0.95 = 4.72$ bits/use.

At $\alpha = 1$ (no AN): $\text{SINR}_E = 25$ . $R_s(1) = \log_2(101) - \log_2(26) \approx 6.66 - 4.70 = 1.96$ bits/use.

AN more than doubles the secrecy rate at this operating point.
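The secrecy rate can also be evaluated over the whole range of $\alpha$ instead of at two points. The sketch below hard-codes the channels of this exercise and sweeps a grid of power splits; the grid resolution and the function name are illustrative choices.

```python
import math

P = 100.0                       # 20 dB in linear scale
h_b = [1.0, 0.0, 0.0, 0.0]      # Bob's channel
h_e = [0.5, 0.5, 0.5, 0.5]      # Eve's channel

def secrecy_rate(alpha):
    """R_s(alpha): beamforming along h_B, isotropic AN in the null space of h_B."""
    snr_b = alpha * P * h_b[0] ** 2
    signal_e = alpha * P * h_e[0] ** 2              # message power leaking to Eve
    an_gain = sum(g ** 2 for g in h_e[1:])          # ||h_E^H V_AN||^2 = 0.75
    an_e = (1 - alpha) * P / 3 * an_gain            # AN power received by Eve
    sinr_e = signal_e / (1 + an_e)
    return max(math.log2(1 + snr_b) - math.log2(1 + sinr_e), 0.0)

alphas = [k / 1000 for k in range(1, 1001)]
best_alpha = max(alphas, key=secrecy_rate)

print(f"R_s(0.5) = {secrecy_rate(0.5):.2f} bits/use")
print(f"R_s(1.0) = {secrecy_rate(1.0):.2f} bits/use")
print(f"best alpha on grid = {best_alpha:.3f}, R_s = {secrecy_rate(best_alpha):.2f}")
```

For this channel pair the maximizer lies close to an even split, so the value at $\alpha = 0.5$ is already near the best the artificial-noise scheme can do.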

ex-ch20-11

Medium

Show that the secrecy capacity of the MIMO wiretap channel is at least as large as the secrecy capacity of the best MISO sub-channel obtained by receive beamforming at Bob.

Show Hint

Bob can apply a receive beamforming vector $\mathbf{u}^H$ to get a MISO channel.

The MIMO secrecy capacity optimizes over all input covariances, which includes rank-1 beamforming.

Solution

MISO reduction

Let Bob apply a unit-norm receive beamforming vector $\mathbf{u}^H$ . The effective channel becomes MISO: $\tilde{y} = \mathbf{u}^H \mathbf{H} \mathbf{x} + \mathbf{u}^H \mathbf{w}_B$ , $\mathbf{z} = \mathbf{H}_{E} \mathbf{x} + \mathbf{w}_E$ .

The effective Bob channel is $\tilde{\mathbf{h}}_B^H = \mathbf{u}^H \mathbf{H}$ with noise variance $\|\mathbf{u}\|^2 = 1$ .

Lower bound

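Let $C_s^{\text{MISO}}(\mathbf{u})$ denote the secrecy capacity of this MISO channel; the artificial-noise rate $\max_{\alpha, \mathbf{v}_s} R_s(\alpha, \mathbf{v}_s)$ is one achievable rate for it.

Any wiretap code for the MISO channel is also a valid code for the MIMO channel: the transmitter uses the same MISO strategy, Bob applies $\mathbf{u}^H$ before decoding, and Eve's observation $\mathbf{z}$ is unchanged, so both reliability and secrecy carry over. Equivalently, the MIMO secrecy capacity optimizes over all input covariances and all receiver processing, a larger set that includes rank-1 + AN strategies as special cases: $C_s^{\text{MIMO}} \geq C_s^{\text{MISO}}(\mathbf{u}) \text{ for any } \mathbf{u}.$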

The MIMO capacity is at least as large as the best MISO sub-channel. $\blacksquare$

ex-ch20-12

Hard

Prove the converse of the wiretap channel secrecy capacity for the degraded case. Specifically, show that for any $(2^{nR}, n)$ code with $P_e^{(n)} \to 0$ and $\frac{1}{n}I(M; Z^n) \to 0$ , we have $R \leq \max_{P_X}[I(X;Y) - I(X;Z)]$ .

Show Hint

Start with $nR = H(M)$ and use Fano's inequality.

Decompose using the chain rule and the Markov chain $M \to X^n \to Y^n \to Z^n$ .

Use the Csiszár sum identity to relate the bound to single-letter quantities.

Solution

Start with Fano

$nR = H(M) \leq I(M; Y^n) + n\epsilon_n$ by Fano's inequality.

Subtract leakage

$nR \leq I(M; Y^n) - I(M; Z^n) + I(M; Z^n) + n\epsilon_n \leq I(M; Y^n) - I(M; Z^n) + n\delta_n + n\epsilon_n$ by the secrecy constraint $\frac{1}{n}I(M; Z^n) \leq \delta_n \to 0$ .

Chain rule expansion

$I(M; Y^n) - I(M; Z^n) = \sum_{i=1}^n [I(M; Y_i | Y^{i-1}) - I(M; Z_i | Z_{i+1}^n)] \leq \sum_{i=1}^n [I(X_i; Y_i) - I(X_i; Z_i)] \leq n \max_{P_X}[I(X;Y) - I(X;Z)].$

Conclude

$R \leq \max_{P_X}[I(X;Y) - I(X;Z)] + \delta_n + \epsilon_n.$

Taking $n \to \infty$ : $R \leq C_s$ . $\blacksquare$

ex-ch20-13

Hard

For a MISO wiretap channel with $n_t$ transmit antennas, show that the optimal artificial noise power fraction $(1-\alpha^*)$ increases as the number of antennas increases (for fixed total power and a generic Eve channel).

Show Hint

As $n_t$ increases, the null space of $\mathbf{h}_B^H$ grows, providing more directions for AN.

More AN directions means lower per-direction AN power but more total interference at Eve.

Show that the optimal $\alpha^*$ is decreasing in $n_t$ .

Solution

Setup

The AN occupies the $(n_t - 1)$ -dimensional null space of $\mathbf{h}_B^H$ . With power fraction $(1-\alpha)$ , the AN power per direction is $\sigma_{\text{AN}}^2 = (1-\alpha)P/(n_t - 1)$ .

Eve's interference from AN: $\text{AN}_E = \sigma_{\text{AN}}^2 \|\mathbf{h}_E^H \mathbf{V}_{\text{AN}}\|^2$ .

For generic $\mathbf{h}_E$ , $\|\mathbf{h}_E^H \mathbf{V}_{\text{AN}}\|^2 \sim \chi^2_{2(n_t-1)} \cdot \frac{\|\mathbf{h}_E\|^2}{2n_t}$ , whose mean is $\frac{n_t-1}{n_t}\|\mathbf{h}_E\|^2$ . Multiplying by $\sigma_{\text{AN}}^2$ gives $\mathbb{E}[\text{AN}_E] = \frac{(1-\alpha)P}{n_t-1} \cdot \frac{n_t-1}{n_t} \cdot \|\mathbf{h}_E\|^2 = (1-\alpha)P\|\mathbf{h}_E\|^2/n_t$ .

Large $n_t$ behavior

As $n_t$ grows, $\text{AN}_E$ concentrates around its mean $(1-\alpha)P \|\mathbf{h}_E\|^2 / n_t$ (by the law of large numbers applied to $\|\mathbf{h}_E^H \mathbf{V}_{\text{AN}}\|^2$ ).

Bob's rate: $R_B = \log(1 + \alpha P \|\mathbf{h}_B\|^2)$ (independent of $n_t$ ).

Eve's SINR: $\text{SINR}_E \approx \frac{\alpha P |\mathbf{h}_E^H \mathbf{v}_s|^2}{1 + (1-\alpha)P(n_t-1)/(n_t) \cdot \|\mathbf{h}_E\|^2/(n_t-1)} \to \frac{\alpha P |\mathbf{h}_E^H \mathbf{v}_s|^2}{1 + (1-\alpha)P\|\mathbf{h}_E\|^2/n_t}$ .

Optimal power split

The secrecy rate $R_s = R_B - \log(1 + \text{SINR}_E)$ is maximized by choosing $\alpha$ to balance Bob's SNR against Eve's SINR. As $n_t$ increases, the AN becomes more effective per unit power (more null space directions), so the optimal $\alpha^*$ decreases — more power goes to AN.

In the limit $n_t \to \infty$ : $\text{SINR}_E \to 0$ for any $\alpha < 1$ , and $\alpha^* \to P\|\mathbf{h}_B\|^2 / (1 + P\|\mathbf{h}_B\|^2)$ , allocating just enough power to the message to achieve the interference-free capacity.

ex-ch20-14

Hard

Prove that the secrecy degrees of freedom of the MIMO wiretap channel with $n_t$ transmit, $n_r$ receive, and $n_e$ eavesdropper antennas is $d_s = [\min(n_t, n_r) - \min(n_t, n_e)]^+$ when $n_t \leq n_r + n_e$ .

Show Hint

Upper bound: use the secrecy capacity formula and the high-SNR scaling of log-det.

Lower bound: use GSVD-based precoding to separate the sub-channels.

Solution

Upper bound

At high SNR ( $P \to \infty$ ), with optimal $\boldsymbol{\Sigma}_{X} = (P/n_t)\mathbf{I}$ : $C_s \leq \log\det(\mathbf{I} + (P/n_t)\mathbf{H}\mathbf{H}^{H}) - \log\det(\mathbf{I} + (P/n_t)\mathbf{H}_{E}\mathbf{H}_{E}^{H}).$

The first term scales as $\min(n_t, n_r) \log(P/n_t) + O(1)$ and the second as $\min(n_t, n_e) \log(P/n_t) + O(1)$ .

Therefore $d_s \leq \min(n_t, n_r) - \min(n_t, n_e)$ .

Lower bound

Use the GSVD of $(\mathbf{H}, \mathbf{H}_{E})$ : there exist unitary matrices and a common right precoder that simultaneously diagonalize both channels. The GSVD creates parallel sub-channels indexed by $i$ , where the $i$ -th sub-channel has gains $(\alpha_i, \beta_i)$ to Bob and Eve respectively.

There are $\min(n_t, n_r) - \min(n_t, n_e)$ sub-channels where $\alpha_i > 0$ and $\beta_i = 0$ (visible to Bob, invisible to Eve). Sending data on these sub-channels achieves $d_s = \min(n_t, n_r) - \min(n_t, n_e)$ . $\blacksquare$

Special case: $n_t > n_r + n_e$

When $n_t > n_r + n_e$ , we can find $n_r$ beamforming directions that are simultaneously in the range of $\mathbf{H}^{H}$ and the null space of $\mathbf{H}_{E}$ . This gives $d_s = n_r = \min(n_t, n_r)$ — the full non-secrecy DoF.

ex-ch20-15

Hard

A secret key generation protocol operates as follows: Alice and Bob observe $n$ i.i.d. samples of $(X, Y)$ with $X, Y \sim \mathcal{N}(0, 1)$ and correlation $\rho = 0.95$ . Eve has no observation.

(a) What is the maximum key rate?

(b) If Alice quantizes $X$ to 4-bit resolution, what fraction of the key rate is lost due to quantization?

(c) After quantization, Alice sends the syndrome of a rate-0.2 LDPC code over the public channel for information reconciliation. How many secret key bits can be extracted per observation?

Show Hint

Part (a): $C_K = I(X;Y) = -\frac{1}{2}\log(1-\rho^2)$ .

Part (b): Quantization reduces the effective correlation.

Part (c): The syndrome leaks information that must be subtracted from the key rate.

Solution

Part (a): Maximum key rate

$C_K = I(X;Y) = -\frac{1}{2}\log_2(1 - 0.95^2) = -\frac{1}{2}\log_2(0.0975) \approx 1.68 \text{ bits/obs}.$

Part (b): Quantization loss

With 4-bit uniform quantization of $X \in [-3, 3]$ (covering 99.7% of the Gaussian), the quantization noise variance is approximately $\sigma_q^2 \approx (6/16)^2/12 \approx 0.0117$ .

The effective correlation after quantization: $\rho_{\text{eff}} \approx \rho \cdot \sigma_X / \sqrt{\sigma_X^2 + \sigma_q^2} \approx 0.95 \cdot 1/\sqrt{1.0117} \approx 0.9445$ .

$C_K^{\text{quant}} \approx -\frac{1}{2}\log_2(1 - 0.9445^2) = -\frac{1}{2}\log_2(0.1079) \approx 1.61$ bits/obs.

Loss: $(1.68 - 1.61)/1.68 \approx 4\%$ .

Part (c): Extractable key bits

The syndrome of the rate-0.2 LDPC code leaks $0.2 \times 4 = 0.8$ bits/obs to Eve over the public channel (this takes the disclosed syndrome to be a fraction 0.2 of the 4 quantized bits).

After privacy amplification, the extractable key rate is: $R_K \leq C_K^{\text{quant}} - R_{\text{leak}} = 1.61 - 0.8 = 0.81 \text{ bits/obs}.$

This is a conservative estimate; tighter reconciliation codes would leak less.
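Since this solution chains several approximations, it helps to recompute every quantity from the stated formulas. The sketch below does that under the same modeling assumptions as the solution: uniform quantization noise of variance $\Delta^2/12$ over $[-3, 3]$, and a leakage of $0.2 \times 4$ bits per observation. The variable names are illustrative.

```python
import math

def gaussian_key_rate(rho):
    """I(X;Y) in bits for a jointly Gaussian real pair with correlation rho."""
    return -0.5 * math.log2(1 - rho ** 2)

rho = 0.95
bits = 4
x_range = 6.0                              # quantizer covers [-3, 3]
step = x_range / 2 ** bits                 # quantization step Delta
var_q = step ** 2 / 12                     # uniform quantization noise model
rho_eff = rho / math.sqrt(1 + var_q)       # Var(X) = 1

rate_max = gaussian_key_rate(rho)
rate_quant = gaussian_key_rate(rho_eff)
loss_fraction = (rate_max - rate_quant) / rate_max
leak = 0.2 * bits                          # leakage figure used in the solution
rate_extractable = max(rate_quant - leak, 0.0)

print(f"sigma_q^2 = {var_q:.4f}, rho_eff = {rho_eff:.4f}")
print(f"C_K = {rate_max:.3f}, C_K_quant = {rate_quant:.3f} bits/obs")
print(f"quantization loss = {loss_fraction:.1%}")
print(f"extractable key rate = {rate_extractable:.3f} bits/obs")
```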

ex-ch20-16

Hard

Show that the Gaussian distribution maximizes $I(X;Y) - I(X;Z)$ under a power constraint for the degraded Gaussian wiretap channel. (Hint: use the entropy power inequality or the maximum entropy argument.)

Show Hint

The Gaussian maximizes entropy under a variance constraint.

For the degraded channel $Y = X + N_Y$ , $Z = Y + N'$ , expand both mutual informations.

Solution

Expand mutual informations

$I(X;Y) = h(Y) - h(Y|X) = h(Y) - h(N_Y) = h(X + N_Y) - \frac{1}{2}\log(2\pi e \sigma^2_{Y})$ .

Since $Z = X + N_Z$ where $N_Z = N_Y + N'$ (degraded): $I(X;Z) = h(Z) - h(Z|X) = h(X + N_Z) - \frac{1}{2}\log(2\pi e \sigma^2_{Z})$ .

Difference

$I(X;Y) - I(X;Z) = h(X + N_Y) - h(X + N_Z) + \frac{1}{2}\log\frac{\sigma^2_{Z}}{\sigma^2_{Y}}.$

The last term is constant. We need to show that $h(X + N_Y) - h(X + N_Z)$ is maximized by Gaussian $X$ .

Costa's argument

By the entropy power inequality (EPI), for independent $X$ and $N$ : $2^{2h(X+N)} \geq 2^{2h(X)} + 2^{2h(N)}$ .

A direct proof: define $f(t) = h(X + \sqrt{t}G)$ where $G \sim \mathcal{N}(0,1)$ independent of $X$ . Then $h(X + N_Y) = f(\sigma^2_{Y})$ and $h(X + N_Z) = f(\sigma^2_{Z})$ .

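By de Bruijn's identity, $f'(t) = \frac{1}{2}J(X + \sqrt{t}G)$ where $J$ is the Fisher information.

$h(X+N_Y) - h(X+N_Z) = f(\sigma^2_{Y}) - f(\sigma^2_{Z}) = -\int_{\sigma^2_{Y}}^{\sigma^2_{Z}} f'(t)dt = -\frac{1}{2}\int_{\sigma^2_{Y}}^{\sigma^2_{Z}} J(X + \sqrt{t}G)dt$ .

Since $\sigma^2_{Z} > \sigma^2_{Y}$ , this difference is maximized when $J(X + \sqrt{t}G)$ is as small as possible on $[\sigma^2_{Y}, \sigma^2_{Z}]$ . By the Cramér-Rao bound, $J(W) \geq 1/\text{Var}(W)$ with equality if and only if $W$ is Gaussian, so under the power constraint $J(X + \sqrt{t}G) \geq \frac{1}{\text{Var}(X) + t} \geq \frac{1}{P + t}$ . Both inequalities hold with equality for every $t$ when $X \sim \mathcal{N}(0, P)$ (the Gaussian minimizes Fisher information among distributions with the same variance), so the Gaussian input maximizes $I(X;Y) - I(X;Z)$ . $\blacksquare$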
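A numerical comparison makes the claim tangible, although it checks only one competitor and is not a proof. The sketch below evaluates $h(X + N_Y) - h(X + N_Z)$ for a Gaussian input and for an equiprobable binary input $X = \pm\sqrt{P}$ of the same power, using a plain Riemann sum for the mixture entropy. The values $P = 10$, $\sigma^2_{Y} = 1$, $\sigma^2_{Z} = 4$ are assumptions borrowed from ex-ch20-03.

```python
import math

def gauss_pdf(y, mean, var):
    return math.exp(-(y - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

def entropy_gaussian(var):
    """Differential entropy of N(0, var) in bits."""
    return 0.5 * math.log2(2 * math.pi * math.e * var)

def entropy_binary_plus_noise(power, var, half_width=12.0, steps=6000):
    """h(X + N) in bits for X = +/- sqrt(power) equiprobable and N ~ N(0, var).

    Midpoint Riemann sum over an interval covering both mixture components.
    """
    amp = math.sqrt(power)
    lo = -amp - half_width * math.sqrt(var)
    hi = amp + half_width * math.sqrt(var)
    dy = (hi - lo) / steps
    total = 0.0
    for k in range(steps):
        y = lo + (k + 0.5) * dy
        p = 0.5 * gauss_pdf(y, amp, var) + 0.5 * gauss_pdf(y, -amp, var)
        if p > 0:
            total -= p * math.log2(p) * dy
    return total

P, var_y, var_z = 10.0, 1.0, 4.0          # assumed values
const = 0.5 * math.log2(var_z / var_y)    # input-independent term

diff_gauss = entropy_gaussian(P + var_y) - entropy_gaussian(P + var_z)
diff_binary = (entropy_binary_plus_noise(P, var_y)
               - entropy_binary_plus_noise(P, var_z))

print(f"Gaussian input: I(X;Y) - I(X;Z) = {diff_gauss + const:.3f} bits/use")
print(f"Binary input:   I(X;Y) - I(X;Z) = {diff_binary + const:.3f} bits/use")
```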

ex-ch20-17

Challenge

(Research-flavored) Consider a massive MIMO system with $n_t = 128$ transmit antennas, $K = 8$ single-antenna legitimate users, and one $n_e$ -antenna eavesdropper. The transmitter uses zero-forcing beamforming to serve the $K$ users and artificial noise in the remaining $n_t - K = 120$ dimensions.

(a) Show that the per-user secrecy rate approaches the per-user rate without the eavesdropper as $n_t \to \infty$ (secrecy is free in massive MIMO).

(b) Quantify the rate of convergence: how large must $n_t$ be for the secrecy penalty to be less than 0.1 bits/use at $\text{SNR} = 10$ dB?

(c) Discuss what happens when Eve has $n_e > n_t - K$ antennas.

Show Hint

The AN lives in a $(n_t - K)$ -dimensional space. Eve's interference grows with $n_t - K$ .

Use random matrix theory: $\|\mathbf{H}_{E} \mathbf{V}_{\text{AN}}\|_F^2 / (n_t - K) \to n_e$ as $n_t \to \infty$ .

Solution

Part (a): Asymptotic secrecy

With ZF beamforming, each user $k$ achieves rate $R_k = \log(1 + \alpha P_k \|\mathbf{h}_k\|^2)$ . Eve's SINR for user $k$ : $\text{SINR}_{E,k} = \frac{\alpha P_k |\mathbf{h}_{E}^H \mathbf{v}_k|^2}{n_e + \frac{(1-\alpha)P}{n_t - K} \|\mathbf{H}_{E} \mathbf{V}_{\text{AN}}\|_F^2}.$

As $n_t \to \infty$ , $\|\mathbf{H}_{E} \mathbf{V}_{\text{AN}}\|_F^2 \sim n_e(n_t - K)$ , so the AN power at Eve grows as $(1-\alpha)P n_e$ . Meanwhile, $|\mathbf{h}_{E}^H \mathbf{v}_k|^2 = O(1)$ .

Therefore $\text{SINR}_{E,k} \to 0$ and the secrecy rate $R_{s,k} \to R_k$ .
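The random-matrix scaling $\|\mathbf{H}_{E} \mathbf{V}_{\text{AN}}\|_F^2 / (n_t - K) \to n_e$ used in Part (a) can be checked with a small Monte Carlo experiment. The sketch assumes i.i.d. $\mathcal{CN}(0,1)$ entries for $\mathbf{H}_{E}$ and, for simplicity, takes $\mathbf{V}_{\text{AN}}$ to be $n_t - K$ standard basis vectors; by the unitary invariance of $\mathbf{H}_{E}$, any orthonormal $\mathbf{V}_{\text{AN}}$ independent of $\mathbf{H}_{E}$ gives the same distribution. The value $n_e = 4$ is an assumed example.

```python
import random

def an_energy_at_eve(n_t, K, n_e, rng):
    """Sample ||H_E V_AN||_F^2 with i.i.d. CN(0,1) entries.

    V_AN is taken as n_t - K standard basis vectors (an assumption), so the
    squared Frobenius norm is a sum of n_e * (n_t - K) squared magnitudes.
    """
    sigma = 0.5 ** 0.5                     # real and imaginary parts ~ N(0, 1/2)
    total = 0.0
    for _ in range(n_e * (n_t - K)):
        re = rng.gauss(0.0, sigma)
        im = rng.gauss(0.0, sigma)
        total += re * re + im * im
    return total

rng = random.Random(1)
K, n_e = 8, 4
for n_t in (16, 128, 1024):
    ratio = an_energy_at_eve(n_t, K, n_e, rng) / (n_t - K)
    print(f"n_t = {n_t:4d}: ||H_E V_AN||_F^2 / (n_t - K) = {ratio:.3f} (n_e = {n_e})")

# SINR threshold for a secrecy penalty below 0.1 bits/use
sinr_threshold = 2 ** 0.1 - 1
print(f"required SINR_E < {sinr_threshold:.3f}")
```

The fluctuation of the ratio around $n_e$ shrinks as $n_t - K$ grows, which is the concentration the asymptotic argument relies on.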

Parts (b) and (c)

(b) The secrecy penalty is $\Delta R_k \approx \log(1 + \text{SINR}_{E,k})$ . Setting $\Delta R_k < 0.1$ bits/use at $\text{SNR} = 10$ dB requires $\text{SINR}_{E,k} < 2^{0.1} - 1 \approx 0.072$ . Working through the random matrix expressions, this requires $n_t \gtrsim K + 10 n_e$ , so for $n_e = 4$ : $n_t \gtrsim 48$ .

(c) When $n_e > n_t - K$ , the AN subspace is smaller than Eve's observation space. Eve can partially resolve the AN and extract some information about the data signals. The secrecy rate degrades but remains positive as long as $n_t > K$ (some AN is still possible). The worst case is $n_e \geq n_t$ , where Eve can potentially invert all spatial processing.

ex-ch20-18

Challenge

(Open-ended) Compare information-theoretic secrecy with computational secrecy (AES-256 encryption) along the following dimensions:

(a) Threat model (what does the adversary need to break the scheme?)

(b) Key management (does the scheme require pre-shared keys?)

(c) Performance overhead (bandwidth/power cost of security)

(d) Practical deployment readiness

Argue for or against the proposition: "Physical-layer security will complement, not replace, cryptographic security in 6G systems."

Show Hint

Information-theoretic secrecy is unconditional (no computational assumptions) but requires a channel advantage.

AES-256 is computationally secure (assumed infeasible to break) but requires key distribution.

Consider quantum computing threats to AES and post-quantum cryptography.

Solution

Comparison

(a) Threat model:

IT secrecy: assumes eavesdropper has unbounded computation but bounded channel quality. Broken if Eve gets a better channel.
AES-256: assumes eavesdropper has bounded computation (~ $2^{256}$ operations infeasible). Broken by a sufficiently powerful computer (quantum or classical).

(b) Key management:

IT secrecy: no pre-shared key needed (wiretap coding) or generates keys from channel (key agreement).
AES: requires pre-shared or Diffie-Hellman-established keys. Key distribution is the hard problem in practice.

(c) Performance overhead:

IT secrecy: costs rate (secrecy rate < main channel rate) and may require extra antennas/power for AN.
AES: negligible throughput overhead (AES-NI hardware acceleration), but key establishment has latency.

(d) Deployment:

IT secrecy: research prototypes for key generation; no deployed wiretap codes.
AES: deployed universally in all commercial systems.

Argument for complementarity: IT secrecy is most valuable where key management is hardest: IoT devices without PKI, machine-type communication, and ad-hoc networks. Channel-based key generation can bootstrap encryption keys without infrastructure, while AES handles the bulk data encryption. In 6G, the combination of PLS for key generation and AES for encryption offers defense in depth: even if quantum computing breaks the public-key exchange normally used to establish AES keys, the PLS-generated keys rest on no computational assumption and provide an unconditional fallback.
