Exercises
ex-ch06-01
Easy: For a Bernoulli(0.3) source with Hamming distortion, compute $R(D)$ for several values of $D$ and sketch the curve.
$R(D) = h(0.3) - h(D)$ for $0 \le D \le 0.3$.
Compute
$R(D) = h(0.3) - h(D) \approx 0.881 - h(D)$ bits.
- $R(0.05) \approx 0.881 - 0.286 = 0.595$ bits
- $R(0.1) \approx 0.881 - 0.469 = 0.412$ bits
- $R(0.15) \approx 0.881 - 0.610 = 0.271$ bits
- $R(0.2) \approx 0.881 - 0.722 = 0.159$ bits
The curve is convex, decreasing from $h(0.3) \approx 0.881$ bits at $D = 0$ to 0 at $D = 0.3$.
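The curve can be checked numerically; the following is a minimal sketch (the helper names `h2` and `rd_binary` are ours, not from the exercise):

```python
import math

def h2(p: float) -> float:
    """Binary entropy in bits, with h2(0) = h2(1) = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def rd_binary(p: float, D: float) -> float:
    """R(D) = h(p) - h(D) for 0 <= D <= min(p, 1-p); zero beyond."""
    if D >= min(p, 1 - p):
        return 0.0
    return h2(p) - h2(D)

# Sample the curve for the Bernoulli(0.3) source
curve = [(D, rd_binary(0.3, D)) for D in (0.0, 0.05, 0.1, 0.15, 0.2, 0.3)]
```

Plotting `curve` shows the convex, decreasing shape directly.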
ex-ch06-02
Easy: For a Gaussian source with squared-error distortion, compute the rate needed to achieve SNR = 30 dB.
SNR $= \sigma^2/D$ in linear scale; 30 dB corresponds to a factor of $10^3$.
Compute
SNR = 30 dB means $\sigma^2/D = 10^{30/10} = 1000$, so $D = \sigma^2/1000$. $R = \tfrac{1}{2}\log_2(\sigma^2/D) = \tfrac{1}{2}\log_2 1000 \approx 4.98$ bits/sample.
Alternatively: $R \approx 30/6.02 \approx 4.98$ bits.
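The dB-to-rate conversion is a one-liner (a sketch; the function name is ours):

```python
import math

def rate_for_snr_db(snr_db: float) -> float:
    """R = (1/2) log2(SNR) bits/sample for the Gaussian R-D bound, SNR in dB."""
    return 0.5 * math.log2(10 ** (snr_db / 10))

r30 = rate_for_snr_db(30.0)  # ~ 4.98 bits/sample
```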
ex-ch06-03
Easy: Explain why $R(D_{\max}) = 0$ for any source and distortion measure, where $D_{\max} = \min_{\hat x} \mathbb{E}[d(X, \hat x)]$.
At $D = D_{\max}$, the encoder sends no information; the decoder uses a fixed reconstruction.
Zero-rate codebook
At rate $R = 0$, the codebook has a single codeword. The best single reconstruction is $\hat x^* = \arg\min_{\hat x} \mathbb{E}[d(X, \hat x)]$, achieving distortion $D_{\max}$. Since $I(X; \hat X) = 0$ when $\hat X$ is constant (independent of $X$), we have $R(D_{\max}) = 0$.
For Hamming distortion on Bernoulli($p$) with $p \le 1/2$: $D_{\max} = p$ and $\hat x^* = 0$ (always output the more likely symbol). For Gaussian squared error: $D_{\max} = \sigma^2$ and $\hat x^* = \mu$ (always output the mean, zero for a zero-mean source).
ex-ch06-04
Easy: A Gaussian source is quantized to 8 bits/sample using an ideal entropy-coded quantizer. What is the achievable SNR according to the R-D bound? What if using a uniform quantizer without entropy coding?
R-D bound: SNR $\approx 6.02R$ dB.
Uniform quantizer (no entropy coding): SNR $\approx 6.02R - 7.3$ dB.
R-D bound
$\mathrm{SNR} = 6.02 \times 8 \approx 48.2$ dB.
Uniform without entropy coding
For a uniform quantizer with $2^8 = 256$ levels: SNR $\approx 48.2 - 7.3 \approx 40.9$ dB (using the standard formula for uniform quantization of a Gaussian). The gap from the R-D bound is about 7.3 dB; this includes both the shaping loss (1.53 dB) and the overload loss.
ex-ch06-05
Medium: Prove that $R(D)$ is convex in $D$ directly from the definition (without using the test-channel mixing argument). Hint: use the operational definition; achievability implies a code exists, and time-sharing two codes gives a convex combination.
Given codes achieving $(R_1, D_1)$ and $(R_2, D_2)$, time-share to achieve $(\lambda R_1 + (1-\lambda)R_2,\ \lambda D_1 + (1-\lambda)D_2)$.
Time-sharing argument
For $\lambda \in [0, 1]$, given a code achieving $(R_1, D_1)$ and one achieving $(R_2, D_2)$: use the first on a $\lambda$-fraction of the source blocks and the second on the rest. The average rate is $\lambda R_1 + (1-\lambda) R_2$ and the average distortion is $\lambda D_1 + (1-\lambda) D_2$, so this pair is achievable. Setting $R_i = R(D_i)$: $R(\lambda D_1 + (1-\lambda) D_2) \le \lambda R(D_1) + (1-\lambda) R(D_2)$. This is the definition of convexity.
ex-ch06-06
Medium: Compute the rate-distortion function for a ternary source uniform on $\{0, 1, 2\}$ with Hamming distortion $d(x, \hat x) = \mathbb{1}\{x \ne \hat x\}$.
By symmetry, the optimal test channel is a symmetric channel: $p(x \mid \hat x) = 1 - D$ if $x = \hat x$ and $D/2$ otherwise.
$R(D) = \log_2 3 - h(D) - D$ for $0 \le D \le 2/3$.
Optimal test channel
By symmetry (uniform source, symmetric distortion), the optimal test channel is a ternary symmetric channel: $p(x \mid \hat x) = 1 - D$ if $x = \hat x$, and $D/2$ otherwise. The marginal $p(\hat x)$ is also uniform.
Compute mutual information
$H(X) = \log_2 3$ (uniform). Given $\hat X$, an error occurs with probability $D$ and, when it does, is split evenly over the 2 remaining symbols, so $H(X \mid \hat X) = h(D) + D \log_2 2 = h(D) + D$ (in bits, where $h$ is the binary entropy). Therefore $R(D) = H(X) - H(X \mid \hat X) = \log_2 3 - h(D) - D$ for $0 \le D \le 2/3$. At $D = 0$: $R = \log_2 3 \approx 1.585$ bits. At $D = 2/3$: $R = 0$ (always output any fixed symbol).
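A quick numeric check of the formula and its endpoints (a sketch; the helper names are ours):

```python
import math

def h2(p: float) -> float:
    """Binary entropy in bits."""
    return 0.0 if p in (0.0, 1.0) else -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def rd_ternary(D: float) -> float:
    """R(D) = log2(3) - h(D) - D for the uniform ternary source, Hamming distortion."""
    if D >= 2 / 3:
        return 0.0
    return math.log2(3) - h2(D) - D
```

The function is continuous at $D = 2/3$: the formula itself tends to 0 there, matching the zero-rate regime.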
ex-ch06-07
Medium: (Reverse waterfilling.) Two independent Gaussian sources have variances $\sigma_1^2 = 4$ and $\sigma_2^2 = 1$. Compute the rate-distortion function at total distortion $D = 2$. Which component is "shut off"?
Reverse waterfilling: $D_i = \min(\theta, \sigma_i^2)$, $R_i = \max\!\big(0, \tfrac12 \log_2(\sigma_i^2/\theta)\big)$.
If $\theta \ge \sigma_2^2$, the second component is shut off.
Try $\theta \ge \sigma_2^2$
If $\theta \ge \sigma_2^2 = 1$: $D_2 = \sigma_2^2 = 1$ (shut off, zero bits allocated). Then $D_1 = \theta = D - D_2 = 1$. Check: $\theta = 1 = \sigma_2^2$, consistent with the assumption. This is the boundary case: the waterfilling level equals $\sigma_2^2$.
Compute rate
$R = \tfrac12 \log_2(\sigma_1^2/\theta) = \tfrac12 \log_2 4 = 1$ bit. Only the first (stronger) component gets any bits. The second component contributes its full variance to the distortion budget.
Compare with equal allocation
Equal allocation $D_1 = D_2 = 1$ gives $R = \tfrac12\log_2(4/1) + \tfrac12\log_2(1/1) = 1$ bit; the same result, since at the boundary the two allocations coincide. At $D = 1.5$, the water level satisfies $2\theta = 1.5$, so $\theta = 0.75 < \sigma_2^2 = 1$: both components are active with $D_1 = D_2 = 0.75$, and $R = \tfrac12\log_2(4/0.75) + \tfrac12\log_2(1/0.75) \approx 1.42$ bits, again matching equal allocation. The two rules differ only when the water level rises above some component's variance, i.e., when a component is shut off.
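The allocation can be automated with a small bisection on the water level (a sketch under the assumptions above; the function name is ours):

```python
import math

def reverse_waterfill(variances, D_total):
    """Find the water level theta with sum_i min(theta, sigma_i^2) = D_total,
    then R = sum over active components of (1/2) log2(sigma_i^2 / theta)."""
    lo, hi = 0.0, max(variances)
    for _ in range(100):  # bisection: total distortion is increasing in theta
        theta = (lo + hi) / 2
        if sum(min(theta, v) for v in variances) < D_total:
            lo = theta
        else:
            hi = theta
    rate = sum(0.5 * math.log2(v / theta) for v in variances if v > theta)
    return theta, rate

theta, rate = reverse_waterfill([4.0, 1.0], 2.0)    # theta = 1, R = 1 bit
theta2, rate2 = reverse_waterfill([4.0, 1.0], 1.5)  # theta = 0.75, R ~ 1.42 bits
```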
ex-ch06-08
Medium: Show that for the Gaussian source, the distortion-rate function satisfies $\mathrm{SNR} \approx 6.02R$ dB, confirming the "6 dB per bit" rule.
Express $D(R) = \sigma^2 2^{-2R}$ in dB and use $10 \log_{10} 2 \approx 3.01$.
Derivation
$\mathrm{SNR} = \sigma^2/D(R) = 2^{2R}$. In dB: $10 \log_{10} 2^{2R} = 20R \log_{10} 2 \approx 6.02R$. Each bit of rate adds 6.02 dB of SNR; this is the fundamental resolution of digital representation. A 16-bit audio sample (CD quality) achieves $\approx 96$ dB SNR.
ex-ch06-09
Medium: (Blahut-Arimoto.) Implement one iteration of the Blahut-Arimoto algorithm for the binary source with $P(X = 1) = p$, Hamming distortion, at slope parameter $\beta$. Start with the uniform output marginal $q^{(0)} = (1/2, 1/2)$.
Update: $q(\hat x \mid x) \propto q(\hat x)\, e^{-\beta d(x, \hat x)}$, then $q(\hat x) \leftarrow \sum_x p(x)\, q(\hat x \mid x)$.
Hamming: $d(x, \hat x) = 0$ if $x = \hat x$, 1 otherwise.
Update test channel
$q(\hat x \mid x) \propto q^{(0)}(\hat x)\, e^{-\beta d(x, \hat x)}$. For $x = 0$: unnormalized weights $\tfrac12$ and $\tfrac12 e^{-\beta}$; normalizing, $q(0 \mid 0) = \frac{1}{1 + e^{-\beta}}$, $q(1 \mid 0) = \frac{e^{-\beta}}{1 + e^{-\beta}}$. For $x = 1$: by symmetry, $q(1 \mid 1) = \frac{1}{1 + e^{-\beta}}$, $q(0 \mid 1) = \frac{e^{-\beta}}{1 + e^{-\beta}}$.
Update marginal
$q^{(1)}(\hat x) = \sum_x p(x)\, q(\hat x \mid x)$: $q^{(1)}(0) = \frac{(1-p) + p\,e^{-\beta}}{1 + e^{-\beta}}$, $q^{(1)}(1) = \frac{p + (1-p)\,e^{-\beta}}{1 + e^{-\beta}}$.
Compute D and R
$D = \sum_{x, \hat x} p(x)\, q(\hat x \mid x)\, d(x, \hat x)$ and $R = \sum_{x, \hat x} p(x)\, q(\hat x \mid x) \log_2 \frac{q(\hat x \mid x)}{q(\hat x)}$. After convergence (a few more iterations), the algorithm produces the point on the R-D curve whose slope is parametrized by $\beta$.
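The full iteration loop is short enough to show in code. This is a sketch; the choice $p = 0.3$, $\beta = 2$ is ours for illustration (the exercise's specific parameters are not fixed here), and the function name is ours:

```python
import math

def blahut_arimoto(p_x, dist, beta, n_iter=200):
    """Blahut-Arimoto for R(D): alternate the test-channel update
    q(xh|x) prop. to q(xh)*exp(-beta*d(x,xh)) and the marginal update
    q(xh) = sum_x p(x) q(xh|x). Returns (R_bits, D) for slope parameter beta."""
    n, m = len(p_x), len(dist[0])
    q = [1.0 / m] * m                       # output marginal, uniform start
    for _ in range(n_iter):
        cond = []                           # test-channel update
        for x in range(n):
            w = [q[xh] * math.exp(-beta * dist[x][xh]) for xh in range(m)]
            Z = sum(w)
            cond.append([wi / Z for wi in w])
        # marginal update
        q = [sum(p_x[x] * cond[x][xh] for x in range(n)) for xh in range(m)]
    D = sum(p_x[x] * cond[x][xh] * dist[x][xh] for x in range(n) for xh in range(m))
    R = sum(p_x[x] * cond[x][xh] * math.log2(cond[x][xh] / q[xh])
            for x in range(n) for xh in range(m) if cond[x][xh] > 0)
    return R, D

# Illustrative run: Bernoulli(0.3) source, Hamming distortion, beta = 2
R, D = blahut_arimoto([0.7, 0.3], [[0, 1], [1, 0]], beta=2.0)
```

For the binary-Hamming case the converged distortion satisfies $D = 1/(1 + e^\beta)$ and the rate matches $h(p) - h(D)$, which makes a convenient sanity check.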
ex-ch06-10
Hard: Prove the converse of the rate-distortion theorem: any sequence of codes with $\mathbb{E}[d(X^n, \hat X^n)] \le D$ satisfies $R \ge R(D)$.
Use $nR \ge H(M) \ge I(X^n; \hat X^n)$.
Single-letterize using $I(X^n; \hat X^n) \ge \sum_i I(X_i; \hat X_i)$.
Use convexity of $R(\cdot)$ to get $\frac{1}{n} \sum_i R(D_i) \ge R\big(\frac{1}{n} \sum_i D_i\big)$.
Rate lower bound
The encoder output $M$ takes at most $2^{nR}$ values: $nR \ge H(M) \ge I(X^n; M) \ge I(X^n; \hat X^n)$, where the last step uses data processing ($X^n \to M \to \hat X^n$).
Single-letterization
$I(X^n; \hat X^n) = H(X^n) - H(X^n \mid \hat X^n) = \sum_i H(X_i) - \sum_i H(X_i \mid \hat X^n, X^{i-1}) \ge \sum_i I(X_i; \hat X_i)$ by the chain rule and dropping conditioning ($H(X^n) = \sum_i H(X_i)$ for independent $X_i$, and conditioning reduces entropy). Each $I(X_i; \hat X_i) \ge R(D_i)$ by definition of $R(\cdot)$, where $D_i = \mathbb{E}[d(X_i, \hat X_i)]$.
Convexity
$\frac{1}{n} \sum_i R(D_i) \ge R\big(\frac{1}{n} \sum_i D_i\big) \ge R(D)$ by convexity (Jensen's inequality applied to convex $R$) and monotonicity, since $\frac{1}{n} \sum_i D_i \le D$ and $R$ is non-increasing. Hence $R \ge R(D)$.
ex-ch06-11
Hard: (Wyner-Ziv for binary source.) Let $X \sim \mathrm{Bern}(1/2)$ and $Y = X \oplus N$, where $N \sim \mathrm{Bern}(p_0)$ independent of $X$ (BSC observation). Compute $R_{X|Y}(D)$ (side information at both encoder and decoder) and compare with the Wyner-Ziv rate $R_{WZ}(D)$ for Hamming distortion.
$R_{X|Y}(D) = h(p_0) - h(D)$ for $0 \le D \le p_0$.
The Wyner-Ziv rate is the lower convex envelope of $g(D) = h(D * p_0) - h(D)$, where $a * b = a(1-b) + (1-a)b$.
$R_{X|Y}(D)$
With side information at both encoder and decoder, the problem reduces to compressing $X$ given $Y$. Since $X \mid Y$ is Bernoulli with parameter $p_0$ (the BSC flips with probability $p_0$): $R_{X|Y}(D) = h(p_0) - h(D)$ for $0 \le D \le p_0$.
$R_{WZ}(D)$
Without side information at the encoder, the Wyner-Ziv rate for this binary case is the lower convex envelope of $g(D) = h(D * p_0) - h(D)$, where $D * p_0 = D(1 - p_0) + (1 - D)p_0$ (BSC convolution). This can be shown using the auxiliary $U$ in the Wyner-Ziv theorem with $U = X \oplus V$, $V \sim \mathrm{Bern}(D)$.
Compare
The gap is $R_{WZ}(D) - R_{X|Y}(D) = h(D * p_0) - h(p_0)$ on the region where $g$ is convex. Since $D * p_0 \ge p_0$ (BSC convolution increases entropy for $p_0 \le 1/2$), the gap is nonnegative, with strict inequality for $0 < D \le p_0 < 1/2$. The gap is zero only when $p_0 = 1/2$ (useless side information) or $D = 0$ (lossless).
For example, with $p_0 = 0.1$, $D = 0.05$: $R_{X|Y} = h(0.1) - h(0.05) \approx 0.469 - 0.286 = 0.183$ bits, while $R_{WZ} \approx h(0.14) - h(0.05) \approx 0.584 - 0.286 = 0.298$ bits. Gap: $\approx 0.115$ bits (about 63% overhead for not knowing $Y$ at the encoder).
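These numbers can be reproduced in a few lines (a sketch; the parameter choice $p_0 = 0.1$, $D = 0.05$ is illustrative, and the expression for $R_{WZ}$ ignores the lower-convex-envelope step):

```python
import math

def h2(p: float) -> float:
    """Binary entropy in bits."""
    return 0.0 if p in (0.0, 1.0) else -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def conv(a: float, b: float) -> float:
    """Binary (BSC) convolution a * b = a(1-b) + (1-a)b."""
    return a * (1 - b) + (1 - a) * b

def wz_rates(p0: float, D: float):
    """Return (R_{X|Y}, R_WZ, gap) for the binary Wyner-Ziv problem."""
    r_both = h2(p0) - h2(D)          # side info at encoder and decoder
    r_wz = h2(conv(D, p0)) - h2(D)   # g(D) = h(D * p0) - h(D)
    return r_both, r_wz, r_wz - r_both

r_both, r_wz, gap = wz_rates(0.1, 0.05)
```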
ex-ch06-12
Hard: (Separation theorem proof sketch.) Prove that a DMS is transmissible over a DMC at distortion $D$ if and only if $R(D) \le C$, assuming one channel use per source symbol. Show that separate source and channel coding achieves this.
Achievability: choose $R$ with $R(D) < R < C$. Use a source code at rate $R$ and a channel code at rate $R$.
Converse: $I(X^n; \hat X^n) \ge nR(D)$ and $I(X^n; \hat X^n) \le nC$.
Achievability
Choose $R$ with $R(D) < R < C$. This is possible whenever $R(D) < C$. Use a lossy source code at rate $R$ achieving distortion close to $D$ (by the R-D theorem). The encoder output is one of $2^{nR}$ indices; transmit it over the channel using a channel code at rate $R < C$ (by the channel coding theorem). The overall distortion is $D + \epsilon_n$, where $\epsilon_n \to 0$ accounts for the (vanishing) channel error.
Converse
Any joint source-channel code maps $X^n$ to $\hat X^n$ via the channel. By data processing: $I(X^n; \hat X^n) \le I(\text{channel input}; \text{channel output}) \le nC$. By the R-D converse: $I(X^n; \hat X^n) \ge nR(D)$. Combining: $nR(D) \le nC$, so $R(D) \le C$ is necessary.
Optimality of separation
The achievability proof uses separate source and channel codes, while the converse applies to any coding scheme (including joint). Since both give the same condition $R(D) \le C$, separation is optimal. No joint scheme can do better.
ex-ch06-13
Challenge: (Rate-distortion for exponential source.) Let $X \sim \mathrm{Exp}(\lambda)$ (exponential with rate $\lambda$) and squared-error distortion $d(x, \hat x) = (x - \hat x)^2$. Show that $R(D) \ge \tfrac12 \log_2 \frac{e}{2\pi\lambda^2 D}$ for $D \le e/(2\pi\lambda^2)$, and compare with the Gaussian R-D at the same variance $\sigma^2 = 1/\lambda^2$.
The exponential source has differential entropy $h(X) = 1 - \ln\lambda$ nats.
The optimal test channel is NOT Gaussian; the exponential source does not match the Gaussian R-D.
Use the Shannon lower bound $R(D) \ge h(X) - \tfrac12 \log_2(2\pi e D)$.
Shannon lower bound
For any source with differential entropy $h(X)$ (in bits) and squared-error distortion: $R(D) \ge h(X) - \tfrac12 \log_2(2\pi e D)$. The "entropy power" is $N = 2^{2h(X)}/(2\pi e)$. The Gaussian with the same entropy has variance $N$, and the bound reads $R(D) \ge \tfrac12 \log_2(N/D)$, with equality iff the source is Gaussian.
Compute for exponential
$h(X) = 1 - \ln\lambda$ nats $= \log_2(e/\lambda)$ bits. Entropy power: $N = 2^{2\log_2(e/\lambda)}/(2\pi e) = (e/\lambda)^2/(2\pi e) = e/(2\pi\lambda^2)$. The Shannon lower bound is therefore $R(D) \ge \tfrac12 \log_2 \frac{e}{2\pi\lambda^2 D}$.
For the exponential source, the exact $R(D)$ is not available in simple closed form. It lies above the Shannon lower bound and below the Gaussian $R(D)$ at the same variance; the Gaussian has the maximum entropy for a given variance, making it the "hardest" source to compress.
Comparison
The Gaussian at variance $\sigma^2 = 1/\lambda^2$ has $R_G(D) = \tfrac12 \log_2(\sigma^2/D)$, and $R_{\exp}(D) \le R_G(D)$ for all $D$, because the Gaussian is the hardest source to compress at a given variance. The gap between $R_G(D)$ and the exponential's Shannon lower bound is the constant $\tfrac12 \log_2(2\pi/e) \approx 0.60$ bits; it measures the "non-Gaussianity" penalty, and the exact $R_{\exp}(D)$ can be computed numerically via Blahut-Arimoto.
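The constant gap between the Gaussian $R(D)$ and the exponential's Shannon lower bound can be verified directly ($\lambda = 1$ is an arbitrary illustrative choice; names are ours):

```python
import math

lam = 1.0                         # exponential rate parameter (illustrative)
var = 1.0 / lam ** 2              # variance of Exp(lam)
h_nats = 1.0 - math.log(lam)      # differential entropy in nats
N = math.exp(2 * h_nats) / (2 * math.pi * math.e)   # entropy power e/(2*pi*lam^2)

def slb(D: float) -> float:
    """Shannon lower bound (1/2) log2(N/D), in bits."""
    return 0.5 * math.log2(N / D)

def r_gauss(D: float) -> float:
    """Gaussian R(D) at the same variance."""
    return 0.5 * math.log2(var / D)

gap_bits = r_gauss(0.01) - slb(0.01)  # constant (1/2) log2(2*pi/e), ~0.604 bits
```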
ex-ch06-14
Challenge: (Successive refinement necessity.) Show that a binary source with $p \ne 1/2$ and Hamming distortion is NOT successively refinable between some pairs $(D_1, D_2)$. Specifically, find $D_2 < D_1 < p$ such that the two-stage rate exceeds $R(D_2)$, i.e., the incremental rate exceeds what successive refinement would require.
For the binary source, the test channel achieving $R(D)$ is a BSC($D$) from $\hat X$ to $X$.
For successive refinement, we need $\hat X_1$ derivable from $\hat X_2$ via a degraded channel.
Check whether a single joint distribution can simultaneously achieve both $R(D_1)$ and $R(D_2)$.
Degradation constraint
For successive refinement, we need the Markov chain $X \to \hat X_2 \to \hat X_1$, where $\hat X_2$ is the fine reconstruction. If $X = \hat X_2 \oplus Z_2$ with $Z_2 \sim \mathrm{Bern}(D_2)$, and $\hat X_1 = \hat X_2 \oplus W$ with $W \sim \mathrm{Bern}(\beta)$, then $X = \hat X_1 \oplus Z_2 \oplus W$, where $Z_2 \oplus W \sim \mathrm{Bern}(D_2 * \beta)$ with $a * b = a(1-b) + (1-a)b$. For this to give distortion $D_1$, we need $D_2 * \beta = D_1$.
Check rates
The rate for the fine layer is $R(D_2) = h(p) - h(D_2)$. The incremental rate in the successive scheme is $R(D_2) - R(D_1) = h(D_1) - h(D_2)$. The total rate must be $R(D_2)$, and the base rate must be $R(D_1) = h(p) - h(D_1)$. The constraint is: can we find a joint distribution achieving both $(R(D_1), D_1)$ and $(R(D_2), D_2)$ with this degradation structure?
Failure for asymmetric sources
For $p \ne 1/2$, the optimal test channel at distortion $D$ is a BSC($D$), but the optimal reconstruction marginal depends on $D$: $P(\hat X = 1) = \frac{p - D}{1 - 2D}$. At $D_1$ and $D_2$ these marginals differ, while the symmetric degradation $\hat X_1 = \hat X_2 \oplus W$ fixes the relationship between them; for $p \ne 1/2$ this constraint prevents both layers from being individually optimal under this construction. The gap is small but nonzero.
ex-ch06-15
Challenge: (Dithered ECSQ achieves $R(D) + 0.754$ bits.) Show that a uniform quantizer with step $\Delta$, combined with dithering (adding uniform noise $U \sim \mathrm{Unif}[-\Delta/2, \Delta/2]$ independent of $X$ before quantization, then subtracting it after), achieves distortion $D = \Delta^2/12$ for ANY source distribution, and the rate (entropy of the quantized output given the dither) is at most $R(D) + 0.754$ bits.
Dithering makes the quantization noise independent of $X$ and uniformly distributed.
The entropy of the dithered quantized output is bounded using the entropy power inequality.
Dithered quantization noise
With dithering: $\hat X = Q_\Delta(X + U) - U$, where $Q_\Delta$ is the uniform quantizer with step $\Delta$. The quantization noise is $E = \hat X - X$. Schuchman's theorem states that $E$ is uniformly distributed on $[-\Delta/2, \Delta/2]$ and independent of $X$. Therefore $D = \mathbb{E}[E^2] = \Delta^2/12$, regardless of the distribution of $X$.
Rate bound
The quantized output $Q_\Delta(X + U)$ takes values on the integer grid. The clean result (Zamir and Feder, 1996) is the identity $H(Q_\Delta(X + U) \mid U) = I(X; X + E')$, with $E' \sim \mathrm{Unif}[-\Delta/2, \Delta/2]$ independent of $X$, together with the bound $I(X; X + E') \le R(D) + \tfrac12 \ln\frac{2\pi e}{6}$ nats. Converting: $\tfrac12 \log_2 \frac{2\pi e}{6} \approx 0.754$ bits. The 0.754-bit figure comes from the full analysis including the entropy of the uniform distribution vs. Gaussian: $\tfrac12 \log_2 \frac{2\pi e}{12} \approx 0.254$ bits is the shaping loss in entropy, and the remaining $0.5$ bits bounds the penalty of using a forward additive-noise channel instead of the optimal test channel.
The total gap to $R(D)$: rate $\le R(D) + 0.754$ bits at distortion $D = \Delta^2/12$, for any source.
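The distribution-free distortion claim can be checked by Monte Carlo simulation (a sketch; the exponential source is an arbitrary non-uniform, non-Gaussian choice, and the names are ours):

```python
import math
import random

random.seed(0)
Delta = 0.5

def dithered_quantize(x: float, u: float) -> float:
    """Subtractive dither: y = Q(x + u) - u, with Q = round to the Delta grid."""
    return Delta * round((x + u) / Delta) - u

# Empirically check E[(y - x)^2] ~ Delta^2 / 12 for a non-uniform source
errs = []
for _ in range(200_000):
    x = random.expovariate(1.0)                  # any source distribution works
    u = random.uniform(-Delta / 2, Delta / 2)    # dither, independent of x
    errs.append((dithered_quantize(x, u) - x) ** 2)
mse = sum(errs) / len(errs)
# mse should be close to Delta**2 / 12 ~ 0.0208
```

Swapping `expovariate` for any other source distribution leaves the empirical MSE at $\Delta^2/12$, which is the content of Schuchman's condition.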