Ferkans — Interactive Telecom Tutor

ex15-1

Easy

Compute $C_{\text{PIR-SI}}(N, K, M)$ for $N = 5, K = 8, M = 3$ .

Show Hint

Use $C(N, K - M)$ .

Effective $K = 5$ .

Solution

Apply formula

$C_{\text{PIR-SI}}(5, 8, 3) = C_{\text{PIR}}(5, 5) = (1 + 1/5 + 1/25 + 1/125 + 1/625)^{-1} = 625/781 \approx 0.800$ .

Compare with classical

$C_{\text{PIR}}(5, 8) \approx 0.800$ as well — the gap is tiny because $K \geq 5$ is already in the asymptotic regime.

ex15-2

Easy

For $N = 4, K = 8, \gamma = 0.5$ , estimate the cache-aided PIR rate by approximating the binomial mixture with its mean.

Show Hint

Mean of $\text{Bin}(K-1, \gamma)$ is $(K-1)\gamma$ .

Approximate $C(N, K - I) \approx C(N, K - (K-1)\gamma)$ .

Solution

Mean of binomial

$\mathbb{E}[I] = (K-1)\gamma = 7 \cdot 0.5 = 3.5$ . Effective $K - I \approx 4.5$ .

Approximate rate

$C(4, 4.5) \approx C(4, 5) \approx 0.802$ (using interpolation; exact value slightly higher with the full mixture).

Operational

Cache fraction $\gamma = 0.5$ doubles the effective per-bit rate compared with $\gamma = 0$ — non-trivial improvement.

ex15-3

Easy

For $U = 5, K = 20, \gamma = 0.4$ , compute: (a) MAN rate, (b) demand-private rate, (c) privacy cost ratio.

Show Hint

$R_{\text{MAN}} = U(1-\gamma)/(1+U\gamma)$ .

$R^* = U(1-\gamma)$ .

Solution

(a) MAN

$R_{\text{MAN}} = 5 \cdot 0.6 / (1 + 2) = 3 / 3 = 1$ .

(b) Demand-private

$R^* = 5 \cdot 0.6 = 3$ .

(c) Privacy cost ratio

$R^* / R_{\text{MAN}} = 3 / 1 = 3$ . Demand privacy costs $3\times$ rate.

ex15-4

Medium

Sketch the curve of $C_{\text{PIR-SI}}(4, K, M)$ vs. $M$ for $K = 10, M = 0, 1, \ldots, 9$ . Identify the rate at $M = K - 1$ .

Show Hint

$C(4, K - M)$ is monotone in $M$ .

$M = K - 1$ gives effective $K = 1$ .

Solution

Compute end points

$M = 0$ : $C(4, 10) \approx 0.750$ . $M = 9 = K - 1$ : $C(4, 1) = 1$ (only one file to retrieve, trivially).

Mid points

$M = 5$ : $C(4, 5) \approx 0.802$ . Slow growth from $M = 0$ to $M = 5$ ; rapid growth from $M = 5$ to $M = 9$ .

Operational

Side info is most useful at the high- $M$ end of the spectrum. Small caches give small rate gains; nearly-full caches give dramatic gains (rate $\to 1$ ).

ex15-5

Medium

For $K = 10, N = 4$ , compare $C_{\text{PIR-SI}}(4, 10, M)$ at $M = 5$ vs. $C_{\text{cache-PIR}}(4, 10, \gamma)$ at $\gamma = 0.5$ . Why is one higher?

Show Hint

$M = 5$ side info: 5 complete files cached.

$\gamma = 0.5$ cache: half the bits of every file.

Solution

PIR-SI at $M = 5$

$C_{\text{PIR-SI}}(4, 10, 5) = C(4, 5) \approx 0.802$ .

Cache-aided at $\gamma = 0.5$

Binomial mixture centered around $i = 4-5$ . Effective $K \approx 5-6$ . $C \approx 0.83-0.85$ (slightly higher than $0.802$ ).

Why cache-aided is better

The cache spreads the side info across all $K$ files (half a bit per file, on average). This avoids "wasting" cache on files that won't be requested. Side info with $M$ complete files is less flexible — if $\theta \notin \mathcal{S}$ , the protocol must retrieve a complete missing file.

Operational

For random caches with no popularity bias, cache-aided PIR is the right framework. For explicit knowledge of which files are wanted (e.g., personal favorites), side-info framework with $M$ complete files is appropriate.

ex15-6

Medium

For demand-private cached delivery, derive the asymptotic rate as $\gamma \to 1$ and interpret.

Show Hint

$R^* = U(1 - \gamma) \to 0$ as $\gamma \to 1$ .

Solution

Derivation

$R^*(\gamma) = U(1 - \gamma)$ . As $\gamma \to 1$ , $R^* \to 0$ .

Interpretation

At $\gamma = 1$ , every user has the entire library in their cache. No delivery is needed. Demand privacy is automatically satisfied (the server doesn't broadcast anything, so it learns nothing about demands).

Operational

Cache-rich deployments (e.g., $\gamma > 0.9$ ) have near-zero delivery cost regardless of privacy. The privacy cost $R^* / R_{\text{MAN}} = 1 + U\gamma$ is dominated by the asymptotic $\gamma$ .

ex15-7

Medium

For $N = 4$ , find the cache fraction $\gamma^*$ at which $C_{\text{cache-PIR}}(4, 10, \gamma)$ exceeds $0.95$ .

Show Hint

Approximate the mixture using the mean effective $K - I$ .

Solve $C(4, K - (K-1)\gamma) \geq 0.95$ .

Solution

Setup

$C(4, x) = (1 - 1/4)/(1 - (1/4)^x) = 0.75 / (1 - 4^{-x})$ . Need $0.75 / (1 - 4^{-x}) \geq 0.95$ , i.e., $1 - 4^{-x} \leq 0.789$ , i.e., $4^{-x} \geq 0.211$ , i.e., $x \leq -\log_4(0.211) \approx 1.12$ .

Mean effective $K - I$

$K - (K-1)\gamma = 10 - 9\gamma \leq 1.12$ $\Rightarrow \gamma \geq 8.88/9 \approx 0.987$ .

Sanity check

At $\gamma = 0.99$ , the cache holds $\sim 99\%$ of every file. Effective $K - I \approx 1$ . Rate $\approx 1$ . Plausible.

Operational

Reaching rate $0.95$ requires a very large cache (essentially 99% of the library at $K = 10$ ). Smaller caches give smaller improvements.

ex15-8

Medium

For demand-private cached delivery, plot the privacy cost ratio $R^*/R_{\text{MAN}}$ as a function of $U$ for fixed $\gamma = 0.5$ .

Show Hint

Cost ratio is $1 + U\gamma$ .

Solution

Cost formula

$R^*/R_{\text{MAN}} = 1 + U\gamma = 1 + 0.5 U$ .

Tabulate

$U = 1$ : ratio $= 1.5$ . $U = 5$ : ratio $= 3.5$ . $U = 10$ : ratio $= 6$ . $U = 20$ : ratio $= 11$ . Linear in $U$ .

Operational

For deployments with many users ( $U \geq 10$ ), demand privacy becomes very expensive ( $\geq 6\times$ rate cost). For small user populations ( $U \leq 3$ ), the cost is $\leq 2.5\times$ — manageable.

ex15-9

Medium

Conjecture: for PIR with $M$ side-info files and $T$ -colluding privacy, the capacity is $C(N, K - M, T)$ . Verify the consistency at: (i) $M = 0$ , (ii) $T = 1$ , (iii) $M = K - 1, T = 1$ .

Show Hint

Apply the $T$ -colluding formula with effective $K - M$ .

Solution

(i) $M = 0$

$C(N, K - 0, T) = C(N, K, T)$ — recovers $T$ -colluding PIR.

(ii) $T = 1$

$C(N, K - M, 1) = C_{\text{PIR-SI}}(N, K, M)$ — recovers Wei-Banawan-Ulukus.

(iii) $M = K-1, T = 1$

$C(N, K - (K-1), 1) = C(N, 1, 1) = 1$ — only one unknown file, capacity 1. Consistent with §15.1.

Status

The conjecture is consistent with all known special cases. General proof is open (achievability follows easily; converse requires the joint cut-set argument, which has not been fully verified).

ex15-10

Hard

Sketch the cut-set argument that proves $R^*(\gamma) \geq U(1 - \gamma)$ for multi-user demand-private cached delivery.

Show Hint

Use a random demand vector $\mathbf{d} \sim \text{Uniform}([K]^U)$ .

Apply the demand-privacy condition.

Cut-set on the broadcast message.

Solution

Setup

Each user $u$ decodes $W_{d_u}$ from $X$ (broadcast) and $Z_u$ (cache). Demand privacy: $X$ is independent of $\mathbf{d}$ .

Cut-set

Apply cut-set: each user must be able to recover their requested file. $H(W_{d_u} | X, Z_u) = 0$ (decoding). $H(Z_u) \leq \gamma K L$ (cache size).

Demand privacy → independence

Demand privacy: $X$ is statistically independent of $\mathbf{d}$ . The broadcast must be "demand-agnostic" — i.e., contain enough information to satisfy any demand vector simultaneously.

Counting bits

For $X$ to enable $U$ different users to recover any of $K$ possible files, given caches of size $\gamma K L$ , we need $|X| \geq U(1 - \gamma) L$ bits. Because $X$ is demand-agnostic (privacy), we cannot use coded multicast (which would tailor $X$ to the specific demands).

Conclusion

$R^*(\gamma) = D / L \geq U(1 - \gamma)$ . Achievability via the trivial uncoded scheme matches the bound.

ex15-11

Hard

Combine cache-aided PIR with coded-storage PIR (Chapter 14 §14.1): user has cache $\gamma$ fraction; databases store $(N, r)$ -MDS coded files. Conjecture the rate as a function of $\gamma, N, K, r$ and verify special cases.

Show Hint

Effective formula: binomial mixture of coded-storage capacities.

Solution

Conjecture

$R(N, K, r, \gamma) = \sum_{i=0}^{K-1} \binom{K-1}{i} \gamma^i (1-\gamma)^{K-1-i} \cdot C_{\text{PIR-MDS}}(N, K-i, r)$ — same binomial structure with coded-storage capacity replacing classical PIR capacity.

Verify special cases

$\gamma = 0$ : $R = C_{\text{PIR-MDS}}(N, K, r)$ — coded-storage PIR. ✓ $r = 1$ : $R = C_{\text{cache-PIR}}(N, K, \gamma)$ — uncoded-storage cache-aided PIR. ✓ $\gamma = 1$ : $R = 1$ (everything cached). ✓

Status

Conjecture is consistent with all boundary cases. Open: whether the binomial-mixture formula matches the capacity (converse is non-trivial for coded storage with caches).

ex15-12

Hard

The Wan-Tuninetti-Caire scheme uses shared randomness across users (and the server). Without shared randomness, can demand privacy still be achieved? At what rate?

Show Hint

Public-coin protocols use only individual randomness, no shared randomness.

Compare with SPIR's randomness requirement (Chapter 14 §14.3).

Solution

Setup

Public-coin: each user generates their own randomness independently. No shared random masks between server and users (or among users).

Achievability

Each user can mask their query with individual randomness. The server's broadcast may need to accommodate $U$ different randomness sources.

Rate cost

Open question. Heuristically, the rate may degrade by an additive factor proportional to $U$ (one extra "side channel" per user). Sharp characterization unknown.

Operational

Public-coin protocols are easier to deploy (no shared-key infrastructure) but typically incur a rate cost. Trade-off: deployment ease vs. rate efficiency.

ex15-13

Hard

Suppose user demands follow a Zipf distribution with parameter $\alpha = 1$ (popular files dominate). Estimate the demand-private rate vs. the uniform-demand case.

Show Hint

Zipf: $\Pr[d = i] \propto 1/i$ .

Effective entropy $H(\mathbf{d})$ is lower than uniform's $U \log K$ .

Solution

Uniform baseline

Uniform demands: $H(\mathbf{d}) = U \log K$ . WTC rate: $R^* = U(1 - \gamma)$ .

Zipf entropy

Zipf ( $\alpha = 1$ , $K$ files): $H(\mathbf{d}) \approx U \cdot \log H_K \approx U \log \log K$ (Zipf entropy grows slowly).

Effect on rate

With less demand entropy, the demand-privacy constraint is easier to satisfy — the optimal rate may be lower than $U(1 - \gamma)$ . Open question: how much lower?

Operational

For popularity-driven workloads (e.g., video streaming), the WTC rate may be a pessimistic upper bound on the actual cost. Practical schemes can exploit the Zipf structure to recover some of the lost coded-multicast gain — but only at the cost of partial demand privacy.

ex15-14

Hard

Consider a sequence of $T$ PIR queries from the same user. Define the latency-amortized PIR rate as the long-run average rate over all $T$ queries. Discuss whether this can exceed the one-shot capacity $C_{\text{PIR}}(N, K)$ .

Show Hint

Caching effect: previous responses become side info for later queries.

After the first query, the user has $W_{\theta_1}$ as side info.

Solution

Setup

Query 1: classical PIR rate $C(N, K)$ . Query 2 (with $W_{\theta_1}$ as side info): rate $C(N, K - 1)$ . ... Query $t$ : rate $C(N, K - t + 1)$ (assuming all $\theta$ 's distinct).

Amortized rate

Average $R_T = (1/T) \sum_{t=1}^{T} C(N, K - t + 1)$ . As $T \to K$ , this approaches $1$ (the user eventually has the entire library).

Comparison with one-shot

One-shot capacity: $C(N, K) \approx 1 - 1/N$ . Amortized rate over $T = K$ queries: avg $\approx (1/K) \sum_{k=1}^{K} C(N, k)$ , which exceeds the one-shot capacity (because later queries enjoy effective- $K$ reduction).

Operational

Latency amortization is a real practical advantage of repeat queries. The "first query is expensive, later queries cheap" pattern is intuitive. Sharp characterization is open.

ex15-15

Challenge

Discuss the structure of quantum PIR (databases hold quantum states, queries and answers are quantum). Why might quantum PIR have higher capacity than classical PIR?

Show Hint

Quantum coding allows for stronger forms of interference alignment.

Holevo bound limits classical capacity from quantum sources.

Solution

Classical PIR baseline

Classical Sun-Jafar: $C(N, K) \to 1 - 1/N$ as $K \to \infty$ . This is bounded above by the download-rate constraint.

Quantum advantage

Quantum entanglement between databases (or between user and databases) can enable more efficient interference alignment. Analog: superdense coding sends $2$ bits in $1$ qubit — quantum PIR may transmit more "information per query" per qubit.

Known results

Recent work (Song-Kao-Hayashi 2019; Allaix-Cascudo-Cuevas-Bardo 2020) establishes quantum PIR capacity in some regimes — sometimes exceeding the classical capacity. The full quantum PIR capacity is open.

Open frontier

Quantum PIR is one of the most exciting frontiers — the classical-quantum gap is non-trivial and still being characterized. Analogous gaps emerge in quantum coded computing (Chapter 18 may touch on this).

Engineering feasibility

Quantum PIR requires quantum databases and quantum links — currently impractical. The results are theoretical benchmarks, not deployment targets.

Exercises

ex15-1

Apply formula

Compare with classical

ex15-2

Mean of binomial

Approximate rate

Operational

ex15-3

(a) MAN

(b) Demand-private

(c) Privacy cost ratio

ex15-4

Compute end points

Mid points

Operational

ex15-5

PIR-SI at $M = 5$

Cache-aided at $\gamma = 0.5$

Why cache-aided is better

Operational

ex15-6

Derivation

Interpretation

Operational

ex15-7

Setup

Mean effective $K - I$

Sanity check

Operational

ex15-8

Cost formula

Tabulate

Operational

ex15-9

(i) $M = 0$

(ii) $T = 1$

(iii) $M = K-1, T = 1$

Status

ex15-10

Setup

Cut-set

Demand privacy → independence

Counting bits

Conclusion

ex15-11

Conjecture

Verify special cases

Status

ex15-12

Setup

Achievability

Rate cost

Operational

ex15-13

Uniform baseline

Zipf entropy

Effect on rate

Operational

ex15-14

Setup

Amortized rate

Comparison with one-shot

Operational

ex15-15

Classical PIR baseline

Quantum advantage

Known results

Open frontier

Engineering feasibility