Exercises
ex-ch04-01
Easy
State the alignment condition for the $K$-user finite-field interference channel, and explain in one sentence why it enables $K/2$ total DoF.
The condition is a matrix equation involving $\mathbf{H}_{kj}$, $\mathbf{V}_j$, $\mathbf{U}_k$.
Condition
$\mathbf{U}_k^T \mathbf{H}_{kj} \mathbf{V}_j = \mathbf{0}$ for every $j \neq k$, while $\mathbf{U}_k^T \mathbf{H}_{kk} \mathbf{V}_k$ has full column rank.
Why $K/2$
All unintended transmissions at each receiver align into a common $n/2$-dimensional subspace (the nullspace of $\mathbf{U}_k^T$), freeing the other half of the signal space for the intended message.
ex-ch04-02
Easy
For coded matrix multiplication with $m \times n$ partitions, compute the recovery threshold and compare to the "naive" replication scheme ($K_{\text{rep}} = 1$ if all data is replicated on every worker).
Recovery threshold for polynomial codes: $K = mn$.
Polynomial-code threshold
$K = mn$: the product polynomial has $mn$ unknown coefficient blocks, so any $mn$ distinct evaluations determine it.
Replication comparison
Full replication would have $K_{\text{rep}} = 1$ and maximal per-worker storage (one worker's response suffices). With $N$ workers, polynomial codes tolerate $N - mn$ stragglers while each worker stores only a $1/m$ fraction of $\mathbf{A}$ and a $1/n$ fraction of $\mathbf{B}$: comparable straggler tolerance at far lower storage.
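As a sanity check, the bookkeeping can be scripted (the worker count $N = 6$ and the helper names are illustrative assumptions, not values from the exercise):

```python
# Recovery-threshold bookkeeping: polynomial codes vs. full replication.
def poly_threshold(m, n):
    return m * n              # product polynomial has m*n coefficient blocks

m, n, N = 2, 2, 6             # illustrative partition sizes and worker count
K = poly_threshold(m, n)
stragglers = N - K            # responses the master can afford to lose
storage_coded = 1 / m + 1 / n # fraction of (A, B) each coded worker stores
storage_replicated = 2.0      # replication: full copies of both A and B
assert (K, stragglers) == (4, 2)
assert storage_coded < storage_replicated   # K_rep = 1, but at maximal storage
```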
ex-ch04-03
Easy
Compute the global caching gain for $K$ users, $N$ files, cache size $M$. What is the coded delivery rate?
Gain $1 + KM/N$; rate $R = \frac{K(1 - M/N)}{1 + KM/N}$.
Gain
$g = 1 + KM/N$. Coded caching is a factor of $g$ more efficient than uncoded delivery, whose rate is $K(1 - M/N)$.
Coded rate
$R = \frac{K(1 - M/N)}{1 + KM/N}$.
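The formulas are easy to evaluate exactly (the sample values $K = 10$ users, $N = 10$ files, $M = 2$ and the function names are illustrative assumptions):

```python
# Maddah-Ali / Niesen caching gain and coded delivery rate, exact arithmetic.
from fractions import Fraction

def caching_gain(K, N, M):
    return 1 + Fraction(K * M, N)

def coded_rate(K, N, M):
    return K * (1 - Fraction(M, N)) / caching_gain(K, N, M)

K, N, M = 10, 10, 2                       # illustrative parameters
assert caching_gain(K, N, M) == 3
assert coded_rate(K, N, M) == Fraction(8, 3)   # vs. uncoded K(1 - M/N) = 8
```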
ex-ch04-04
Easy
Using the classical PIR capacity formula, compute $C$ for $N = 5$ databases and $K$ files.
$C = \left(1 + \tfrac{1}{5} + \cdots + \tfrac{1}{5^{K-1}}\right)^{-1}$.
Plug in
$C = \left(1 + \tfrac{1}{5} + \cdots + \tfrac{1}{5^{K-1}}\right)^{-1} \geq 1 - \tfrac{1}{5} = 0.8$ for every $K$; for example, $K = 2$ gives $C = 5/6 \approx 0.83$.
Interpretation
For every bit of downloaded data from the 5 databases, the user receives at least $0.8$ bits of useful file content, nearly the theoretical maximum of $1$.
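These numbers can be reproduced exactly (the function name is an illustrative choice):

```python
# Classical PIR capacity C = (1 + 1/N + ... + 1/N^(K-1))^(-1), exact fractions.
from fractions import Fraction

def pir_capacity(N, K):
    return 1 / sum(Fraction(1, N ** k) for k in range(K))

assert pir_capacity(5, 2) == Fraction(5, 6)   # K = 2 files: C = 5/6
assert pir_capacity(5, 40) > Fraction(4, 5)   # always above the 1 - 1/N floor
```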
ex-ch04-05
Medium
Consider a 3-user finite-field interference channel with signal dimension $n$. Compute the maximum achievable per-user DoF via zero-forcing alone (no alignment), and via IA. Quantify the gap.
Zero-forcing at the receiver uses $d \leq n/3$.
IA achieves roughly $n/2$ per user in the limit.
Zero-forcing
With uniform per-user DoF $d$, each receiver must null $2$ interferers of dimension $d$ each, requiring $d + 2d \leq n$, i.e., $d \leq n/3$.
IA
$d = n/2$ per user in the limit. At finite $n$ one gets $\lfloor n/2 \rfloor$ or $\lceil n/2 \rceil$ depending on rounding.
Gap
IA gains a factor of $3/2$ over zero-forcing in the sum-DoF: $3 \cdot n/2 = 3n/2$ vs. $3 \cdot n/3 = n$. At large $n$, the practical gain is about $1.5\times$.
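The comparison is concrete enough to script (the sample dimension $n = 12$ and helper names are illustrative assumptions):

```python
# Per-user DoF for the 3-user channel: zero-forcing vs. interference alignment.
def zf_dof(n, K=3):
    # each receiver nulls K-1 interferers of dimension d: K*d <= n
    return n // K

def ia_dof(n):
    # interference aligned into half the space: d ~ n/2
    return n // 2

n = 12
assert zf_dof(n) == 4 and ia_dof(n) == 6
assert (3 * ia_dof(n)) / (3 * zf_dof(n)) == 1.5   # sum-DoF gain of 3/2
```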
ex-ch04-06
Medium
Design a concrete coded-matrix-multiplication scheme for $m = n = 2$ partitions and $N = 5$ workers over $\mathbb{F}_7$. Specify the storage and decoding. What is the straggler tolerance?
Use polynomial codes with evaluation points $x_i = i$, $i = 1, \dots, 5$.
Storage
Worker $i$ stores $\tilde{\mathbf{A}}_i = \mathbf{A}_0 + \mathbf{A}_1 x_i$ and $\tilde{\mathbf{B}}_i = \mathbf{B}_0 + \mathbf{B}_1 x_i^2$, for $x_i = i \bmod 7$.
Computation
Worker $i$ returns $\tilde{\mathbf{A}}_i \tilde{\mathbf{B}}_i = h(x_i)$, where $h(x) = \mathbf{A}_0\mathbf{B}_0 + \mathbf{A}_1\mathbf{B}_0 x + \mathbf{A}_0\mathbf{B}_1 x^2 + \mathbf{A}_1\mathbf{B}_1 x^3$, degree 3.
Recovery threshold
$K = mn = 4$. Master can tolerate $N - K = 1$ straggler.
Decoding
Collect any $4$ responses and interpolate the degree-3 polynomial $h$ via Lagrange interpolation; the four coefficients are the four output blocks $\mathbf{A}_j\mathbf{B}_k$.
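The whole pipeline (encode, compute, drop a straggler, decode) fits in a short sketch; the specific $\mathbf{A}$, $\mathbf{B}$ entries and helper names below are illustrative assumptions:

```python
# Polynomial-coded 2x2 matrix multiplication over F_7 with 5 workers.
# A is split into row blocks A0, A1 (1x2); B into column blocks B0, B1 (2x1).
p = 7
A = [[1, 2], [3, 4]]
B = [[5, 6], [0, 1]]
A_blocks = [A[0], A[1]]                               # A0, A1
B_blocks = [[B[0][0], B[1][0]], [B[0][1], B[1][1]]]   # B0, B1 (as lists)

def h(x):
    """What the worker at point x computes: (A0 + A1 x) . (B0 + B1 x^2) mod p."""
    At = [(a0 + a1 * x) % p for a0, a1 in zip(*A_blocks)]
    Bt = [(b0 + b1 * x * x) % p for b0, b1 in zip(*B_blocks)]
    return sum(a * b for a, b in zip(At, Bt)) % p

responses = {x: h(x) for x in (1, 2, 3, 4, 5)}   # five workers
del responses[5]                                  # worker 5 straggles

# Decode from the surviving 4 responses: solve the 4x4 Vandermonde system for
# the coefficients of h(x) = c0 + c1 x + c2 x^2 + c3 x^3 over F_7.
xs, ys = list(responses), list(responses.values())
M = [[pow(x, j, p) for j in range(4)] + [y] for x, y in zip(xs, ys)]
for col in range(4):                              # Gauss-Jordan elimination mod p
    piv = next(r for r in range(col, 4) if M[r][col])
    M[col], M[piv] = M[piv], M[col]
    inv = pow(M[col][col], -1, p)
    M[col] = [v * inv % p for v in M[col]]
    for r in range(4):
        if r != col and M[r][col]:
            f = M[r][col]
            M[r] = [(v - f * w) % p for v, w in zip(M[r], M[col])]
c = [M[r][4] for r in range(4)]                   # [A0B0, A1B0, A0B1, A1B1]
AB_rec = [[c[0], c[2]], [c[1], c[3]]]
assert AB_rec == [[sum(A[i][k] * B[k][j] for k in range(2)) % p
                   for j in range(2)] for i in range(2)]
```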
ex-ch04-07
Medium
Prove that the global caching gain $1 + KM/N$ cannot be exceeded by any centralized scheme using information-theoretic arguments. Sketch the cut-set bound.
Cut: all users on one side, server on the other.
Each user has cache $M$; server must fill in the rest.
Cut-set
For a demand with all-distinct files, the users collectively need $K$ files' worth of information. They already cache $KM/N$ files' worth of their own requests; the server broadcast must supply the remaining $K(1 - M/N)$ files' worth: exactly the numerator of the Maddah-Ali / Niesen rate.
Why the bound is tight
The global gain cannot be exceeded because every broadcast can satisfy at most $1 + KM/N$ users simultaneously (each bit of the broadcast lies in at most $KM/N$ user "cache-aligned" subspaces); dividing the $K(1 - M/N)$ files' worth by this gain yields the rate.
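The multicast-gain claim can be checked combinatorially (the sample parameters are an assumption; the identity itself is the Maddah-Ali / Niesen rate formula):

```python
# In the MN scheme with t = K*M/N an integer, there is one coded message per
# (t+1)-subset of users, each serving t+1 users; the resulting rate
# C(K, t+1)/C(K, t) equals K(1 - M/N)/(1 + K*M/N).
from fractions import Fraction
from itertools import combinations
from math import comb

K, N, M = 4, 8, 2                     # illustrative parameters; t = K*M/N = 1
t = K * M // N
subsets = list(combinations(range(K), t + 1))
assert all(len(S) == t + 1 for S in subsets)   # each broadcast serves t+1 users
rate_combinatorial = Fraction(comb(K, t + 1), comb(K, t))
rate_formula = K * (1 - Fraction(M, N)) / (1 + Fraction(K * M, N))
assert rate_combinatorial == rate_formula
```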
Caveat
The converse is strictly for centralized placement; decentralized placement has a slightly relaxed converse (see Maddah-Ali / Niesen §VI).
ex-ch04-08
Medium
For classical PIR with $N = 3$ databases and $K = 2$ files, write out a concrete scheme achieving the capacity $C = 3/4$.
Split each file into 3 chunks.
Each database returns 1 or 2 chunks; total download = 4 chunks; useful = 3 (one file).
File split
$W_1 = (a_1, a_2, a_3)$, $W_2 = (b_1, b_2, b_3)$.
Scheme (user wants $W_1$)
- DB 1: returns $a_1$.
- DB 2: returns $a_2 + b_1$.
- DB 3: returns $a_3 + b_1$.
- Need additionally $b_1$ for cancellation; DB 3 also returns $b_1$.
Total: 4 chunks downloaded. Rate $= 3/4$, matching the capacity. ✓
Privacy check
Each database sees one or two chunk-labels of each file; the choice of which labels depends on a random permutation unknown to any single database. With the permutation uniform, each database's query is statistically independent of the desired file index.
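A minimal simulation of this 4-chunk instantiation (the chunk alphabet, file contents, and variable names are illustrative assumptions):

```python
# N = 3 databases, K = 2 files of 3 chunks each; the user wants W1.
import random

p = 257                                          # chunk alphabet (an assumption)
W1 = [random.randrange(p) for _ in range(3)]     # desired file (a1, a2, a3)
W2 = [random.randrange(p) for _ in range(3)]     # other file   (b1, b2, b3)

answers = {
    "DB1": [W1[0]],                              # a1
    "DB2": [(W1[1] + W2[0]) % p],                # a2 + b1
    "DB3": [(W1[2] + W2[0]) % p, W2[0]],         # a3 + b1, plus b1 to cancel
}
download = sum(len(v) for v in answers.values()) # 4 chunks total

b1 = answers["DB3"][1]
decoded = [answers["DB1"][0],
           (answers["DB2"][0] - b1) % p,
           (answers["DB3"][0] - b1) % p]
assert decoded == W1
assert download == 4 and len(W1) / download == 0.75   # rate 3/4
```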
ex-ch04-09
Medium
Argue why finite-field IA for coded matrix multiplication has zero failure probability, while a generic random-encoding scheme has strictly positive failure probability over small fields.
Polynomial codes use Vandermonde structure.
Polynomial code (deterministic)
Any square submatrix formed by selecting $mn$ full rows of the $N \times mn$ Vandermonde matrix (with distinct evaluation points) is itself Vandermonde, hence invertible. Zero failure probability.
Generic random encoding
With i.i.d. uniform coefficients in $\mathbb{F}_q$, a $k \times k$ submatrix is singular with probability $1 - \prod_{i=1}^{k}(1 - q^{-i})$, on the order of $1/q$. For $q = 2$ this failure probability exceeds $1/2$, which is very noticeable.
Practical upshot
Production systems use polynomial codes, not random encoding. The wireless-IA literature's fascination with generic randomized constructions does not carry over to the algebraic, code-based distributed-computing setting.
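The contrast can be made quantitative with the exact singularity probability (the function name is illustrative; the Vandermonde alternative fails with probability exactly zero, so there is nothing to compute on that side):

```python
# P(random k x k matrix over F_q is singular) = 1 - prod_{i=1..k} (1 - q^-i).
from fractions import Fraction

def singular_prob(q, k):
    nonsingular = Fraction(1)
    for i in range(1, k + 1):
        nonsingular *= 1 - Fraction(1, q ** i)
    return 1 - nonsingular

assert singular_prob(2, 4) > Fraction(1, 2)      # q = 2: failure above 50%
assert singular_prob(257, 4) < Fraction(1, 64)   # large q: failure ~ 1/q
```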
ex-ch04-10
Medium
Give an explicit 2-user finite-field interference channel over $\mathbb{F}_5$ with generic channel matrices, and verify the alignment condition for a scheme achieving $1/2$ DoF per user with $n = 2$ signal dimensions.
With $K = 2$, IA reduces to zero-forcing; choose $\mathbf{u}_k$ to null the single cross-channel direction.
Setup
One concrete generic choice: let $\mathbf{H}_{11} = \begin{pmatrix} 1 & 2 \\ 2 & 1 \end{pmatrix}$, $\mathbf{H}_{12} = \begin{pmatrix} 2 & 1 \\ 1 & 3 \end{pmatrix}$, $\mathbf{H}_{21} = \begin{pmatrix} 1 & 1 \\ 2 & 3 \end{pmatrix}$, $\mathbf{H}_{22} = \begin{pmatrix} 1 & 3 \\ 1 & 2 \end{pmatrix}$ over $\mathbb{F}_5$.
Precoders and projections
$\mathbf{v}_1 = \mathbf{v}_2 = (1, 0)^T$. The interferer at receiver 1 is $\mathbf{H}_{12}\mathbf{v}_2 = (2, 1)^T$. Choose $\mathbf{u}_1 = (1, 3)^T$, a null vector of the interferer.
Verification
$\mathbf{u}_1^T \mathbf{H}_{12}\mathbf{v}_2 = 2 + 3 = 5 \equiv 0$ ✓. $\mathbf{u}_1^T \mathbf{H}_{11}\mathbf{v}_1 = 1 + 6 \equiv 2 \neq 0$, full rank ✓. Similarly at receiver 2 with $\mathbf{u}_2 = (2, 4)^T$. Per-user DoF: $1/2$. Sum DoF: $1$, matching the no-interference upper bound.
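The verification is mechanical enough to script; the channel matrices, precoders, and helper names below are one concrete generic choice over $\mathbb{F}_5$ (an illustrative assumption):

```python
# Check the 2-user alignment/zero-forcing conditions over F_5.
p = 5
H11 = [[1, 2], [2, 1]]; H12 = [[2, 1], [1, 3]]
H21 = [[1, 1], [2, 3]]; H22 = [[1, 3], [1, 2]]
v1 = v2 = [1, 0]
u1, u2 = [1, 3], [2, 4]

def mat_vec(H, v):
    return [sum(h * x for h, x in zip(row, v)) % p for row in H]

def dot(u, w):
    return sum(a * b for a, b in zip(u, w)) % p

assert dot(u1, mat_vec(H12, v2)) == 0   # u1 kills the cross link into Rx 1 ...
assert dot(u1, mat_vec(H11, v1)) != 0   # ... but keeps the direct link
assert dot(u2, mat_vec(H21, v1)) == 0   # same at Rx 2
assert dot(u2, mat_vec(H22, v2)) != 0
```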
ex-ch04-11
Hard
Prove that the Maddah-Ali / Niesen coded-caching scheme is exactly optimal within the class of uncoded-placement schemes. What happens with coded placement: is the gain strictly larger?
Yu, Maddah-Ali, Avestimehr 2018 showed a gap for small $M$ with coded placement.
Uncoded-placement optimality
Within uncoded placement (each user caches file chunks in the clear), the MN scheme exactly matches the uncoded-placement converse of Yu / Maddah-Ali / Avestimehr (2018). Any scheme meeting a matching converse is optimal within the class.
Coded placement
Yu / Maddah-Ali / Avestimehr (2018) showed a marginal improvement with coded placement for small $M$. The gap is bounded by a constant factor and vanishes as $M$ grows, so asymptotically the MN rate is tight. For small caches the coded-placement rate can be strictly smaller.
Takeaway
For the production regimes of interest (moderate-to-large caches), the MN rate is essentially tight and coded placement offers negligible advantage. Uncoded placement is therefore the right default in Chapter 7's shuffling analysis.
ex-ch04-12
Hard
Extend the polynomial-code construction of §4.2 to $T$-private coded matrix multiplication (any $T$ colluding workers learn nothing about $\mathbf{A}$). What is the recovery threshold?
Add $T$ random-coefficient terms to the encoding polynomial.
Construction
Replace the polynomial-code encoding $\tilde{\mathbf{A}}(x) = \sum_{j=0}^{m-1} \mathbf{A}_j x^j$ by the randomized version $\tilde{\mathbf{A}}(x) + \sum_{t=1}^{T} \mathbf{Z}_t x^{m+t-1}$, where the $\mathbf{Z}_t$ are fresh uniformly random matrices, and widen the exponent spacing of $\tilde{\mathbf{B}}$ from $m$ to $m + T$.
Recovery threshold
$K_T = (m + T)n$, which reduces to $mn$ at $T = 0$. The added terms preserve privacy (any $T$ evaluations of the masking polynomial are jointly uniform, exactly as in Shamir secret sharing) at the cost of $Tn$ more worker responses for the master to decode.
Tradeoff
The privacy parameter $T$ adds linearly to the recovery threshold. For $T = 1$ in the matrix-vector case ($n = 1$), one extra worker in exchange for perfect secrecy against any single worker. This is the construction used in Chapter 11 for ByzSecAgg.
ex-ch04-13
Hard
Show that the classical PIR capacity approaches $1 - 1/N$ as $K \to \infty$. Interpret the result operationally.
$C \to 1 - 1/N$ as $K \to \infty$.
Limit
As $K \to \infty$, $\sum_{k=0}^{K-1} N^{-k} \to \frac{1}{1 - 1/N}$, so $C = \left(\sum_{k=0}^{K-1} N^{-k}\right)^{-1} \to 1 - \frac{1}{N}$.
Operational meaning
With a huge library, the PIR rate (useful content per downloaded byte) is $1 - 1/N$. With 2 databases this is $1/2$; with 5 databases, $4/5$. More databases means less overhead: the marginal cost of privacy vanishes as the system scales.
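The convergence is easy to confirm numerically (function and variable names are illustrative):

```python
# Confirm C(N, K) decreases monotonically toward 1 - 1/N as K grows.
from fractions import Fraction

def pir_capacity(N, K):
    return 1 / sum(Fraction(1, N ** k) for k in range(K))

limit = 1 - Fraction(1, 5)
gaps = [pir_capacity(5, K) - limit for K in (1, 2, 5, 10)]
assert all(g > 0 for g in gaps)                    # finite-K capacity sits above the limit
assert all(a > b for a, b in zip(gaps, gaps[1:]))  # and decreases toward it
assert pir_capacity(2, 50) - Fraction(1, 2) < Fraction(1, 2 ** 45)
```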
Operational vs. asymptotic
Practical PIR deployments care about finite-$K$ behavior, where the capacity formula carries a small $O(N^{-K})$ correction. Chapter 13 develops the coded-storage variant where the effective file size grows and the correction becomes visible.
ex-ch04-14
Hard
Compare the recovery threshold of polynomial-coded matrix multiplication with the PIR download rate in terms of their achievability via finite-field IA. What are the structural parallels and differences?
Parallels
Both constructions express a target computation (matrix product; file retrieval) as a polynomial evaluation, and decode via Lagrange interpolation on a subset of evaluations. Both achieve optimality from finite-field IA: cross-channel alignments reduce to Vandermonde invertibility.
Differences
Coded matrix multiplication aligns many output blocks (the entries of the product) into a low-degree polynomial; PIR aligns file interferers (the non-requested files) into a small number of independent directions. The first is a source-coding-style counting argument; the second is a privacy-preserving download.
Synthesis
Both problems are instances of finite-field IA in the "many-to-one" configuration: many messages, one receiver (master or user). The alignment reduces the effective per-unit overhead from order $K$ (trivial) to order $1$ (IA-optimal), exactly the DoF gain of Section 4.1 in another disguise.
ex-ch04-15
Challenge
Consider a hybrid system combining distributed matrix multiplication with coded caching: workers compute $\mathbf{A}\mathbf{B}$, but intermediate results can be cached at the workers to speed up future products. Conjecture a joint tradeoff between the matrix-multiplication recovery threshold $K$, the caching gain $g$, and the per-worker storage $M$.
Think about what's reused across different products.
The caching gain applies to the repeat-product traffic.
Structure
For a single product, polynomial codes give $K = mn$ (§4.2). For a stream of products, cached intermediate values (the sub-products $\mathbf{A}_j\mathbf{B}_k$ or their polynomial combinations) can be reused across products.
Conjectured tradeoff
The effective recovery threshold for a stream of products with cache size $M$ scales as $K_{\text{eff}} \approx mn/g$ in the many-products limit, achieving a factor-$g$ reduction in the per-product straggler-tolerance requirement. The total per-worker storage is $M$ plus the $1/m + 1/n$ computation fraction.
Research status
The precise tradeoff is open. The CCG (coded computing with caching) literature (2019–present) has partial achievability results but no matching converse. This is a genuine open problem at the intersection of coded computing and coded caching, and a natural research direction extending the IA machinery of this chapter.