Exercises
ex-ch05-01
Easy: For a polynomial code with partition parameters $m$ and $n$, what are the per-worker storage, the recovery threshold $K$, and the degree of the product polynomial $h(x) = \tilde{\mathbf{A}}^\top(x)\tilde{\mathbf{B}}(x)$?
$K = mn$, per-worker storage $\frac{1}{m}$ of $\mathbf{A}$ and $\frac{1}{n}$ of $\mathbf{B}$, $\deg h = mn - 1$.
Plug in
Each worker stores a $\frac{1}{m}$ fraction of $\mathbf{A}$ and a $\frac{1}{n}$ fraction of $\mathbf{B}$; the recovery threshold is $K = mn$; the product polynomial has degree $(m-1) + m(n-1) = mn - 1$, so $K = mn$ evaluations determine it.
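The bookkeeping above can be sketched in a few lines of Python (a hypothetical helper, not from the chapter; it assumes the standard Section 5.2 construction):

```python
def poly_code_params(m, n):
    """Basic polynomial-code parameters for an m-by-n partition."""
    recovery_threshold = m * n              # K = mn
    product_degree = m * n - 1              # (m-1) + m(n-1) = mn - 1
    storage_fraction = (1.0 / m, 1.0 / n)   # fraction of A, fraction of B per worker
    return recovery_threshold, product_degree, storage_fraction
```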
ex-ch05-02
Easy: A polynomial code uses $N$ workers with recovery threshold $K = mn$. What is the maximum number of stragglers the master can tolerate?
Straggler tolerance $= N - K$.
Compute
$N - K = N - mn$. With the given parameters the straggler tolerance is nearly $N/2$: almost half of all workers can fail.
ex-ch05-03
Easy: State why the polynomial code requires the field $\mathbb{F}_q$ to satisfy $q \ge N + 1$.
Distinct evaluation points must come from $\mathbb{F}_q \setminus \{0\}$.
Reason
$\mathbb{F}_q$ has $q - 1$ nonzero elements. The polynomial code needs $N$ distinct nonzero evaluation points $x_i$, so $q - 1 \ge N$, i.e., $q \ge N + 1$. Practically we round up to a prime power $q \ge N + 1$.
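The "round up" step is easy to mechanize; a small sketch (restricted to primes rather than general prime powers, for simplicity):

```python
def smallest_prime_at_least(n):
    """Smallest prime q >= n, by trial division (fine for the sizes here)."""
    def is_prime(k):
        if k < 2:
            return False
        d = 2
        while d * d <= k:
            if k % d == 0:
                return False
            d += 1
        return True

    q = max(n, 2)
    while not is_prime(q):
        q += 1
    return q

# For N workers, pick q = smallest_prime_at_least(N + 1).
```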
ex-ch05-04
Easy: Compare the recovery threshold of the polynomial code and MDS-coded replication for the same partition parameters.
Polynomial: $K = mn$. MDS: a smaller $K$ at higher per-worker storage.
Both
Polynomial: $K = mn$. MDS: a lower recovery threshold. MDS has lower $K$ but higher per-worker storage (a larger stored fraction of the inputs for MDS vs. $\frac{1}{m}$ of $\mathbf{A}$ and $\frac{1}{n}$ of $\mathbf{B}$ for the polynomial code). Different tradeoffs.
ex-ch05-05
Medium: Construct a polynomial code for $m = n = 2$ over $\mathbb{F}_q$ with $N$ workers and evaluation points $x_i = i$. Write out worker 5's stored matrices and response.
Encoding: $\tilde{\mathbf{A}}_i = \mathbf{A}_0 + \mathbf{A}_1 x_i$, $\tilde{\mathbf{B}}_i = \mathbf{B}_0 + \mathbf{B}_1 x_i^2$.
Encoding for worker 5
$\tilde{\mathbf{A}}_5 = \mathbf{A}_0 + 5\mathbf{A}_1$; $\tilde{\mathbf{B}}_5 = \mathbf{B}_0 + 25\mathbf{B}_1$.
Computation
Worker 5 returns $\tilde{\mathbf{A}}_5^\top \tilde{\mathbf{B}}_5 = h(5)$, where $h(x) = \tilde{\mathbf{A}}^\top(x)\tilde{\mathbf{B}}(x)$ has degree $mn - 1 = 3$.
Recovery threshold
$K = mn = 4$. Master can tolerate $N - 4$ stragglers.
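The whole construction can be checked numerically. The sketch below uses scalar blocks and the toy field $\mathbb{F}_{97}$ (assumed values, not the exercise's exact parameters), encodes as above, and recovers all four block products from any $K = 4$ responses:

```python
p = 97                      # toy prime field
A = [5, 11]                 # blocks A_0, A_1 (scalars for illustration)
B = [7, 13]                 # blocks B_0, B_1

def respond(i):
    """Worker i computes A~(x_i) * B~(x_i) with x_i = i."""
    a = (A[0] + A[1] * i) % p
    b = (B[0] + B[1] * i * i) % p
    return (a * b) % p

def interpolate(points):
    """Recover the 4 coefficients of the degree-3 product polynomial mod p."""
    coeffs = [0, 0, 0, 0]
    for xi, yi in points:
        basis, denom = [1], 1
        for xj, _ in points:
            if xj == xi:
                continue
            nxt = [0] * (len(basis) + 1)   # multiply basis by (x - xj)
            for k, c in enumerate(basis):
                nxt[k + 1] = (nxt[k + 1] + c) % p
                nxt[k] = (nxt[k] - xj * c) % p
            basis = nxt
            denom = denom * (xi - xj) % p
        scale = yi * pow(denom, p - 2, p) % p
        coeffs = [(c0 + scale * c1) % p for c0, c1 in zip(coeffs, basis)]
    return coeffs

# Any K = 4 workers suffice; take workers 2, 3, 5, 6 (worker 5 included).
pts = [(i, respond(i)) for i in (2, 3, 5, 6)]
c = interpolate(pts)
# Coefficients at x^0..x^3 are A0*B0, A1*B0, A0*B1, A1*B1 (mod p; choose p
# larger than any true product to lift back to the integers).
```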
ex-ch05-06
Medium: Prove that the encoding polynomial in §5.2 has the desired output blocks $\mathbf{A}_j^\top \mathbf{B}_k$ as the coefficients of $x^{j + km}$ (for $0 \le j < m$, $0 \le k < n$).
Multiply $\tilde{\mathbf{A}}^\top(x)$ by $\tilde{\mathbf{B}}(x)$.
Multiplication
$\tilde{\mathbf{A}}^\top(x)\tilde{\mathbf{B}}(x) = \sum_{j=0}^{m-1}\sum_{k=0}^{n-1} \mathbf{A}_j^\top \mathbf{B}_k\, x^{j + km}$.
Distinct exponents
The exponents $j + km$ for $0 \le j < m$, $0 \le k < n$ are distinct (since $j + km$ uniquely encodes the residue $j$ mod $m$ and the quotient $k$). Hence each $\mathbf{A}_j^\top \mathbf{B}_k$ appears as a distinct coefficient, and Lagrange interpolation recovers all of them independently.
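The distinct-exponent claim is easy to spot-check for small $(m, n)$:

```python
# For each (m, n), the map (j, k) -> j + k*m on 0 <= j < m, 0 <= k < n
# should hit every exponent 0 .. mn-1 exactly once.
results = {}
for m, n in [(2, 2), (3, 4), (5, 3)]:
    exps = [j + k * m for j in range(m) for k in range(n)]
    results[(m, n)] = sorted(exps) == list(range(m * n))
```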
ex-ch05-07
Medium: For a polynomial code with $N$ workers and recovery threshold $K = mn$, simulate (on paper or in your head) which workers respond when $N - K$ of them straggle. Verify that decoding still succeeds.
Polynomial codes are MDS-like: any $K$ of the $N$ responses suffice.
Identify responders
Responders: the $K$ non-straggling workers, exactly $K$ responses. The master does not need to know which workers will straggle in advance.
Decoding
With $K$ distinct evaluation points $x_i$, Lagrange interpolation succeeds because any $K \times K$ Vandermonde submatrix is invertible.
Result
The full output is recovered exactly. The number of failed workers equals the straggler tolerance $N - K$: decoding succeeds right at the edge of the budget.
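The "any $K$-subset works" claim can be verified exhaustively for toy parameters (assumed here: $N = 6$, $K = 4$, field $\mathbb{F}_{97}$, points $x_i = i$), using the product formula for the Vandermonde determinant:

```python
from itertools import combinations

p, N, K = 97, 6, 4
dets = []
for survivors in combinations(range(1, N + 1), K):
    det = 1
    for a, b in combinations(survivors, 2):
        det = det * (b - a) % p   # Vandermonde det = prod of (x_b - x_a), b > a
    dets.append(det)
# Every 4-subset gives a nonzero determinant mod p, so every decoder
# system is invertible no matter which N - K = 2 workers straggle.
```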
ex-ch05-08
Medium: Show that the decoder complexity of polynomial codes can be reduced from $O(K^2)$ (naive Lagrange) to $O(K \log^2 K)$ using FFT-based interpolation. When does the FFT speedup matter in practice?
FFT polynomial interpolation is $O(K \log^2 K)$ over a field with appropriate roots of unity.
Naive Lagrange
For each output coefficient, compute the Lagrange basis contribution in $O(K)$ time. Summing over the $K$ coefficients: $O(K^2)$ total.
FFT speedup
Using the fast polynomial-evaluation and interpolation algorithms built on the FFT, interpolation runs in $O(K \log^2 K)$ provided the field has a primitive root of unity of sufficiently high order. For large $K$, this is roughly a $K / \log^2 K$ speedup over naive Lagrange.
When it matters
The FFT speedup is significant only once $K$ grows large. Below that, the naive approach is simpler and the constant factors of FFT overhead dominate. Modern coded-computing libraries provide both algorithms and switch based on $K$.
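Ignoring constants, a quick comparison of the two operation counts shows where the crossover pressure comes from (a toy model, not a benchmark):

```python
import math

def naive_ops(K):                    # O(K^2) Lagrange
    return K * K

def fft_ops(K):                      # O(K log^2 K) fast interpolation
    return int(K * math.log2(K) ** 2)

# Asymptotic speedup factor K / log^2 K for a few thresholds.
speedup = {K: naive_ops(K) / fft_ops(K) for K in (16, 256, 4096)}
```

At small $K$ the two counts are comparable, which is why the constant-factor overhead of the FFT route can make it slower in practice there.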
ex-ch05-09
Medium: Compute the storage requirement of an MDS-coded matrix multiplication scheme that achieves its recovery threshold for the same partition parameters. Compare with the polynomial code's storage.
MDS schemes typically store a fixed fraction of each input per worker.
MDS storage
Each worker stores $\frac{1}{m}$ of $\mathbf{A}$ and $\frac{1}{n}$ of $\mathbf{B}$: the same stored fraction of each input as the polynomial code.
Polynomial-code storage
$\frac{1}{m}$ of $\mathbf{A}$ and $\frac{1}{n}$ of $\mathbf{B}$. Same per-worker storage as MDS in this case.
Comparison
At this storage, the budgets are identical. The difference is the recovery threshold: the MDS scheme achieves a lower $K$ than the polynomial code's $K = mn$. MDS is better for straggler tolerance at this storage. The catch is that MDS requires a different polynomial structure that is sensitive to the partition shape; polynomial codes generalize more cleanly to arbitrary $(m, n)$.
ex-ch05-10
Medium: A polynomial-code system runs with recovery threshold $K = mn$ on $N$ workers with i.i.d. exponential task times of rate $\lambda$. Compute the expected wall-clock time per matrix multiplication (assuming $K$ responses needed).
Expected wait for the fastest $K$ of $N$ exponentials: $\frac{1}{\lambda}(H_N - H_{N-K})$, where $H_j = \sum_{i=1}^{j} \frac{1}{i}$.
Recovery threshold
$K = mn$.
Order-statistic mean
$\mathbb{E}[T] = \frac{1}{\lambda}(H_N - H_{N-K})$. Compute: evaluate $H_N$ and $H_{N-K}$ for the given parameters and take the difference.
Compare with no redundancy
With $K = N$ (no redundancy), $\mathbb{E}[T] = H_N / \lambda$: about $3\times$ slower at the given parameters. The polynomial-code redundancy buys a factor-3 speedup at this point.
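The order-statistic formula is easy to evaluate; the numbers below use assumed toy parameters ($N = 12$, $K = 4$, $\lambda = 1$), not the exercise's:

```python
def H(n):
    """n-th harmonic number H_n = 1 + 1/2 + ... + 1/n (H_0 = 0)."""
    return sum(1.0 / i for i in range(1, n + 1))

def expected_time(N, K, lam=1.0):
    """Expected K-th order statistic of N i.i.d. Exp(lam) task times."""
    return (H(N) - H(N - K)) / lam

t_coded = expected_time(12, 4)       # wait for the fastest K = 4 of N = 12
t_uncoded = expected_time(12, 12)    # no redundancy: wait for all 12
```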
ex-ch05-11
Hard: Prove that the $t$-private polynomial code of §5.4 leaks no information to any size-$t$ coalition. State the privacy guarantee precisely in mutual-information terms.
Coalition's view = $t$ evaluations of $\tilde{\mathbf{A}}(x)$ and $t$ of $\tilde{\mathbf{B}}(x)$.
Each polynomial carries $t$ uniformly random coefficients absorbing the $t$ observations.
Privacy claim
For any $\mathcal{T} \subseteq [N]$ with $|\mathcal{T}| \le t$: $I\big(\mathbf{A}, \mathbf{B};\ \{\tilde{\mathbf{A}}_i, \tilde{\mathbf{B}}_i\}_{i \in \mathcal{T}}\big) = 0$.
Argument for $\mathbf{A}$
Coalition's view of $\mathbf{A}$: the $t$ evaluations $\{\tilde{\mathbf{A}}(x_i)\}_{i \in \mathcal{T}}$. The polynomial has $m$ "data" coefficients plus $t$ uniformly random ones. The $t \times t$ Vandermonde linear system relating the masks to the observations has $t$ random degrees of freedom on the right-hand side, fully absorbing the $t$ observations. Hence the observations are uniform over the masking space, independent of $\mathbf{A}$.
Argument for $\mathbf{B}$
Identical: the $t$ random terms in $\tilde{\mathbf{B}}(x)$ absorb the coalition's observations. By symmetry, no information about $\mathbf{B}$ leaks.
Joint privacy
Since the masks for $\mathbf{A}$ and $\mathbf{B}$ are drawn independently, joint independence follows directly.
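The "observations are uniform" step can be illustrated exhaustively in a tiny case ($m = 1$, $t = 1$, field $\mathbb{F}_5$; assumed toy values, not the chapter's general construction): a single worker's view $\mathbf{A} + R\,x_i$ has the same distribution for every value of $\mathbf{A}$.

```python
p = 5

def view_distribution(A, x_i):
    """Histogram of the worker's view A + R*x_i over a uniform mask R."""
    counts = [0] * p
    for R in range(p):                 # R is the uniform random mask
        counts[(A + R * x_i) % p] += 1
    return counts

# Any nonzero evaluation point works; the view is uniform regardless of A.
d0 = view_distribution(0, 2)
d3 = view_distribution(3, 2)
```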
ex-ch05-12
Hard: Sketch the $t$-private polynomial-code construction for matrix-vector multiplication ($\mathbf{A}\mathbf{b}$ instead of $\mathbf{A}^\top\mathbf{B}$), with $m$ row partitions of $\mathbf{A}$. What is the recovery threshold?
$K = m + t$ in the matrix-vector case.
Setup
$\tilde{\mathbf{A}}(x) = \sum_{j=0}^{m-1} \mathbf{A}_j x^j + \sum_{k=1}^{t} \mathbf{R}_k x^{m+k-1}$; $\mathbf{b}$ is unencoded (no partitioning since $\mathbf{b}$ is a vector). Worker $i$ computes $\tilde{\mathbf{A}}(x_i)\mathbf{b}$, where $\tilde{\mathbf{A}}(x)\mathbf{b}$ has degree $m + t - 1$ in $x$.
Recovery threshold
$K = m + t$. With $t = 0$ this matches the polynomial code (since $n = 1$ gives $K = m$), and each privacy unit costs one extra response (vs. two in the matrix-matrix case, because $\mathbf{b}$ does not need its own masking).
Storage
Per-worker storage $\approx \frac{1}{m}$ of $\mathbf{A}$ (in normalized terms, since the vector $\mathbf{b}$ is small relative to the matrix). Privacy adds nothing to the per-worker storage: the masks fold into the same stored block.
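A minimal numeric sketch with $m = 2$, $t = 1$, scalar blocks over $\mathbb{F}_{97}$ (assumed toy values): $K = m + t = 3$ responses at $x = 1, 2, 3$ recover $\mathbf{A}_0 b$ and $\mathbf{A}_1 b$ by second-difference interpolation of the degree-2 response polynomial.

```python
p = 97
A0, A1, b, R = 10, 20, 3, 42          # R is the uniform random mask

def respond(x):
    """Worker at point x returns A~(x) * b = (A0 + A1*x + R*x^2) * b mod p."""
    return (A0 + A1 * x + R * x * x) % p * b % p

h1, h2, h3 = respond(1), respond(2), respond(3)

# Interpolate h(x) = c0 + c1*x + c2*x^2 from three consecutive points.
inv2 = pow(2, p - 2, p)
c2 = (h3 - 2 * h2 + h1) * inv2 % p    # coefficient of x^2: the mask term R*b
c1 = (h2 - h1 - 3 * c2) % p           # coefficient of x^1 -> A1 * b
c0 = (h1 - c1 - c2) % p               # coefficient of x^0 -> A0 * b
```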
ex-ch05-13
Hard: Consider the MatDot (entangled polynomial) variant of Section 5.4: it achieves recovery threshold $2m - 1$ at per-worker storage $\frac{1}{m}$ of each input, beating the polynomial code's recovery threshold. Why is this not a contradiction with the lower-bound converse of §5.3?
Read the statement of the converse carefully.
MatDot does not satisfy the polynomial code's correctness requirement: the encoded polynomials have specific relationships to each other.
Resolve the apparent contradiction
The §5.3 converse applies to schemes where $\tilde{\mathbf{A}}_i$ and $\tilde{\mathbf{B}}_i$ are independent linear combinations of the blocks of $\mathbf{A}$ and $\mathbf{B}$. MatDot introduces entanglement: the encoding of $\mathbf{A}$ depends on properties of $\mathbf{B}$'s partitioning as well, breaking the independence assumption.
Where the converse applies
The converse holds for "separable" coded schemes (each worker stores a separable function of the inputs). MatDot uses a joint encoding that does not fit this template.
Lesson
Information-theoretic lower bounds depend on the scheme class. Tight bounds for one class can be beaten by extending to a strictly larger class. The coded-computing literature has many such "partial converse" results, and characterizing the full achievable region remains an active research area.
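MatDot's $K = 2m - 1$ can be seen concretely with $m = 2$ scalar blocks (a toy instance of the entangled encoding, not the chapter's general construction): $p_A(x) = A_0 + A_1 x$ and $p_B(x) = B_0 x + B_1$ are matched so that the coefficient of $x^{m-1}$ in the product is the full result $A_0 B_0 + A_1 B_1$.

```python
A0, A1 = 3, 4                         # column blocks of A (scalars here)
B0, B1 = 5, 6                         # matching row blocks of B

def respond(x):
    """Worker at point x computes pA(x) * pB(x)."""
    return (A0 + A1 * x) * (B0 * x + B1)

# K = 2m - 1 = 3 responses determine the degree-2 product polynomial.
h1, h2, h3 = respond(1), respond(2), respond(3)
c2 = (h3 - 2 * h2 + h1) // 2          # second difference -> x^2 coefficient
c1 = h2 - h1 - 3 * c2                 # x^1 coefficient: the desired product
product = c1
```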
ex-ch05-14
Hard: A production system uses a polynomial code over the reals with a large $N$. The Vandermonde decoder matrix has an astronomically large condition number at integer evaluation points, completely unusable. Propose three distinct strategies to fix this without changing the recovery threshold.
Chebyshev points reduce the condition number by many orders of magnitude.
Real number system → modular arithmetic → finite field.
Decoupling encoding from decoding via random rotations.
Strategy 1: Chebyshev evaluation points
Replace $x_i = i$ with Chebyshev nodes $x_i = \cos\!\big(\frac{(2i-1)\pi}{2N}\big)$. The condition number drops by many orders of magnitude: still large but tractable in 128-bit floats.
Strategy 2: Switch to finite-field arithmetic
Compute the matrix product modulo a large prime $p$, then lift back to the reals. Avoids floating-point conditioning entirely. Cost: modular reduction of large matrices.
Strategy 3: Random rotations / orthonormal bases
Replace the Vandermonde encoding with a random orthogonal matrix. Provides the same recovery threshold with far better conditioning (orthonormal rows). Cost: random encoding loses the explicit Lagrange decoder structure.
Production choice
Most production deployments use Strategy 2 (finite-field arithmetic), often with the prime $p$ chosen to match the natural data type (e.g., a prime just below $2^{31}$ for int32 matrices). Strategy 1 is the right choice when finite-field arithmetic is too slow.
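Strategy 1's effect is easy to measure (a sketch assuming NumPy is available; $N = 16$ is an arbitrary illustrative size):

```python
import numpy as np

N = 16
integer_pts = np.arange(1.0, N + 1)                              # x_i = 1..N
cheb_pts = np.cos((2 * np.arange(1, N + 1) - 1) * np.pi / (2 * N))

cond_int = np.linalg.cond(np.vander(integer_pts))    # integer-point Vandermonde
cond_cheb = np.linalg.cond(np.vander(cheb_pts))      # Chebyshev-point Vandermonde
# cond_cheb is smaller by many orders of magnitude.
```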
ex-ch05-15
Challenge: Open problem. For coded matrix multiplication with non-symmetric storage budgets ($s_i$ varying across workers), the optimal recovery threshold is not known in full generality. Sketch a candidate construction that combines polynomial codes with unequal per-worker storage, and discuss what the recovery-threshold formula might look like.
Asymmetric polynomial codes use different evaluation points and exponent sets per worker.
Thinking of the problem as a weighted Vandermonde matrix may help.
Candidate construction
Assign each worker $i$ a budget $s_i$ (the number of evaluation points it can store). Encode with degrees adapted to the per-worker budgets: workers with more budget store more coefficients. The recovery threshold becomes the smallest $K$ such that $\sum_{i \in S} s_i \ge mn$ for every $K$-subset $S$ of workers.
Conjectured optimal $K$
For storage budgets $s_1, \dots, s_N$, the optimal $K$ is conjectured to be the smallest integer such that the $K$ smallest budgets sum to at least $mn$ (i.e., the $K$ workers with the smallest budgets collectively store enough). This is similar to the "fractional cover" formulation in coded caching.
Status
The conjecture is consistent with known special cases (symmetric storage $s_i \equiv 1$ recovers $K = mn$). A matching converse is open. This is one of the open problems of Chapter 18: a research direction at the intersection of coded computing and combinatorial optimization.
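The conjectured formula is a one-pass computation over sorted budgets (a hypothetical helper matching the sketch, with budgets measured in stored coefficients):

```python
def conjectured_K(budgets, m, n):
    """Smallest K such that the K workers with the SMALLEST budgets
    collectively store at least m*n coefficients; None if infeasible."""
    total = 0
    for K, s in enumerate(sorted(budgets), start=1):  # ascending: worst subset
        total += s
        if total >= m * n:
            return K
    return None
```

As a sanity check, unit budgets recover the symmetric threshold $K = mn$.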