Exercises

ex-ch07-01

Easy

For N=20N = 20 workers, dataset size DD, per-worker memory M=0.4DM = 0.4 D. Compute the uncoded shuffling cost RuncodedR_{\text{uncoded}} and the Wan / Tuninetti / Caire coded rate R(M)R^*(M).

ex-ch07-02

Easy

Why does the Wan / Tuninetti / Caire coded-shuffling scheme achieve optimal rate in both achievability and converse?

ex-ch07-03

Easy

State why federated learning does not benefit from the coded-shuffling framework of this chapter, and what alternative shuffling strategies FL uses instead.

ex-ch07-04

Easy

For random-reshuffling SGD vs. i.i.d. sampling, state the convergence rates and why the former is preferred.

ex-ch07-05

Medium

Construct a concrete coded-shuffling placement and one epoch's delivery for N=4N = 4 workers, D=8D = 8 data points, M=2M = 2 (so Nμ=1N\mu = 1). Specify subfile partitions, placement, and broadcast messages for a particular permutation π\pi.

ex-ch07-06

Medium

Prove that the coded-shuffling rate R(M)=N(1μ)/(1+Nμ)R^*(M) = N(1 - \mu)/ (1 + N\mu) is decreasing in μ\mu. What is the physical interpretation?

ex-ch07-07

Medium

For N=30N = 30 workers and μ=0.05\mu = 0.05, compute both the centralized coded-shuffling rate and the decentralized variant. Quantify the gap.

ex-ch07-08

Medium

Compute the demand-private shuffling rate for N=20N = 20 workers, μ=0.2\mu = 0.2, collusion threshold T=2T = 2. Compare with the non-private rate.

ex-ch07-09

Medium

Explain how the coded-shuffling construction of §7.3 reuses the finite-field IA machinery of Chapter 4 directly. Identify the specific alignment happening in the XOR broadcasts.

ex-ch07-10

Medium

Why is centralized placement more efficient than decentralized placement? What is the underlying combinatorial reason?

ex-ch07-11

Hard

Prove the cut-set lower bound for coded shuffling: R(M)N(1μ)/(1+Nμ)R^*(M) \geq N(1 - \mu)/(1 + N\mu). Sketch the key inequalities.

ex-ch07-12

Hard

Extend the Wan / Tuninetti / Caire scheme to TT-demand-private shuffling. Show that the rate becomes Rpriv=N(1μ)/(1+NμT)R_{\text{priv}}^* = N(1 - \mu)/(1 + N\mu - T) and explain where the T-T comes from.

ex-ch07-13

Hard

Consider heterogeneous per-worker memory budgets: n1n_1 workers with memory M1M_1, n2n_2 workers with memory M2M_2 (M1<M2M_1 < M_2). Conjecture the optimal rate and argue why the homogeneous lower bound must weaken.

ex-ch07-14

Hard

Describe a hypothetical hybrid scheme combining coded gradient computation (Chapter 6) with coded data shuffling (this chapter). What benefits arise and what additional costs?

ex-ch07-15

Challenge

Open problem. Characterize the optimal achievability-converse closure for the demand-private coded-shuffling problem with TT colluders and straggler tolerance ss. Is there a clean formula Rpriv,strag(M,T,s)R^*_{\text{priv,strag}} (M, T, s)?