Prerequisites & Notation
Before You Begin
Chapter 9 opens Part III by establishing the federated- learning (FL) paradigm that the rest of the book builds on. The prerequisites are the distributed-SGD architecture of Chapter 1, basic SGD convergence intuition, and the gradient-coding framework of Chapter 6 (which is reused for FL straggler handling).
- Distributed SGD architecture (Chapter 1 §1.3)(Review ch01)
Self-check: State the per-round communication cost of synchronous distributed SGD in terms of , , .
- SGD convergence rates on strongly-convex losses(Review ch05)
Self-check: What is the asymptotic convergence rate of SGD on a -strongly-convex, -smooth objective?
- Gradient inversion attacks (Chapter 1 §1.3)(Review ch01)
Self-check: Why is a plaintext gradient not a privacy-preserving primitive for federated learning?
- Quantization and rate-distortion fundamentals(Review ch06)
Self-check: For a -bit uniform quantizer on a real-valued scalar with Gaussian distribution, what is the approximate distortion?
- Coded gradient computation (Chapter 6)(Review ch06)
Self-check: How does -gradient coding tolerate stragglers?
Notation for This Chapter
Chapter 9 introduces FL-specific notation. We use (lowercase) for the number of users in FL — distinguishing from (uppercase) used for workers in coded computing (Chapters 5–8). Each FL user has a local dataset and holds model parameters after broadcasting.
| Symbol | Meaning | Introduced |
|---|---|---|
| Number of users (clients) in FL — lowercase to distinguish from workers | s01 | |
| Client participation rate per round — fraction of users selected | s02 | |
| Local dataset of user (private, stays on device) | s01 | |
| Global model parameters at round | s02 | |
| User 's locally-updated model after local epochs | s02 | |
| Number of local epochs per round | s02 | |
| User 's local gradient on (same as Chapter 1) | s01 | |
| Global objective — average of user-local objectives | s01 | |
| Quantization / sparsification relative error tolerance (Chapter 6 §6.3) | s03 | |
| Model dimensionality (parameters) | s01 |