Prerequisites & Notation
Before You Begin
This chapter generalizes the coded-caching framework to distributed computing. Prerequisites: MAN basics and exposure to MapReduce and stochastic gradient descent.
- MapReduce programming model
Self-check: What is the shuffle phase between map and reduce?
- Distributed gradient descent(Review ch26)
Self-check: What is a straggler and how does it limit SGD throughput?
- Polynomial codes / Reed-Solomon(Review ch09)
Self-check: Why can a degree- polynomial be reconstructed from evaluations?
Notation for This Chapter
Symbols for coded computing in distributed systems.
| Symbol | Meaning | Introduced |
|---|---|---|
| Number of workers (analog of users) | s01 | |
| Number of reduce output keys | s01 | |
| Computation load: each file mapped at workers | s01 | |
| Coded communication load (fraction of shuffle data) | s02 | |
| Gain parameter, analog of caching | s02 | |
| Mean worker completion time (Exp distribution) | s03 | |
| (gradient coding)$ | Straggler tolerance: tolerate any stragglers | s03 |