The Subpacketization Bottleneck
Why MAN Does Not Scale to Large
The MAN scheme has a serious practical limitation that has kept it out of production systems: subpacketization grows exponentially in the number of users. Each file must be split into subfiles, and each subfile must be tracked, transmitted, and decoded. For , : subfiles per file. At this scale, individual subfiles are too small to contain useful data (each ~1 bit), and the coding overhead overwhelms the savings.
This chapter addresses the subpacketization bottleneck. The CommIT contribution — graph-coloring delivery schemes — achieves most of the MAN coded-multicasting gain with polynomial subpacketization for constant . This makes coded caching deployable at realistic CDN scale ( users per cluster).
Theorem: MAN Subpacketization Is Exponential
The MAN scheme requires subpacketization where . Using Stirling's approximation, where is the binary entropy function. For any fixed , grows exponentially in .
is the number of subsets of size from a -element set. For , this is — doubling squares . For : — still exponential.
Stirling
where .
Implication
For any fixed in , , so grows exponentially in .
Numerical
For : . At : . At : . At : astronomical.
Consequence
Deploying MAN at scale requires content libraries with files of size bits. For video (1 GB file, 1 KB minimum subfile granularity): max , giving -. Beyond this, MAN is infeasible.
Subpacketization: MAN vs PDA vs Graph-Coloring
Log-scale plot of required subpacketization vs for three schemes at fixed target : MAN (exponential), PDA (polynomial), graph-coloring (). At MAN becomes infeasible for typical file sizes; PDA and graph-coloring remain practical to +.
Parameters
Subpacketization Feasibility vs File Size
For fixed memory ratio, plot the subpacketization of MAN (red) and PDA (blue) vs . Horizontal line at file size (in bits): above this line, subfiles become smaller than practical granularity. MAN crosses this line at modest ; PDA stays feasible at much larger scales.
Parameters
Example: Where Does MAN Break?
A 1 GB video file ( bits). Minimum practical subfile size: 1 KB ( bits) due to header overhead. Max practical . At what does MAN break at ?
Solve
. Stirling: . , so .
Conclusion
MAN is practical up to users for video files at . Beyond this: subfiles too small, scheme breaks.
Solution?
Either (a) use smaller files (infeasible — high overhead) or (b) use polynomial-subpacketization schemes (PDA, graph- coloring). Chapter 14 develops (b).
Key Takeaway
MAN's subpacketization limits practical deployment to . For larger networks, polynomial- subpacketization schemes are essential. The CommIT graph- coloring schemes (§14.3) and PDA framework (§14.2) are the answers.
Subpacketization in Practice
Realistic numbers for subpacketization planning:
- File granularity. HTTP byte-range minimum: 1 KB per subfile. Smaller than this, header overhead dominates.
- Feasible file sizes. Video: 100 MB-2 GB per file. Software: 10 MB-1 GB. Documents: 1 MB-100 MB.
- Max feasible : 10^4 for small files, 10^6 for large.
- Max practical at : with MAN, . With PDA, +.
- Subfile indexing overhead. Each subfile has a label ( bits of metadata). For large , this is non-negligible.
Production coded-caching deployments must plan subpacketization at design time. Picking the wrong subpacketization scheme leads to 100× overhead penalty at runtime.
- •
HTTP byte-range min: 1 KB
- •
Subfile label: ~log_2 F bytes per subfile
- •
MAN feasible only for K ≤ 50 at μ = 0.1
- •
PDA / graph-coloring extend to K = 1000+
Common Mistake: Theoretical Rates Hide Subpacketization
Mistake:
Citing MAN rate as achievable for a 1000-user network without addressing subpacketization.
Correction:
At , : MAN subpacketization is astronomical. The rate formula is mathematically correct but operationally unachievable without a different scheme. Honest presentation: "MAN achieves rate if each file can be split into subfiles; this is feasible only for ."
For large , polynomial-subpacketization schemes achieve a constant-factor approximation of the MAN rate. The relevant achievable rate is typically -× the MAN rate, not the MAN rate itself.