Placement Phase: Combinatorial Subfile Assignment
Why This Particular Placement?
The MAN placement is a combinatorial design: each subfile goes to exactly caches, chosen by the identity of the subset . One could imagine many other placements (random, content-aware, hierarchical) but the MAN choice is special for a precise reason: it is the unique (up to symmetries) uncoded placement that equalizes, across users, the number of subfiles held by every pair. This symmetry is what makes the delivery phase so clean.
A designer could complain that the placement is rigid β it must be exactly right for the delivery to work. Chapters 14 and 18 show alternative placements (PDAs, multi-access, decentralized) that relax this rigidity at the cost of different tradeoffs.
MAN Placement Algorithm
Complexity: Storage per user: subfiles, each of size bits. Total: bits. β Off-peak bandwidth: bits (every subfile transmitted to the users that cache it).Theorem: Cache Size Consistency
Under MAN placement, each user caches exactly bits, i.e., files' worth of content, as required.
Count subfiles per cache
User caches subfile iff . The number of -subsets of containing is . Summing over :
Convert to bits
Each subfile has size bits, so using the identity and the definition .
The Symmetry That Matters
Two critical symmetries of the MAN placement:
Subfile-level symmetry. Every subfile is cached by exactly users. This is visible in the placement heatmap as row-wise balance.
User-level symmetry. Every user caches exactly subfiles per file. This is the column-wise balance.
Together, these two symmetries imply that for every pair with and , user does not have subfile . When user requests file , the missing subfiles it needs are exactly those indexed by -subsets that do not contain . There are such subsets. This count will reappear in the delivery analysis.
Example: Example Cache Contents: ,
For users and (implying ), enumerate user 1's cache contents for each of files.
List 2-subsets containing user 1
2-subsets of containing 1: β three subsets, matching .
User 1's cache per file
For each , user 1 stores . Three subfiles per file, each of size bits.
Total cache size
(). β
Missing subfiles per file
Total subfiles per file: . User 1 has 3, is missing 3: β all 2-subsets not containing 1. Count: . β
Key Takeaway
User 's cache content is uniform over its support, because the subfiles are mutually independent (different files are independent, same-file different-subfile are too under our assumption ). Hence bits exactly β the cache budget is fully used with no slack.
The Off-Peak Cost of Placement
A subtle point in the placement analysis: the MAN scheme requires the server to transmit every subfile to each of the users in . Total off-peak traffic: That is, one full library's worth for each user β the cache is entirely populated, as it must be.
In real CDN deployments the placement is done over a long period (days) and the delivery savings amortize over many delivery rounds. The economic argument for coded caching depends on the ratio of these two time scales.
- β’
Placement cost: bits per library refresh
- β’
Delivery cost per round: bits
- β’
Break-even: ratio of peak rounds to library refresh frequency must be large
Common Mistake: All Caches Are Identical in Structure, Different in Content
Mistake:
Saying 'all users cache the same subfiles' because the scheme is symmetric.
Correction:
The structure is identical β every user caches subfiles whose index sets contain that user, and every user caches the same number of subfiles. But the specific subfiles differ: user 1 caches ; user 2 caches as well (since ); user 3 does not cache . The combinatorial interleaving of caches is what lets the delivery XOR messages work.