Exercises
ex-cc-ch03-01
EasyFor , , , compute (a) the MAN rate and (b) the cut-set lower bound. Identify the gap.
MAN: .
Cut-set: max over of .
MAN
. .
Cut-set
: . : , . : , . : . Max: 1.
Gap
Gap = MAN/cut-set = 1.5/1 = 1.5. MAN exceeds the cut-set bound by 50% — the converse is loose here but not hopelessly so.
ex-cc-ch03-02
EasyExplain in one paragraph why the cut-set argument does not distinguish between uncoded and coded placement.
The argument uses only entropy bounds on the caches.
Answer
The cut-set proof bounds using the fact that each cache has at most bits of entropy. This inequality holds whether the cache contents are raw file bits (uncoded placement) or arbitrary functions of the files (coded placement). The argument is placement-agnostic, which is why it cannot tightly capture the uncoded/coded distinction. The YMA '18 converse does distinguish, by using a symmetry argument that only applies under uncoded placement.
ex-cc-ch03-03
EasyFor , compute the MAN rate, the cut-set lower bound, and verify that the YMA '18 exact result coincides with MAN for .
YMA '18 says exactly.
Compute
: , cut-set (s=5): 5. Gap: 1. : , cut-set max at s = 2: , ; s=3: 1(1-1/1)=0... max overall: 1 at s=2? Wait: s=4: , . Recheck s=1: . s=2: 1. s=3: 0. Max = 1. Gap 2. YMA matches MAN (2). : , cut-set: s=2, ; s=1, . Max = 0.6. YMA says . : left as straightforward exercise.
Interpretation
The cut-set is loose at ; YMA '18 confirms that even under uncoded placement, is optimal. The gap between cut-set and YMA comes from tighter entropy bounds in the YMA argument.
ex-cc-ch03-04
EasyState the number of distinct demands that yields worst-case rate under MAN delivery, for and .
.
Compute
: worst (all users distinct). : worst (every file requested at least once).
Interpretation
When , some demand coincidences are forced (pigeonhole); the worst is when coincidences are minimized (each file requested times).
ex-cc-ch03-05
EasyExplain the correlated-files model: what is , and why does the WTJC rate converge to as ?
means all files are identical.
Answer
is the fraction of the file that is common across all files in the library. : all files are independent (MAN baseline). : all files are identical — the library effectively has only 1 file, and once all users cache a fraction of it (they need just one file's worth), delivery is trivial. Formally: WTJC rate the rate of broadcasting the common part, which is the difference between what users cache and the whole file — approaching zero when caches can store the (single) file.
ex-cc-ch03-06
MediumCut-set at the boundary. Prove that the cut-set bound with gives . For , this reduces to . Compare with MAN.
Plug in , (assuming , ).
Compute
. For , . For : , so .
Compare with MAN
at . Cut-set at : . Ratio MAN/cut-set = . For , : MAN = 2, cut-set = 0. Hmm, cut-set becomes tight only at and ; in between it is weaker than other choices.
ex-cc-ch03-07
MediumYMA '18 intuition. Sketch the averaging argument: why does averaging over user permutations give a lower bound that matches MAN?
Consider the worst-case demand vector .
Any permutation of yields another demand vector of the same type.
Uncoded placement is symmetric under user permutation.
Setup
Fix worst-case demand . For any permutation , the demand has the same type (same histogram of distinct demands). Uncoded placement is invariant under relabeling: .
Averaging
The rate for equals the rate for under the permuted scheme (just with users relabeled). Averaging over all permutations: the average rate equals the rate of the symmetric scheme, which is the MAN rate under this demand type.
Lower bound
Since the worst-case rate is at least the average rate, and the average equals the MAN rate, any scheme's worst-case rate is at least the MAN rate. Combined with MAN achievability, the two are equal. (This is the high-level structure; the full YMA '18 proof requires care with finite and non-integer .)
ex-cc-ch03-08
MediumCorrelated-files bound with memory split. For the WTJC model, show that allocating cache to the common component and to the variants (with cache-to-library ratio matching), the WTJC rate is , where .
Common part: single broadcast of size bits.
Variant part: MAN on reduced library of effective size .
Common-part rate
All users want the common component (every file has it), so a single broadcast suffices. The broadcast size is bits if users cache fraction of the common part. Rate contribution: files.
Variant-part rate
Variants are independent across files. Apply MAN with users, variants, per-user cache . Adjusted memory ratio if we reinterpret variants as a reduced library. Rate contribution: .
Optimize split
The total rate is a convex combination; taking the derivative with respect to and setting to zero yields the optimal split. In closed form for symmetric libraries: balances the two terms.
ex-cc-ch03-09
MediumWhen is the cut-set tight? Find conditions on such that the cut-set bound is achieved with equality by the MAN scheme.
Extremes of : gives and cut-set = .
At , trivially.
Check intermediate points.
Endpoint tightness
: MAN rate = . Cut-set with : . Tight. : MAN rate = 0. Cut-set trivially 0. Tight.
Intermediate tightness
For : at for integer , MAN rate . Cut-set with (when integer): matches after algebra. Generally, the cut-set is tight at the integer- MAN operating points when .
General case
For arbitrary , the cut-set bound is tight at MAN integer- points up to floor corrections; the YMA '18 bound (tighter) is always tight. The cut-set loses tightness when and are "misaligned."
ex-cc-ch03-10
MediumDemand type and delivery rate. Show that for the MAN scheme, delivery rate depends on the demand vector only through its type (histogram). Exhibit two demand vectors of different types with the same but different rates.
Same does not imply same type.
Consider and for .
Type vs. $N_e$
Type is the full histogram. has type and . has type and . Different types, different .
Equal $N_e$ with different types
For : has type and . has type and . MAN delivers the same number of subfiles in both cases (worst case per type), but the number of distinct missing-subfile index sets can differ, so rate may differ slightly.
ex-cc-ch03-11
HardFull YMA '18 converse (sketch). Prove the YMA '18 converse for the symmetric case and for integer . Show that any uncoded-placement scheme has worst-case rate .
Consider the permutations of demand .
Sum up the entropies of delivery messages across all permutations.
Use the fact that uncoded placement is invariant under user relabeling.
Setup
Fix an uncoded placement scheme. For each permutation , the demand vector is a worst-case demand (all distinct). The scheme, with user labels permuted, produces rate .
Symmetry
Because uncoded placement stores raw bits symmetrically, the scheme after permutation yields the same rate: for all .
Sum over permutations
Consider the delivery messages across all permutations. By a combinatorial counting argument over the entropies involved (done explicitly in the YMA '18 paper), the average rate is at least when .
Conclude
Since the average rate and all are equal to the same , we have .
ex-cc-ch03-12
HardA non-trivial coded placement example (Gómez-Vilardebó–Tian 2018). For , (so ), construct a coded placement scheme that achieves strictly lower rate than the MAN memory-sharing scheme at this .
MAN memory-shared at achieves rate for some .
Try placing 3 linear combinations (one per user) of the 3 files.
The bits stored: , , (parity bits).
MAN memory-sharing rate
At : . At : . , so . Shared rate: .
Coded placement
Place user 1's cache: (half the bits of , plus one parity bit of ). Similar for users 2, 3.
Delivery
With careful construction (see GV-Tian '18 §V), the worst-case rate becomes , beating the MAN memory-shared rate by about 6%. This is the first known coded-placement improvement for .
Caveat
The construction does not generalize to arbitrary . It shows that some improvement is possible over MAN, but the YMA '18 result conjecturally still holds (with small constant-factor corrections) for the coded-placement regime. This is an area of active research.
ex-cc-ch03-13
ChallengeOpen problem formulation. State the exact converse question for coded placement as a combinatorial-optimization problem. Discuss why standard IT techniques (Fano, cut-set, averaging) have not closed the gap.
The difficulty: coded placement breaks the symmetry that the YMA averaging argument exploits.
The space of coded placements is exponentially larger than uncoded.
Formulation
Given , the optimal coded-placement rate is subject to and decodability constraints. The optimization is over all measurable functions , not just bit-wise placements.
Why standard techniques fail
- Cut-set bounds joint entropy of caches but does not exploit combinatorial structure of the delivery phase.
- Averaging arguments (YMA '18) rely on user-permutation symmetry, which holds for uncoded but not coded placements.
- LP bounds grow exponentially in ; no tractable tightening.
- Algebraic bounds (entropy region) yield trivial bounds matching cut-set. The missing ingredient is a new technique that captures the coded-placement gain without sacrificing the worst-case demand bound.
Research directions
Proposed approaches in the literature:
- Entropy-region computer search (Csiszár-Körner '11 style) — scales to .
- Non-Shannon inequality-aided bounds — gains up to 4% for small .
- Symmetric-coded-placement restrictions (Tian-Chen '18) — prove MAN optimal under symmetry.
- Construction-based attacks: find a scheme strictly better than MAN for general .
ex-cc-ch03-14
ChallengePrivacy-constrained converse (preview). In the demand-private coded caching of Chapter 12, each user must not learn the demands of other users. State the analog of the MAN rate formula under this constraint: can the same rate be achieved with privacy?
The CommIT demand-privacy result (Wan–Caire '21+) is that yes, the same rate is achievable.
Claim
Wan-Caire '21+ showed that demand-private coded caching achieves the same rate as non-private MAN, via an augmented scheme that uses shared randomness to mask demand-dependent quantities.
Open question
Whether privacy strictly reduces rate for general coded-placement schemes (beyond MAN) is open. For uncoded placement, privacy comes at zero rate cost.
Reference
See Ch 12 for the full treatment. The CommIT group (Wan, Sun, Ji, Tuninetti, Caire) produced the foundational results on private coded caching.
ex-cc-ch03-15
ChallengeNon-uniform demand: state of the art. For Zipf-distributed demands with exponent , the expected rate of any scheme is a function of . State the best known order-optimal bounds and identify the exact-rate open question.
Consult Ji-Tulino-Llorca-Caire 2017 for the order-optimal bounds.
The exact rate remains open for general .
Known bounds
For Zipf demand with exponent , the expected worst-case rate is . The upper and lower bounds differ by a constant factor depending on .
Open question
Exact characterization of is open. The Caire group's work (Ji-Tulino-Llorca-Caire 2017, Jin-Cui-Liu-Caire 2017, Zhang-Pedarsani-Ji 2015) gives order-optimal bounds but not exact results.
Significance
Non-uniform demand is the realistic case; solving this question would bridge the worst-case theory of this chapter with production CDN workloads. Among the most important open problems in coded-caching theory.