Placement Optimization for Zipf Demand

Beyond Random Placement

Under Zipf demand, random uniform placement is suboptimal: popular files should be cached more often than unpopular ones. But pure popularity caching has its own issue: no coded-multicast opportunity because everyone caches the same files. The sweet spot is a hybrid placement that combines popularity bias with sufficient randomness for MAN-style delivery.

This section derives the optimal placement parameters for Zipf demand and characterizes the resulting rate. The Ji-Tulino- Llorca-Caire result shows that the optimized placement is order-optimal for the expected-rate setting.

Theorem: Order-Optimal Rate for Zipf Demand

Under Zipf-α\alpha demand with α(0,1)\alpha \in (0, 1), the optimal expected rate for the shared-link coded caching problem satisfies Rexp(M,α)  =  Θ ⁣(K1(M/N)1α1+K(M/N)1α).R^*_\text{exp}(M, \alpha) \;=\; \Theta\!\left(K \cdot \frac{1 - (M/N)^{1-\alpha}}{1 + K (M/N)^{1-\alpha}}\right). That is, the Zipf concentration α\alpha shifts the effective memory ratio from μ\mu to μ1α\mu^{1-\alpha} — caching popular files buys more than its memory weight.

Under Zipf, the top fraction μ\mu of files by popularity covers μ1α\mu^{1-\alpha} of the probability mass. So an MM-file cache covers a μ1α\mu^{1-\alpha}-fraction of demand — more than μ\mu. The MAN-like formula K(1μeff)/(1+Kμeff)K(1 - \mu_{\text{eff}})/(1 + K\mu_{\text{eff}}) with effective μeff=μ1α\mu_{\text{eff}} = \mu^{1-\alpha} achieves the claimed rate.

Popularity-Weighted Random Placement

Complexity: Placement: O(KN)O(KN) coin flips. Total storage per user: approximately MM files with high probability (Hoeffding concentration).
Input: Library {W1,,WN}\{W_1, \ldots, W_N\} ordered by popularity;
Zipf exponent α\alpha; memory ratio μ\mu; users 1,,K1, \ldots, K.
Output: Caches Z1,,ZK\mathcal{Z}_1, \ldots, \mathcal{Z}_K.
1. Compute file caching probability: qn=min(1,μN1α/HN,αnα)q_n = \min(1, \mu \cdot N^{1-\alpha} / H_{N,\alpha} \cdot n^{-\alpha}).
2. for each user kk do
3. \quad for each file n[N]n \in [N] do
4. \quad\quad User kk caches file nn independently with probability qnq_n.
5. \quad end for
6. end for
7. return (Z1,,ZK)(\mathcal{Z}_1, \ldots, \mathcal{Z}_K).

This is the "soft popularity" placement: popular files cached by more users, but each user's cache is randomized. Combines the hit- rate benefit of popularity with the coded-multicast opportunity of MAN. Asymptotically order-optimal for Zipf-α\alpha demand.

Example: Zipf Placement Design

For α=0.8\alpha = 0.8, N=1000N = 1000, K=100K = 100, M=100M = 100, derive the optimal placement and expected rate.

Key Takeaway

Under Zipf-α\alpha demand, the effective memory ratio is μ1α\mu^{1-\alpha}, not μ\mu. Popular files amplify the caching gain. The MAN-like rate formula applies with this effective memory; for typical α0.8\alpha \approx 0.8, the effective caching gain tefft_\text{eff} can be 5-10× the naive value. This is the practical answer to why coded caching is highly effective for video/media CDNs.

Open Problem: Exact Non-Uniform Rate

The Ji-Tulino-Llorca-Caire result is order-optimal: it gives the correct scaling in K,N,M,αK, N, M, \alpha. But the exact constant-factor rate is not known. Specifically, for Zipf demand, the question of whether a particular scheme is exactly optimal (like YMA '18 is for uniform demand) remains open.

Progress:

  • Zhang-Pedarsani-Ji (2015): refined constants.
  • Jin-Cui-Liu-Caire (2017): closed gap for specific regimes.
  • Niesen-Maddah-Ali (2017): worst-case Zipf analysis.

A tight characterization for arbitrary Zipf-α\alpha remains a research topic. The CommIT group has published several refinements but the problem is not fully solved.

Historical Note: The Non-Uniform Demand Research Arc

2014–2020

The coded-caching community's evolving understanding of non-uniform demand:

  • 2014: Maddah-Ali-Niesen's MAN scheme assumes uniform demand. The "Θ(M/N)\Theta(M/N) story" is worst-case optimal.
  • 2015: Pedarsani-Maddah-Ali-Niesen — first analysis of MAN under Zipf demand. Shows popularity can hurt.
  • 2015-2016: Multiple extensions: Zhang-Pedarsani-Ji, Ji-Caire- Molisch (D2D). Order-optimal bounds.
  • 2017: Ji-Tulino-Llorca-Caire — comprehensive Zipf analysis. Order-optimal achievable scheme; exact rate open.
  • 2018: Jin-Cui-Liu-Caire — structural refinements. Heterogeneous caches (Sengupta-Tandon-Clancy).
  • 2020+: Privacy-aware non-uniform analysis (Wan-Caire).

The story has become progressively more realistic: from worst-case / uniform to average-case / Zipf to heterogeneous / privacy-constrained. Each step brings the theory closer to practical CDN design.