Heterogeneous Caches and General Popularity
Beyond Uniform Setups
The MAN analysis assumes homogeneous caches () and uniform demand (every file equally likely). Real networks violate both:
- Users have different cache sizes. Mobile phones: 32-512 GB. Tablets: more. Home routers: much more.
- Demand is skewed. Zipf- with is typical. A few files attract most demand.
The optimal coded-caching rate under heterogeneous and general popularity is unknown in closed form. Bounds exist but are not tight. This is the second open frontier.
Definition: Heterogeneous Coded Caching Problem
Heterogeneous Coded Caching Problem
The heterogeneous coded caching problem allows:
- Cache sizes (non-uniform).
- Demand distribution (non-uniform).
Objective: minimize expected delivery rate For uniform and equal , this reduces to the worst-case MAN rate.
The combined degree of generalization is what makes the problem hard. Either heterogeneous caches alone or Zipf demand alone is partially understood; together, they are open.
Theorem: Upper and Lower Bounds for Heterogeneous Caches
For heterogeneous coded caching with cache sizes , the worst-case expected rate satisfies where is the mean memory ratio, and are bounded by constants depending on the spread .
Tight characterization is open.
At the mean memory ratio, MAN-like results hold. Heterogeneity creates slack that the optimal scheme may or may not exploit — whether it does is precisely the open question.
Upper bound (achievable)
Apply MAN to the homogeneous system with cache each (upper-bound everyone's cache at ; underperform for large-cache users). .
Lower bound (converse)
Apply cut-set with user subsets weighted by . Achieves of the same order.
Gap
bounded by ; closes in the limit . Tight characterization of the midrange remains open.
Heterogeneous Caching Bounds
Upper and lower bounds for heterogeneous caches as varies ( fixed). Shaded region between bounds shows the current knowledge gap.
Parameters
Zipf Popularity: What's Known
For Zipf- demand with homogeneous caches:
- Expected rate approach (Niesen-Maddah-Ali, Ji-Tulino-Llorca- Caire): Popularity-weighted placement gives order-optimal expected rate. Gap to lower bound: within constant factor.
- Worst-case approach: The MAN rate applies unchanged; popularity-aware schemes improve the expected rate but not the worst-case.
Neither approach is tight for general Zipf-. The optimal expected rate is known only up to constant factors.
Example: Heterogeneous Caches in a Mixed Device Network
A home network: 1 smart TV (cache GB), 3 phones ( GB). Total library 1 TB. What's the expected rate under uniform demand?
Parameters
GB. . ; .
Upper bound
MAN rate at : files/use.
Lower bound
Cut-set: . Loose.
Gap
Unknown tight rate in the range . Likely closer to MAN for small .
Practical
TV and phones jointly cache; coded MAN-style delivery across all four works but under-uses the TV's cache. Open: scheme that fully uses TV's excess capacity.
Memory Sharing: A Partial Solution
A clever partial solution exists: memory sharing. Treat each user's cache as a weighted sum of "virtual" caches of standard sizes. Apply MAN to each virtual system; deliver on the union.
For users with varying widely: memory sharing achieves something like the average rate, within a small constant factor. Not tight but useful.
This is why production systems treat heterogeneous caches practically: by aggregating into tiers (phone-tier, tablet-tier, TV-tier) each homogeneous within.
Heterogeneous Coded Caching with Shared Caches
The CommIT contribution to heterogeneous coded caching:
- Framework for heterogeneous cache sizes. Formalizes the per-user cache size variation and shared-cache structures.
- Achievable scheme. Memory-sharing-based scheme with provable rate close to the (loose) cut-set lower bound.
- Practical relevance. Addresses real-world heterogeneous deployments (mobile + tablet + TV) where MAN's uniform assumption fails.
- Gap analysis. Quantifies the distance from tight characterization; identifies the research frontier.
This paper is part of the CommIT group's ongoing work to close the heterogeneous-cache gap. Its results complement the Zhang-Moharrami-Caire multi-rate framework (Ch 21) — both address practical generalizations of MAN.
Common Mistake: Don't Assume Mean Memory Is Enough
Mistake:
Applying MAN with and ignoring the variance in .
Correction:
For highly heterogeneous caches (e.g., 1 TB vs 1 GB), the mean tells only part of the story. The large-cache user may be severely underserved by a mean- placement; conversely, small-cache users may be served suboptimally.
Memory sharing + per-tier MAN approximates the optimum but does not close the gap. Use with awareness: may be substantial for high variance.
Key Takeaway
Heterogeneous caches and general popularity remain open. Known bounds give constant-factor characterization; tight optimal schemes are unknown. CommIT Ibrahim-Caire 2024 is the latest contribution; practical deployments use memory sharing as approximation. The heterogeneous case is the most significant open problem after coded placement.