Successive Refinement + Coded Caching
Why Successive Refinement Matters for Caching
Successive refinement (SR) encodes video as a base layer plus enhancement layers. Each layer is individually decodable; layers combine for higher quality. SR matters for caching for two reasons:
- Storage efficiency. Cache can store base layer of many videos; enhance on demand from server. Tradeoff: more video coverage vs per-video quality.
- Delivery efficiency. When users request the same video at different qualities, a base layer multicast serves all; only differential enhancements need per-user delivery.
The joint problem of SR + coded caching yields richer achievable regions than either alone. This section develops the formal theory.
Definition: Successive Refinement Placement
Successive Refinement Placement
SR placement strategy allocates cache fraction to layer , with . Each layer uses MAN placement with effective memory .
Design choice: how to split cache across layers. Options:
- Equal split: . Uniform quality across videos.
- Base-heavy: Most cache for base layer. Wide coverage, low quality.
- Top-heavy: Most cache for top layer. Narrow coverage, high quality.
- Optimal: Driven by demand profile and QoE metric.
The split interacts with demand: if most users request HD, store more HD-layer content. The optimization is a convex program over once the demand distribution is known.
Theorem: Optimal Cache Allocation Across SR Layers
For multi-rate coded caching with layer bitrates and demand profile , the optimal cache split maximizing expected QoE (under a concave QoE function of quality) is The split is weighted by popularity and bitrate.
Layers with higher popularity and higher bitrate deserve more cache. The formula is effectively "cache in proportion to expected delivery demand at each layer."
Setup
Maximize subject to . QoE at level proportional to MAN gain at , which depends on .
Lagrangian
Apply Lagrange with multiplier on the budget. KKT conditions yield: .
Solution
, normalized to sum 1.
Why Convexity Matters for QoE
The QoE function is concave in quality: a unit of quality at low resolution (SD β HD) is worth more than a unit at high (4K β 8K). This concavity means the optimization above is a standard convex program β tractable and uniquely solvable.
Convexity has a design implication: the optimal split favors diminishing returns. Cache everyone's base layer (big gain) before boosting individual users' top layer (small gain). This matches the fairness intuition: a 50% QoE improvement for many users dominates a 5% improvement for a few.
Chunk-Level Placement Delivery Rate
Delivery rate vs fraction of video chunks cached per user. Coded and uncoded compared. Coded achieves MAN gain even at chunk granularity.
Parameters
Example: SR Cache Allocation for Netflix-Like Service
A streaming service has quality levels. Observed demand: 20% SD (1 Mbps), 50% HD (3 Mbps), 30% 4K (10 Mbps). Per-user cache: Mbps-seconds. Determine optimal cache split.
Weights $p_\ell b_\ell$
SD: . HD: . 4K: . Total: .
Optimal split
, , .
Cache allocation
: SD Mbps-sec; HD ; 4K .
Operational meaning
Most cache devoted to 4K content (heavy bitrate). Only small allocation for SD. Matches intuition: 4K requests are scarce per-user but heavy in bits.
Multi-Rate Coded Caching for Successively-Refinable Content
The CommIT group's key contribution to video coded caching:
- Multi-rate framework. Extends MAN to successively- refinable content with quality layers.
- Achievable rate. per demand profile β preserving the coded multicasting gain at each quality level.
- Optimal cache split. Closed-form solution proportional to weights.
- QoE optimization. Integrates rate-distortion with user-facing QoE metrics.
This work bridges the rate-distortion / source-coding perspective (classical Gaussian successive refinement) with the multi-user multicasting perspective (MAN). It is the canonical reference for video coded caching in the post-2020 literature.
Common Mistake: Successive Refinement Is Not Universal
Mistake:
Applying SR analysis to any video encoder.
Correction:
Classical SR (Equitz-Cover) holds for Gaussian and some other sources. Real video encoders (H.264, HEVC) are only approximately successively refinable β there's a penalty from inter-layer prediction gaps.
In practice: SVC (H.264) incurs overhead vs single-layer encoding; HEVC SHVC . These are small but not zero. Analyses assuming exact SR should be interpreted as upper bounds.
Key Takeaway
Successive refinement + coded caching achieves rate. Optimal cache allocation across layers is . The convex optimization integrates rate-distortion and multicasting. CommIT Zhang- Moharrami-Caire 2020 is the foundational reference.