Quality-of-Experience and Chunked Delivery

Why QoE, Not Just Rate?

The MAN rate is a network-side metric: files per channel use. Users care about a different metric: Quality of Experience (QoE). QoE captures:

  • Startup latency. Time from play-click to first frame.
  • Rebuffering. Frequency and duration of pauses.
  • Average bitrate. Perceived video resolution.
  • Bitrate switching. Frequency and magnitude of quality jumps.

Production streaming optimizes for QoE; so should QoE-aware coded caching. This section develops the QoE-caching coupling, connecting the information-theoretic $1 + K\mu$ gain to user-perceived quality.

Definition:

Standard QoE Model

A canonical QoE metric (Yin-Jindal 2015):

$$\text{QoE} \;=\; \sum_{t=1}^T q^{(t)} - \lambda_s \sum_{t=1}^{T-1} \left| q^{(t+1)} - q^{(t)} \right| - \lambda_r \cdot T_{\text{rebuf}} - \lambda_d \cdot T_{\text{startup}},$$

where:

  • $q^{(t)}$: average bitrate watched at time $t$ (higher is better).
  • $\lambda_s$: smoothness penalty (bitrate switching is jarring).
  • $\lambda_r$: rebuffering penalty (dominant concern).
  • $T_{\text{rebuf}}$: total rebuffering duration.
  • $T_{\text{startup}}$: startup delay.

The coefficients $\lambda_{s,r,d}$ are calibrated to user studies: typically $\lambda_r \approx 1$ per second of rebuffering (a very strong penalty); $\lambda_s, \lambda_d \approx 0.1$.
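The metric above can be written as a small function (a sketch; the session bitrates and stall/startup times below are made-up illustrative values):

```python
def qoe(bitrates, t_rebuf, t_startup, lam_s=0.1, lam_r=1.0, lam_d=0.1):
    """Composite QoE: total bitrate minus smoothness, rebuffering, startup penalties."""
    quality = sum(bitrates)
    smoothness = sum(abs(b - a) for a, b in zip(bitrates, bitrates[1:]))
    return quality - lam_s * smoothness - lam_r * t_rebuf - lam_d * t_startup

# Two sessions with identical bitrates; one stalls for 4 seconds.
smooth = qoe([3.0, 3.0, 3.0, 3.0], t_rebuf=0.0, t_startup=0.5)
stalled = qoe([3.0, 3.0, 3.0, 3.0], t_rebuf=4.0, t_startup=0.5)
print(smooth, stalled)  # with lam_r = 1, the 4 s stall costs 4.0 QoE points
```

With the default coefficients, a single 4-second stall outweighs the startup penalty by a factor of 80, matching the observation that rebuffering dominates.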

QoE is highly sensitive to rebuffering: users tolerate lower bitrates but hate stalls. This drives caching design toward aggressive pre-fetch and chunk availability.

QoE vs Cache Size

Composite QoE (bitrate, startup, rebuffering) vs cache size. Coded caching boosts QoE via reduced miss rate, fewer rebuffers, and faster startup.


Definition:

Chunked Video Delivery

In chunked delivery (standard in HLS, DASH), each video is divided into chunks (typically 2-10 seconds each). Chunks are the units of caching and delivery. For a video of duration $T_v$ seconds with $n_c = T_v / \tau_c$ chunks (chunk length $\tau_c$):

  • Each chunk is encoded at $L$ quality levels.
  • Client fetches chunks sequentially.
  • Server can deliver any subset of chunks in any order.

Coded chunk caching: apply MAN at chunk granularity. Each cache stores a subset of chunks; coded XOR delivery runs over $(t+1)$-subsets of users requesting different chunks.

Chunked structure refines the MAN analysis: instead of file-granular subfiles, we work with chunk-granular placement. Subpacketization benefits from chunk structure: typical video chunks are large enough ($\sim 5$ MB) to support practical combinatorial splits.
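A quick subpacketization check (a sketch; the parameter values are illustrative, not from the text — MAN splits each cached unit into $\binom{K}{t}$ subfiles with $t = K\mu$):

```python
from math import comb

# Illustrative parameters: 20 caches, each holding a 1/4 fraction.
K, mu = 20, 0.25
t = round(K * mu)            # t = 5
subfiles = comb(K, t)        # C(20, 5) subfiles per cached unit
chunk_bytes = 5_000_000      # ~5 MB chunk, as in the text
print(subfiles, chunk_bytes // subfiles)  # subfile count and size in bytes
```

Even at this modest $K$, a 5 MB chunk splits into 15504 subfiles of a few hundred bytes each, so chunk sizes set a practical ceiling on how large $K$ can grow before subfiles become smaller than packet overhead.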

Theorem: Chunk-Level Coded Caching Rate

For chunk-level coded caching with $n_c$ chunks per video, per-user cache $M$ (in chunks), and uniform chunk demand, the achievable delivery rate is

$$R \;=\; \frac{K(1 - \mu)}{1 + K\mu} \cdot \text{chunks per second}.$$

Same as MAN, applied at chunk granularity.

Chunks play the role of "files" in the basic MAN analysis. The rate formula is unchanged. Advantage: chunks are smaller, subpacketization is more practical.
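The rate expression is easy to evaluate directly (a minimal sketch; the parameter values are illustrative):

```python
def man_rate(K, mu):
    """MAN delivery rate R = K(1 - mu) / (1 + K*mu), in chunk-transmission units."""
    return K * (1 - mu) / (1 + K * mu)

# Coded vs. uncoded delivery at K = 20 users, mu = 0.25:
print(man_rate(20, 0.25))   # coded: 20 * 0.75 / 6 = 2.5
print(20 * (1 - 0.25))      # uncoded unicast of the uncached fraction: 15.0
```

The $1 + K\mu$ denominator is the multicast gain: here it cuts the load by a factor of 6.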

Example: YouTube-Scale Chunk Caching

YouTube-style streaming: $K = 1000$ concurrent viewers in a geographic region, $N = 10^7$ videos, each with 300 chunks of 5 seconds. Per-user cache: 100 chunks. Analyze delivery rate and QoE.
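The numbers can be plugged in directly (a sketch; interpreting $\mu$ as the cached fraction of the full catalog under uniform demand is an assumption of this calculation):

```python
# Worked numbers for the YouTube-scale example; mu is taken as the
# cached fraction of the *entire* catalog, assuming uniform demand.
K = 1000                          # concurrent viewers
catalog_chunks = 10**7 * 300      # 10^7 videos x 300 chunks each
cached_chunks = 100               # per-user cache, in chunks
mu = cached_chunks / catalog_chunks          # ~3.3e-8
R = K * (1 - mu) / (1 + K * mu)              # chunk-level MAN rate
print(f"mu = {mu:.2e}, R = {R:.2f}")
```

Under uniform demand over the whole catalog, $K\mu \approx 3 \times 10^{-5}$ and the coded gain is negligible; this is one reason production systems pair caching with popularity prediction, which shrinks the effective catalog and raises $\mu$.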

πŸ”§ Engineering Note

Production QoE Optimization

How production services optimize QoE:

  1. Chunk prefetch. Client fetches 2-3 chunks ahead; the buffer smooths over network variation. Coded caching reduces per-chunk server load, easing prefetch.
  2. Adaptive bitrate switching. Client selects quality based on network estimation. Caching makes bitrate upgrades cheap (base layer already cached).
  3. Regional CDN tiers. Origin β†’ regional PoP β†’ ISP cache β†’ device. Chunks propagate down on first miss. Coded caching can be applied at each tier.
  4. ML-based popularity prediction. YouTube/Netflix use ML to forecast which videos to cache where. Combines with coded caching opportunistically.

Status: Coded caching at chunk granularity is research-stage; production CDNs use LRU/LFU on chunks. Integration of coded layer is a 3-5 year practical roadmap.
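The prefetch behavior in item 1 can be sketched with a toy buffer model (all numbers are illustrative; this is not a real player):

```python
# Toy chunk-prefetch model: each tick the client plays one chunk if buffered;
# the network delivers a bursty pattern with mean 1 chunk per tick.
def stalls(delivery, prefetch):
    buffer, stalled = prefetch, 0       # start with `prefetch` chunks buffered
    for arrived in delivery:
        buffer += arrived
        if buffer >= 1:
            buffer -= 1                 # play one chunk
        else:
            stalled += 1                # rebuffer: nothing to play
    return stalled

bursty = [0, 2] * 10                    # bursty arrivals, mean 1 chunk/tick
print(stalls(bursty, prefetch=0))       # no prefetch: stalls on the first gap
print(stalls(bursty, prefetch=2))       # a 2-chunk prefetch absorbs the bursts
```

A prefetch depth of 2-3 chunks is exactly the buffer that converts bursty delivery into stall-free playback, which is why item 1 is listed first.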

Practical Constraints

  • YouTube: ML prediction + LRU chunks
  • Netflix Open Connect: static predictive + LRU
  • Akamai: regional LRU + admission control
  • Coded chunk caching: research stage


Common Mistake: Rate Optimization β‰  QoE Optimization

Mistake:

Assuming minimizing $R$ automatically maximizes QoE.

Correction:

QoE has multiple components:

  • Bitrate $\sim (1 - \text{rate penalty})$. Lower rate β†’ higher bitrate.
  • Rebuffering $\sim$ variance of delivery. Lower rate doesn't help if delivery is bursty.
  • Startup $\sim$ first-chunk latency. Depends on cache hit, not delivery rate.

A caching scheme that reduces average rate but causes rebuffering spikes can reduce QoE. Practical schemes optimize (expected bitrate) $- \lambda_r \cdot$ (rebuffering variance), not just average rate.
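The rebuffering point can be demonstrated with two delivery traces of equal mean but different variance (a toy model; all numbers are illustrative):

```python
# Two traces delivering the same total (mean 1 chunk/tick), timed differently.
def rebuffer_time(trace):
    buffer, rebuf = 0, 0
    for arrived in trace:
        buffer += arrived
        if buffer >= 1:
            buffer -= 1        # play one chunk
        else:
            rebuf += 1         # stall for this tick
    return rebuf

trace_a = [1] * 16             # smooth: mean 1, zero variance
trace_b = [0, 0, 0, 4] * 4     # bursty: same mean, delayed bursts
print(rebuffer_time(trace_a), rebuffer_time(trace_b))
```

Both traces have identical average rate, yet the bursty one stalls for 3 ticks while the smooth one never stalls: average rate alone does not determine QoE.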

Key Takeaway

Video caching targets QoE, not just rate. QoE composites bitrate, startup, rebuffering, smoothness. Coded caching helps all four via reduced miss rate and multicast efficiency. Chunk-level coded caching is a practical refinement of MAN, suitable for integration with DASH. Production integration is a 3-5 year roadmap.