The Cache-Fronthaul Tradeoff
Two Resources, One Latency
The NDT formula reveals a fundamental substitution: cache and fronthaul capacity can be traded off to achieve the same delivery time. An operator can choose a cache-heavy architecture (cheap cache at each EN, modest fronthaul) or a cloud-heavy one (lean ENs, fat fronthaul). Both can deliver the same NDT.
The Pareto frontier — the set of pairs achieving a given NDT target — defines the architectural design space. This section characterizes this frontier and relates it to deployment costs.
Theorem: Iso-NDT Contours
For fixed and target NDT , the set of pairs achieving is The iso-NDT contours in the plane are hyperbolae: smooth tradeoff between cache and fronthaul at any given latency target.
Doubling the cache fraction halves the "1 - " factor, doubling the available slack. Doubling fronthaul halves the fronthaul contribution. Either works to meet the NDT budget.
Equation
Setting and solving: . Rearranging: .
Feasibility
Both and must be non-negative; ; . These bound the iso-NDT contour, yielding the feasible region.
Shape
In the plane, the contour is a hyperbola. Cache- cheap regime: large , small . Fronthaul-cheap regime: large , small .
NDT vs Memory Ratio
NDT as a function of memory ratio , for varying fronthaul capacities . At high , even small cache is enough to hit . At low , need large cache to approach NDT = 1. This is the "vertical" view of the cache-fronthaul tradeoff: hold fixed, vary cache.
Parameters
NDT Surface
3D surface plot of the NDT as a function of both cache ratio and fronthaul capacity. The surface is monotonically non-increasing in both variables; iso-NDT curves are the level sets. The Pareto frontier for any target is an iso-NDT contour on this surface.
Parameters
NDT Cache-Fronthaul Tradeoff Curves
Example: Architectural Choice: Cloud-Heavy vs Cache-Heavy
A 5G deployment has , , . Target NDT . Two architectures: (A) Cloud-heavy: , find minimum . (B) Cache-heavy: , find minimum .
(A) Cloud-heavy
. . . : infeasible!
NDT = 2 is too aggressive for ; the downlink alone contributes 2.375 already. Relax target: for , , files/use. That works.
(B) Cache-heavy
: . , so . Need 92% cache fraction — cache the entire library almost.
Comparison
Neither extreme meets NDT ≤ 2 comfortably. A balanced point — say , — gives , still above 2. The "sweet spot" depends on deployment constraints and cost ratios.
Cost consideration
Cache at RU costs ~500/Gbps/month. The economical choice depends on the content-to-bandwidth ratio. Video-heavy (large libraries) favors cache; general traffic favors fronthaul.
Definition: Marginal Substitution Rate
Marginal Substitution Rate
At a given operating point on an iso-NDT contour, the marginal substitution rate measures how much fronthaul can be saved by adding one unit of cache. From the NDT formula: This rate is steep near the cache-limited corner (small , need lots of cache growth for small reduction) and shallow near the fronthaul-limited corner.
In deployment economics: buy cache if , where are unit costs. This determines the optimal architecture along the iso-NDT curve.
Pareto Frontier in Operator Terms
The NDT iso-contour is the Pareto frontier for architectural decisions: any point on the contour is Pareto-optimal (no other point dominates it in both and ). Deployment decisions slide along this frontier based on:
- Cost structure. Cache hardware (storage + controllers) vs. fronthaul bandwidth lease/installation.
- Content catalog size. Larger library needs more aggregate cache; fixed scales with .
- Traffic pattern. Cacheable (video) benefits from cache; mixed traffic has JLEC separation concerns (Ch 6).
- Access latency. Cache at RU gives ms access; cloud fronthaul adds ms.
A typical 5G deployment sits in the "balanced" region of the Pareto frontier, with moderate cache () and moderate fronthaul ( files/use at high SNR).
Common Mistake: NDT Is Not a Rate
Mistake:
Confusing NDT with a throughput or capacity value.
Correction:
NDT is a dimensionless latency ratio. means "delivery takes twice as long as the baseline MU-MIMO." It does not mean "rate is half" — the relation to rate involves the SNR and the reference baseline. For rate conversions: at high SNR.
NDT = 1 corresponds to per-user rate = — the asymptotic single-user capacity.