Additivity of Caching and Spatial Gains (DoF=t+L\mathrm{DoF} = t + L)

Why Additive, Not Multiplicative?

Before stating the DoF theorem formally, let us build intuition. The caching gain tt says: one transmission can carry information for t+1t+1 users via XOR cancellation using their cached side information. The spatial gain LL says: one transmission can carry independent information to LL users via orthogonal beamforming.

Now ask: why can't these mechanisms compose multiplicatively? The answer is that they share the same "one transmission." The caching mechanism lets a single beam serve t+1t+1 users with a single message; the spatial mechanism lets LL independent beams coexist. The cache-aided scheme uses LL beams, each carrying a coded-caching XOR message. Each beam delivers to t+1t+1 users (via caching) β€” but the total number of users served is L(t+1)βˆ’Lt=L+tL(t+1) - L t = L + t after careful double-counting of who helps whom. The gains add.

This is not a hand-waving argument; it is forced by the DoF upper bound on any BC with side information. We state it rigorously now.

Theorem: Lampiris-Caire DoF Theorem

For the cache-aided Gaussian MIMO BC with KK single-antenna users, an LL-antenna transmitter, library size NN, per-user cache MM such that t=KM/N∈{0,1,…,K}t = K M/N \in \{0, 1, \ldots, K\} integer:

β€…β€ŠDoF(M)β€…β€Š=β€…β€Šmin⁑(t+L,β€…β€ŠK)β€…β€Š.\boxed{\;\mathrm{DoF}(M) \;=\; \min(t + L,\; K)\;}.

This DoF is achieved by the Lampiris-Caire scheme (Β§5.3) and matches the information-theoretic upper bound for this class of channels.

Additive decomposition: the caching gain tt contributes tt extra DoF on top of the classical MIMO BC DoF of LL, up to the saturation limit KK. The upper bound KK is trivial (cannot exceed the number of users).

πŸŽ“CommIT Contribution(2017)

Multi-Antenna Coded Caching β€” DoF = t+Lt + L

E. Lampiris, P. Elia, G. Caire β€” IEEE International Symposium on Information Theory

Lampiris, Elia, and Caire established that the DoF of the cache-aided MIMO BC is DoF=t+L\mathrm{DoF} = t + L, where t=KM/Nt = K M/N is the caching gain and L=LL = L is the transmit antenna count. The proof treats the delivery phase as transmission of general message sets to the users, exploiting both the cached side information and the spatial multiplexing degrees of freedom.

Key insight. Caching provides a "virtual antenna" effect: the tt cached bits at each user effectively act as if the user had tt additional antennas. Combined with the LL actual transmit antennas, the sum DoF is t+Lt + L. Crucially, this is not a multiplicative tLtL gain β€” the two mechanisms are decoupled.

The scheme is deployable at moderate KK and has been followed by extensive CommIT follow-up work (Lampiris-Elia-Caire 2018 on no-CSIT regimes, Lampiris-Caire 2019 on low-complexity schemes, Lampiris- Bhattacharjee-Caire 2020 on fading). The 2017 paper is the foundational "general message set" formulation.

coded-cachingmimocommitdofView Paper β†’

DoF Decomposition vs Number of Antennas

At fixed memory ratio ΞΌ\mu and number of users KK, plot DoF vs number of antennas LL. Three curves: (1) blue: total DoF = t+Lt + L (combined); (2) red dashed: pure MIMO (no caching) = LL; (3) green dotted: pure caching (L = 1) = t+1t + 1. The gap between the blue curve and the max(red, green) is the "additional" gain of combining caching with MIMO β€” and it equals min⁑(t,L)\min(t, L) in the additive regime.

Parameters
20
0.2
16

DoF Additivity: DoF=t+L\mathrm{DoF} = t + L

Three curves in the (M/N, DoF) plane: pure MIMO (horizontal, DoF = LL), pure caching (diagonal from (0, 1) to (1, K), DoF = t+1t+1), and combined (diagonal shifted up by LL, DoF = t+Lt+L). The combined curve is additive β€” the caching gain tt sits on top of the spatial gain LL.

Why the Converse Works

The DoF upper bound DoF≀t+L\mathrm{DoF} \leq t + L is not obvious β€” the naive argument says either tt (caching) or LL (MIMO) alone bounds DoF, not their sum. The tight converse uses a refined cut-set argument that accounts for both the cache budget and the antenna count simultaneously.

Concretely: consider ss users. Over TT channel uses, the transmitter sends TLT L DoF's worth of information (L DoF per channel use). Plus the caches contribute sM/Fs M / F file-units of side information. Together they must determine ss files. Hence TL+sM/Fβ‰₯sT L + s M/F \geq s, i.e., T/sβ‰₯(1βˆ’M/F)/LT/s \geq (1 - M/F)/L. Optimizing ss and applying uncoded-placement optimality yields DoF≀t+L\mathrm{DoF} \leq t + L.

The full proof (Lampiris-Caire '17 Β§IV) uses an averaging argument over symmetric demand vectors, similar to the YMA '18 style but extended to the multi-antenna setting.

Historical Note: The Emergence of Multi-Antenna Coded Caching

2015–2019

The multi-antenna extension of coded caching was explored independently by several groups in 2015–2017. Shariatpanahi, Caire, Khalaj (2016) gave the first scheme for 2-antenna systems; Naderializadeh, Maddah-Ali, Avestimehr (2017) treated the general MIMO BC with interference alignment. The clean result DoF=t+L\mathrm{DoF} = t + L and its tight converse were established by Lampiris, Elia, Caire (2017) via the general-message-set formulation.

The story illustrates a recurring theme in coded caching research: the combinatorial placement of MAN is the right primitive on which to build wireless extensions. Schemes that depart from this placement (e.g., random Gaussian coded placement) typically achieve the same DoF but with substantially more complex decoders.

Common Mistake: Don't Say the Gains 'Multiply'

Mistake:

Claiming that the caching gain (1+t)(1 + t) multiplies the spatial multiplexing gain LL to give (1+t)L(1+t) L.

Correction:

The correct formula is DoF=t+L\mathrm{DoF} = t + L, additive. The intuition for why it is not multiplicative: each channel use can carry LL independent streams (spatial DoF). Each stream, through cached side information, carries useful bits for t+1t+1 users (caching gain). Net users served per channel use: Lβ‹…(t+1)βˆ’Lt=L+tL \cdot (t+1) - L t = L + t, where the LtLt correction accounts for the overlap in cached bits across streams. This is the Lampiris-Caire calculation in one line.

A multiplicative gain L(t+1)L(t+1) would require an unrealistic combinatorial structure β€” there would need to be L(t+1)L(t+1) distinct "modes" of user discrimination, beyond what either mechanism alone can provide.