The Cache-Aided MIMO Broadcast Channel

From Error-Free Link to Wireless Reality

Chapters 1–4 worked with an error-free, infinite-capacity shared link. In wireless systems the shared link is replaced by a multi-antenna broadcast channel, with a transmitter equipped with LL antennas and KK single-antenna users. The link is noisy, bandwidth-limited, and has frequency-selective behavior.

Here is the question this chapter answers: how does the coded caching gain combine with the multi-antenna spatial multiplexing gain? A first-guess answer might be multiplicative — after all, both gains compress traffic. But the striking result of Lampiris, Caire, et al. (2017+) is that they are additive: the degrees of freedom satisfy DoF=t+L\mathrm{DoF} = t + L, with t=KM/Nt = K M/N. The two gains come from fundamentally different mechanisms and should not be expected to compound.

Definition:

Cache-Aided Gaussian MIMO Broadcast Channel

The cache-aided MIMO broadcast channel consists of:

  • A single transmitter with LL antennas, holding a library W={W1,,WN}\mathcal{W} = \{W_1, \ldots, W_{N}\}.
  • KK single-antenna users, each equipped with a cache of size MFM \cdot F bits.
  • User kk has channel vector hkCL\mathbf{h}_k \in \mathbb{C}^L; the observation per channel use is yk  =  hkHx+wk,wkCN(0,σ2),y_k \;=\; \mathbf{h}_k^H \mathbf{x} + \mathbf{w}_{k}, \quad \mathbf{w}_{k} \sim \mathcal{CN}(0, \sigma^2), with transmit power constraint E[x2]P\mathbb{E}[\|\mathbf{x}\|^2] \leq P. Define SNR=P/σ2\text{SNR} = P / \sigma^2.

The system operates in two phases: (i) placement — populate each user's cache off-peak; (ii) delivery — on demand vector d[N]K\mathbf{d} \in [N]^K, the transmitter emits signals x[1],,x[T]\mathbf{x}[1], \ldots, \mathbf{x}[T] over TT channel uses so each user can decode its requested file.

The CSIT assumption (channel state information at the transmitter) is non-trivial: in this chapter we assume perfect CSIT for the derivation of the DoF\mathrm{DoF} result. Chapter 7 extends to fading channels with limited CSIT.

Definition:

Delivery Rate and Degrees of Freedom

An (M,R)(M, R) scheme delivers KFK F bits of demand (one file per user) in TT channel uses, with R  =  KFT(σ2+1)log2(1+SNR)   bits per channel use per user.R \;=\; \frac{K F}{T \cdot (\sigma^2 + 1) \cdot \log_2(1 + \text{SNR})} \;\text{ bits per channel use per user}.

The sum degrees of freedom is DoF(M)    limSNRKR(SNR,M)log2SNR,\mathrm{DoF}(M) \;\triangleq\; \lim_{\text{SNR} \to \infty} \frac{K R(\text{SNR}, M)}{\log_2 \text{SNR}}, capturing the high-SNR slope of the sum-rate. In DoF terms, each user's file is a 1-DoF object (full MUD would satisfy KK users with DoF=K\mathrm{DoF} = K — but of course the shared-link constraint limits this via cache size).

Recap: The Non-Cached MIMO BC

Without caches, the Gaussian MIMO BC with LL transmit antennas and KK single-antenna users achieves DoFBC  =  min(L,K)\mathrm{DoF}_{\text{BC}} \;=\; \min(L, K) via zero-forcing beamforming (or dirty-paper coding for strict capacity). With L=1L = 1, DoFBC=1\mathrm{DoF}_{\text{BC}} = 1 (pure multicast or time-shared unicast). With LKL \geq K, the transmitter can serve all users in parallel.

Coded caching adds a new DoF resource: the aggregate cache size KM/N=tK M/N = t. Lampiris-Caire's theorem says these two resources stack additively.

Cache-aided MIMO BC topology

Cache-aided MIMO BC topology
Transmitter with LL antennas; KK single-antenna users each with cache Zk\mathcal{Z}_k. Delivery phase sends a single codeword x[t]\mathbf{x}[t] per channel use; each user receives a noisy projection yk=hkHx+wky_k = \mathbf{h}_k^H \mathbf{x} + w_k.

DoF vs Memory Ratio for Various LL

The cache-aided MIMO DoF as a function of μ=M/N\mu = M/N, for varying number of antennas LL. Solid blue is the chosen LL; dashed red is the single-antenna MAN baseline (L=1)(L = 1); dotted green is full cooperation (L=K)(L = K). Observe the additive structure: at M/N=0M/N = 0, DoF=L\mathrm{DoF} = L (pure MIMO); at M/N=1M/N = 1, DoF=K\mathrm{DoF} = K trivially; and in between DoF=t+L\mathrm{DoF} = t + L grows linearly in both the caching gain and the antenna count.

Parameters
20
4

Example: DoF Computation: K=10K = 10, N=10N = 10, M=2M = 2, L=3L = 3

Compute the DoF of the cache-aided MIMO BC for the parameters above. Decompose the gain into caching and spatial contributions.

Key Takeaway

Caching gain t=KM/Nt = KM/N and spatial multiplexing gain LL are ADDITIVE, not multiplicative. DoF=t+LK\mathrm{DoF} = t + L \leq K. Doubling antennas or doubling caches each add to DoF independently; they do not compound into a multiplicative tLtL gain. This asymmetry is the central message of the multi-antenna coded caching program.

🚨Critical Engineering Note

CSIT: The Achilles' Heel

The DoF result assumes perfect channel state information at the transmitter (CSIT). In practice, CSIT is obtained through pilot transmissions, uplink feedback, or reciprocity in TDD systems. Imperfect CSIT degrades the DoF substantially:

  1. No CSIT (LL effective = 1): DoF\mathrm{DoF} collapses to t+1t + 1 — caching gain survives, spatial multiplexing gain is lost entirely.
  2. Partial CSIT (estimation variance σe2\sigma_e^2): DoF=t+L(1σe2/σ2)\mathrm{DoF} = t + L(1 - \sigma_e^2/\sigma^2) approximately. The spatial contribution scales linearly with CSIT quality.
  3. Overhead cost: acquiring CSIT costs pilots (fraction τ/Tcoh\tau/T_{\text{coh}} of channel uses), reducing effective rate.

The CommIT group has published extensively on low-CSIT coded caching (e.g., Lampiris-Caire 2018 on no-CSIT regimes). A key result: coded caching partially substitutes for CSIT — the pre-stored cache is effectively "CSI-independent" signaling.

Practical Constraints
  • 5G NR requires CSIT acquisition every ~1 ms at sub-6 GHz

  • mmWave CSIT is more expensive (narrow beams, blockage)

  • Perfect-CSIT assumption simplifies DoF but overstates practical gain

  • FDD systems have higher CSIT cost than TDD (no reciprocity)