Graph-Theoretic Formulations

Why a Graph?

The key to making index coding tractable is to reframe it as a graph problem. Two receivers are "compatible" β€” can be served by a single transmission β€” when their demands are covered by each other's side information in a sense we will make precise. Non-compatibility is an edge; compatibility is an anti-edge. The resulting graph encodes the combinatorial structure of the instance.

Colorings, cliques, and independent sets of this graph directly give us upper and lower bounds on the broadcast rate. For the MAN family, the bounds coincide and yield the rate formula of Chapter 2.

Definition:

The Conflict Graph of an Index Coding Instance

Given an index coding instance I\mathcal{I} with single-demand receivers, the conflict graph G(I)=(V,E)G(\mathcal{I}) = (V, E) is defined by:

  • V={1,2,…,K}V = \{1, 2, \ldots, K\} β€” one vertex per receiver.
  • {i,j}∈E\{i, j\} \in E (an edge) iff diβˆ‰Sjd_i \notin \mathcal{S}_j and djβˆ‰Sid_j \notin \mathcal{S}_i. That is, receivers ii and jj conflict when neither has the other's desired message as side information.

Equivalently, {i,j}\{i, j\} is a non-edge (anti-edge) iff di∈Sjd_i \in \mathcal{S}_j or dj∈Sid_j \in \mathcal{S}_i β€” at least one of them already knows the other's demand.

Some references use the side-information graph (directed) instead: directed edge iβ†’ji \to j iff dj∈Sid_j \in \mathcal{S}_i. The undirected conflict graph here is the "symmetric complement" and is what governs the clique-cover upper bound.

Theorem: Chromatic Number Upper-Bounds Index Coding Rate

For any single-demand index coding instance with conflict graph GG, the optimal broadcast rate is upper-bounded by the fractional chromatic number: Ξ²(G)β€…β€Šβ‰€β€…β€ŠΟ‡f(G)β€…β€Šβ‰€β€…β€ŠΟ‡(G).\beta(G) \;\leq\; \chi_f(G) \;\leq\; \chi(G).

Partition the receivers into independent sets (color classes) of GG. Each color class is a group of receivers whose demands are pairwise satisfiable by a single XOR-coded transmission β€” because within an independent set, every pair of receivers has the desired message of the other in its side information.

Conflict Graph of a MAN Instance

Visualize the MAN conflict graph for small KK and tt. Each node is a (user, missing-subfile) pair; edges connect node pairs that cannot share a single delivery message. The number of independent sets needed to cover the graph equals the index-coding rate.

Parameters
4
2

Definition:

Broadcast Rate and Minrank

The broadcast rate Ξ²(I)\beta(\mathcal{I}) of an index coding instance is the optimal rate over all (possibly non-linear) codes. The linear broadcast rate Ξ»(I)\lambda(\mathcal{I}) restricts to linear codes over a fixed finite field. A foundational result (Bar-Yossef et al. 2011) shows Ξ»(I)β€…β€Š=β€…β€ŠminrkF(G(I)),\lambda(\mathcal{I}) \;=\; \text{minrk}_{\mathbb{F}}(G(\mathcal{I})), where minrk is the minimum rank of a matrix MM with Mii=1M_{ii} = 1 and Mij=0M_{ij} = 0 whenever {i,j}βˆ‰E(G)\{i, j\} \notin E(G), free otherwise.

In general, β≀λ\beta \leq \lambda, and the gap can be nonzero β€” there are index coding instances where non-linear codes strictly outperform linear codes.

Theorem: Independence Number Lower-Bounds Index Coding Rate

For any single-demand index coding instance with conflict graph GG, Ξ²(G)β€…β€Šβ‰₯β€…β€ŠΞ±(G),\beta(G) \;\geq\; \alpha(G), where Ξ±(G)\alpha(G) is the independence number of GG.

Let II be a maximum independent set of GG. Any two receivers in II are non-conflicting β€” their demands can in principle share transmissions. But crucially, the information delivered to each receiver in II still has entropy FF, so the broadcast must contain at least ∣Iβˆ£β‹…F|I| \cdot F distinct bits worth of information from the server's side. Rate β‰₯∣I∣=Ξ±(G)\geq |I| = \alpha(G).

Key Takeaway

The broadcast rate is sandwiched between Ξ±(G)\alpha(G) and Ο‡f(G)\chi_f(G): Ξ±(G)≀β(G)≀χf(G)≀χ(G).\alpha(G) \leq \beta(G) \leq \chi_f(G) \leq \chi(G). For MAN conflict graphs, the sandwich closes: Ξ±=Ξ²=Ο‡f=(Kβˆ’t)/(t+1)\alpha = \beta = \chi_f = (K-t)/(t+1). For general index coding, both the upper and lower bounds can be loose, and the gap is what makes the problem NP-hard.

Example: An Index Coding Instance with Ο‡>Ξ²\chi > \beta

Consider 4 receivers with the following side information: S1={2,4}\mathcal{S}_1 = \{2, 4\}, S2={1,3}\mathcal{S}_2 = \{1, 3\}, S3={2,4}\mathcal{S}_3 = \{2, 4\}, S4={1,3}\mathcal{S}_4 = \{1, 3\}. Each wants file WdkW_{d_k} with dk=kd_k = k. Compute the conflict graph, its chromatic number, and the optimal broadcast rate.

Common Mistake: The Chromatic Number Is Not Always Tight

Mistake:

Assuming the index coding rate always equals the chromatic number.

Correction:

Ο‡(G)\chi(G) is an upper bound on Ξ²(G)\beta(G); it is not always tight. Lubetzky and Stav (2009) constructed index coding instances with Ξ²=O(1)\beta = O(1) but Ο‡=Ξ©(log⁑n)\chi = \Omega(\log n). The gap is due to non-linear codes outperforming any linear (XOR-based) scheme.

For MAN conflict graphs, linear XOR codes are optimal and the chromatic / fractional chromatic / independence numbers coincide β€” hence the clean rate formula of Chapter 2.