CCESA: The Sparse Random-Graph Construction (CommIT Contribution)
The Sparse Random-Graph Idea
The core insight of CCESA is simple to state but subtle to analyze: use a sparse random graph of masks instead of the complete graph. If the Bonawitz protocol mask pair-graph is replaced by an Erdős–Rényi random graph with , the resulting protocol still provides the privacy guarantees of Bonawitz (with high probability), and its communication overhead drops from to .
The point is that the complete-graph mask structure of Bonawitz is overkill. The graph just needs to be "connected enough" to cancel masks in the aggregate, and an Erdős–Rényi graph is connected with high probability at edge densities much lower than the complete graph.
Section 12.2 develops the CCESA construction and identifies it as the fourth CommIT contribution of Part III. Section 12.3 proves the privacy and reliability guarantees; §12.4 places CCESA in the broader design space.
CCESA: Communication-Efficient Secure Aggregation via Sparse Random Graphs
CCESA is the CommIT-group contribution by Byeong-Geun Choi, Jy-yong Sohn, Dong-Jun Han, Jaekyun Moon, and Giuseppe Caire. It reduces the Bonawitz protocol's per-round communication overhead from to by using a sparse Erdős–Rényi random graph of pairwise masks instead of the complete graph.
Key technical contributions:
-
Sparse random-graph mask structure. Each pair of users shares a pairwise mask with probability for a constant , independently per pair. The expected number of mask pairs per user is instead of .
-
Privacy + reliability guarantees via random- graph theory. The privacy guarantee holds with probability (uniform over the graph draw). Reliability (server can always reconstruct) holds with the same probability.
-
Communication complexity . Matches the information-theoretic lower bound for sparse-graph schemes up to logarithmic factors.
-
Compatibility with dropouts and threshold cryptography. CCESA is drop-in compatible with Chapter 10's dropout-handling via Shamir-shared seeds; the sparse-graph structure does not break this integration.
Why it matters: CCESA unlocks secure aggregation at large — essential for modern cross-device FL (millions of users) and cross-silo federations (hundreds of institutions). The sparse-graph idea transfers to many secure-aggregation variants and is now standard in production thinking.
The result is the fourth CommIT Part III contribution (after Chapter 3 foundations, Chapter 10 optimality, and Chapter 11 ByzSecAgg), closing the CommIT-group privacy-preserving FL research programme at the information-theoretic frontier.
Definition: CCESA Protocol
CCESA Protocol
The CCESA protocol for users, collusion threshold , and edge probability (for some constant ) operates as follows:
Phase 0: Graph Sampling. A publicly-known pseudorandom generator samples an Erdős–Rényi graph : for each pair , independently include the edge with probability . The graph is derived deterministically from a round-specific public seed, so all users and the server agree on .
Phase 1: Pairwise Key Exchange (sparse). For each edge , users and perform a Diffie–Hellman exchange and derive a pairwise seed via a PRG. Users do not exchange keys with non-neighbors.
Phase 2: Masked Upload. User uploads where is 's graph neighborhood and is a self-mask (as in Bonawitz §10.3). Note: far fewer masks per user than in Bonawitz.
Phase 3: Aggregation. Server computes . Masks cancel because edges in the graph contribute symmetric pairs.
Phase 4: Dropout Handling (optional). Shamir shares of the pairwise seeds are exchanged as in Bonawitz — but only over the sparse graph, reducing overhead.
The protocol's correctness and privacy depend on the graph being "well-connected" with high probability, which Erdős–Rényi at achieves.
The key contrast with Bonawitz: each user has degree in CCESA vs. degree in Bonawitz. Per-user DH exchanges drop from to , and aggregate DH overhead from to . Reliability and privacy are preserved with high probability over the random graph.
CCESA
Communication-Efficient Secure Aggregation via sparse Erdős–Rényi random graphs. Reduces Bonawitz's overhead to while preserving information-theoretic privacy (with high probability). The fourth CommIT-group Part III contribution.
Erdős–Rényi Random Graph
The random graph where each of the edges is included independently with probability . Classical connectivity threshold: is connected w.h.p. iff for any . CCESA uses edge density , well above the connectivity threshold, for privacy + reliability guarantees.
CCESA Protocol
Complexity: Per-user: DH exchanges, -scalar upload. Aggregate: DH + Shamir-share overhead.The protocol is structurally identical to Bonawitz — only the graph density differs. All the same cryptographic primitives (DH, PRG, Shamir sharing) apply. Implementation overhead is low; performance benefit is dramatic.
Theorem: CCESA Communication Complexity
The CCESA protocol with users, edge probability (for some constant ), and model dimension has:
- Per-user uplink: scalars (gradient) + DH exchanges.
- Aggregate per-round communication: .
- Per-user DH cost: — a reduction over Bonawitz.
Simplified for the typical regime: — sub-quadratic, substantial improvement over Bonawitz's .
Each user's degree in is with expectation . Chernoff: with high probability, every user has degree . Hence each user does DH exchanges, not as in Bonawitz. Aggregate overhead: — sub-quadratic.
Operationally: at , CCESA's DH phase is exchanges per user — tractable in seconds. Bonawitz at the same scale would need exchanges per user — several minutes, impractical.
Degree concentration
Each user 's degree is a sum of independent Bernoulli random variables. By Chernoff: . For , this is — vanishingly small.
Aggregate degree
Sum of degrees = . Total edges: w.h.p.
DH exchanges
One DH per edge: DH exchanges aggregate, vs. Bonawitz's . Ratio: — sub-quadratic savings.
Gradient upload
Unchanged: scalars aggregate. For moderate-to-large , gradient dominates; for small , the overhead dominates. CCESA is best when both terms are comparable.
Example: CCESA vs. Bonawitz at
For users, model size , compute per-user DH exchanges and aggregate communication for both Bonawitz and CCESA.
Bonawitz
Per-user DH: . Aggregate DH: . Gradient upload: scalars. Total: (gradient dominates for ).
CCESA
Edge probability: . At : . Per-user DH: — a reduction over Bonawitz. Aggregate DH: — fewer. Gradient: same . Total: (gradient still dominates).
DH-time savings
Bonawitz DH time: s = 5.8 days. CCESA DH time: s = 2.8 hours. Ratio: faster DH phase.
Conclusion
At , CCESA reduces the Bonawitz overhead from "years per round" to "hours per round" — a deployability transformation. Gradient upload dominates both total communications, but CCESA's reduction in DH time is critical for production viability.
From Complete Graph to Sparse Erdős–Rényi
CCESA Graph Density vs. Communication Savings
Plot the per-user DH-exchange count vs. number of users for CCESA (at various values in the edge probability ) and for Bonawitz ( exchanges per user). Larger gives denser graphs with better connectivity margins but more overhead; typical –.
Parameters
The Graph Is Pseudorandomly Generated
CCESA's random graph is not drawn fresh by each user independently — that would require all users to agree on the graph, requiring communication to confirm. Instead, the graph is pseudorandomly generated from a public round-specific seed. All users and the server apply a known PRG to the seed and obtain the same graph deterministically.
This is a standard trick in protocol design: use public randomness for structural decisions (which users share keys) that must be agreed upon, reserving fresh randomness for the cryptographic primitives (the DH exchanges themselves, the mask derivation). Production implementations use a round-specific seed derived from a beacon or blockchain hash.
CCESA in Production
Production CCESA deployments (as of 2024) are research-track:
- TU Berlin CommIT group: reference implementation in Python/TensorFlow.
- Google research: integration into TensorFlow Federated for large- deployments.
- Academic testbeds: cross-institution federated learning pilots at .
Engineering considerations:
- Graph seed coordination: a round-specific public seed must be agreed upon without communication. Typically derived from a blockchain hash or NIST beacon.
- Graph discovery: users compute their neighborhood locally; no per-neighbor communication needed beyond the DH exchange.
- Degree variance: Chernoff bounds guarantee degree concentration, but some users may have slightly-above-average degree. Production implementations pad to the maximum expected degree.
The shift from Bonawitz to CCESA is conceptually simple and implementation-light; adoption is gated primarily by the need to validate the graph-theoretic analysis at production scale.
- •
Graph seed: from blockchain/beacon, broadcast
- •
Degree concentration: Chernoff w.h.p.
- •
Adoption path: research → research production → standard (timeline: 1–3 years)
Key Takeaway
CCESA achieves communication by using a sparse Erdős–Rényi random graph of pairwise masks. The construction is a minimal modification of Bonawitz (change the graph structure; keep DH, PRG, Shamir) with a sub-quadratic asymptotic advantage. The privacy and reliability guarantees hold with high probability over the random graph — a relaxation so mild that it is operationally equivalent to deterministic. Section 12.3 proves these guarantees rigorously.
Common Mistake: The Graph Must Not Be Too Sparse
Mistake:
Set edge probability (much below CCESA's ) to save even more communication.
Correction:
Erdős–Rényi's connectivity threshold is . Below this, the graph has isolated vertices w.h.p. — users without mask partners. Their masks do not cancel; the server cannot compute the aggregate. CCESA's is well above connectivity but below complete — the sweet spot.
Attempting to push lower risks: (i) isolated vertices (aggregation fails), (ii) small privacy threshold (any colluders can fragment the graph). The factor is the safety margin; production uses with verified Chernoff-margin analysis.
Historical Note: From FastSecAgg to CCESA
2020–2022Reducing Bonawitz's overhead was a recognized open problem circa 2020. FastSecAgg (Kadhe, Rajaraman, Koyluoglu, Ramchandran 2020) was the first practical proposal: regular-graph mask structure with overhead, but at weakened privacy (the regular-graph structure creates predictable correlations that can be exploited).
CCESA (Choi, Sohn, Han, Moon, Caire 2022) took the next step: random Erdős–Rényi graphs instead of regular. The random-graph analysis gives tight privacy guarantees (high-probability Bonawitz-equivalent privacy) at overhead. The CommIT-group collaboration between TU Berlin and KAIST produced a clean construction that has since become a standard reference.
Post-CCESA, the field continues to explore sparser constructions (expander graphs, combinatorial designs) but CCESA remains the benchmark for random-graph-based secure aggregation.
Quick Check
CCESA uses Erdős–Rényi graph with edge probability:
(complete graph — same as Bonawitz).
for some constant .
(constant-degree).
(connectivity threshold).
Correct. This density is well above the connectivity threshold and achieves aggregate overhead.