Exercises

ex17-1

Easy

Given n=100n = 100 users, gradient dimension d=105d = 10^5, symbol rate 10610^6/s, and digital uses b=8b = 8 bits/symbol. Compute the per-round channel-use count for each aggregator.

ex17-2

Easy

Given Οƒg2=1,n=50,\ntnmseagg=0.5,Ξ·lr=0.1,ΞΌ=1\sigma_g^2 = 1, n = 50, \ntn{mseagg} = 0.5, \eta_{\text{lr}} = 0.1, \mu = 1, compute the predicted convergence floor per Theorem 17.2.1.

ex17-3

Easy

For n=100n = 100 users, Οƒg2=2\sigma_g^2 = 2, what is the matched aggregation MSE target (where the two variance terms equal)?

ex17-4

Easy

Users have Rayleigh-fading channels ∣hk∣2∼Exp(1)|h_k|^2 \sim \text{Exp}(1), per-user budget P=1P = 1, source variance Οƒs2=1\sigma_s^2 = 1. For MSE target MSEtol=0.02\mathsf{MSE}^{\text{tol}} = 0.02 and noise Οƒ2=0.01\sigma^2 = 0.01, compute the threshold Ο„\tau and the expected fraction of users included in St\mathcal{S}_t.

ex17-5

Medium

For Rayleigh channels, if each user participates in at least Ξ±=0.7\alpha = 0.7 of their proportional-share rounds, estimate the average MSE relative to unconstrained. Use the exponential channel's Ξ±\alpha-percentile.

ex17-6

Medium

User kk has energy budget Ek=10E_k = 10 and participates in 5 rounds with Ξ³k(t)={2,0.5,1.5,0.3,1.0}\gamma_k^{(t)} = \{2, 0.5, 1.5, 0.3, 1.0\}. Compute the water-filling power allocation.

ex17-7

Medium

For an FL task with target loss gap Ξ΅=0.01\varepsilon = 0.01, Οƒg2/n=0.02\sigma_g^2/n = 0.02, \ntnmseagg/n2=0.001\ntn{mseagg}/n^2 = 0.001, Ξ·lr=0.1,ΞΌ=1,L=10\eta_{\text{lr}} = 0.1, \mu = 1, L = 10, initial gap 1010, compute the required round count TT.

ex17-8

Medium

In the CommIT IT-secure scheme (Theorem 17.4.1), derive the mask variance Οƒm2\sigma_m^2 needed so that the per-user MI leak is at most Ξ΅\varepsilon nats. Use n=100,d=64,Οƒz2=1,∣η∣2=1,Οƒ2=0.01,Ξ΅=0.01n = 100, d = 64, \sigma_z^2 = 1, |\eta|^2 = 1, \sigma^2 = 0.01, \varepsilon = 0.01.

ex17-9

Medium

Compare convergence over T=100T = 100 rounds with (a) constant Ξ·lr=0.1\eta_{\text{lr}} = 0.1 and (b) decreasing Ξ·lr,t=1/t\eta_{\text{lr},t} = 1/t. Which reaches a smaller final loss gap?

ex17-10

Hard

For Theorem 17.4.2's rate-privacy region, plot the boundary at n=50,d=128,Οƒz2=1,Οƒ2=0.01n = 50, d = 128, \sigma_z^2 = 1, \sigma^2 = 0.01 and varying Οƒm2∈[0.1,1000]\sigma_m^2 \in [0.1, 1000]. Show that rate and privacy are decoupled at fixed ∣η∣2|\eta|^2.

ex17-11

Hard

For a non-convex FL task (Theorem 17.2.3), compute the iterations needed to reach E[βˆ₯βˆ‡Fβˆ₯2]≀Ρ2\mathbb{E}[\|\nabla F\|^2] \leq \varepsilon^2 starting from F(ΞΈ0)βˆ’F⋆=F0F(\boldsymbol{\theta}_0) - F^{\star} = F_0. Use L,V,Ξ·lrL, V, \eta_{\text{lr}} as parameters.

ex17-12

Hard

Suppose threshold scheduling systematically excludes user 1, whose gradient satisfies g1=βˆ‡F(ΞΈ)+b\mathbf{g}_1 = \nabla F(\boldsymbol{\theta}) + \mathbf{b} (a bias relative to the true gradient). Derive the convergence bias introduced.

ex17-13

Hard

Sketch a hybrid scheme where the majority of the round uses AirComp but occasional digital ACK/checksum sub-rounds provide integrity. Compare its total overhead with pure digital.

ex17-14

Hard

Theorem 17.2.1 gives a noise floor. Is the bound tight? Consider the example of FL with i.i.d.\ Gaussian losses and Gaussian aggregation noise β€” does the bound match simulations?

ex17-15

Challenge

The joint wireless-FL problem (Definition 17.3.1) is non-convex. Discuss the decomposition, the known suboptimality gap, and what would be needed to close it.