Prerequisites & Notation

Before You Begin

This chapter builds on the transform methods and probability inequalities developed earlier in the book. Readers should be comfortable with moment generating functions and the Chernoff bound technique before proceeding.

  • Moment generating functions and their properties(Review fsp-ch09)

    Self-check: Can you compute MX(t)=E[etX]M_X(t) = \mathbb{E}[e^{tX}] for a Gaussian and a Bernoulli RV?

  • Chernoff bound and exponential tilting(Review fsp-ch10)

    Self-check: Can you derive P(Xa)inft>0etaMX(t)\mathbb{P}(X \geq a) \leq \inf_{t > 0} e^{-ta} M_X(t)?

  • Weak law of large numbers and CLT(Review fsp-ch11)

    Self-check: Do you understand that the sample mean concentrates around the true mean, and the CLT describes fluctuations at scale 1/n1/\sqrt{n}?

  • Kullback-Leibler divergence (basic definition)(Review ita-ch01)

    Self-check: Can you compute D(QP)D(Q \| P) for two Bernoulli distributions?

  • Convex conjugates (Legendre transform)

    Self-check: Given a convex function f(t)f(t), can you compute f(x)=supt{txf(t)}f^*(x) = \sup_t \{tx - f(t)\}?

Notation for This Chapter

Key symbols introduced or used throughout this chapter.

SymbolMeaningIntroduced
I(x)I(x)Rate function (Legendre-Fenchel transform of log-MGF)s01
Λ(θ)\Lambda(\theta)Cumulant generating function: Λ(θ)=logMX(θ)\Lambda(\theta) = \log M_X(\theta)s01
Λ(x)\Lambda^*(x)Fenchel-Legendre dual of Λ\Lambda: Λ(x)=supθ{θxΛ(θ)}\Lambda^*(x) = \sup_\theta \{\theta x - \Lambda(\theta)\}s01
P^n\hat{P}_nEmpirical distribution (type) of nn i.i.d. sampless02
D(QP)D(Q \| P)Kullback-Leibler divergence from PP to QQs02
σsg\sigma_{\mathrm{sg}}Sub-Gaussian parameter (proxy variance)s03
Xˉn\bar{X}_nSample mean: Xˉn=1ni=1nXi\bar{X}_n = \frac{1}{n}\sum_{i=1}^n X_is01
ErE_rError exponent (exponential decay rate of error probability)s04