References & Further Reading
References
- J. A. Gubner, Probability and Random Processes for Electrical and Computer Engineers, Cambridge University Press, 2006
The primary textbook for Caire's FSP course. Chapters 5-6 cover transforms and limit theorems in the same unified framework used here.
- P. Billingsley, Probability and Measure, Wiley, 3rd ed., 1995
The definitive graduate-level treatment of characteristic functions and the continuity theorem. Chapters 26-27 provide the measure-theoretic foundations for the results in this chapter.
- W. Feller, An Introduction to Probability Theory and Its Applications, Vol. II, Wiley, 2nd ed., 1971
The classic reference for characteristic functions and their applications. Chapter XV on characteristic functions remains unsurpassed in depth and elegance.
- A. Dembo and O. Zeitouni, Large Deviations Techniques and Applications, Springer, 2nd ed., 1998
The standard reference for large deviations theory. Chapter 2 develops Cramer's theorem rigorously, including the multidimensional case.
- G. R. Grimmett and D. R. Stirzaker, Probability and Random Processes, Oxford University Press, 3rd ed., 2001
Excellent treatment of generating functions, branching processes, and random walks. Chapters 5 and 11 complement the material here.
- K. B. Athreya and P. E. Ney, Branching Processes, Springer, 1972
The definitive monograph on branching processes. Goes far beyond the Galton-Watson process covered here, into multi-type and continuous-time branching.
- P. Levy, Calcul des Probabilites, Gauthier-Villars, 1925
Levy's original work establishing the continuity theorem for characteristic functions, the key tool for proving the CLT.
- H. Cramer, Sur un nouveau theoreme-limite de la theorie des probabilites, 1938
Cramer's original paper on large deviations for sums of i.i.d. random variables, establishing the exponential decay rate.
- F. Galton and H. W. Watson, On the Probability of the Extinction of Families, 1874
The founding paper of branching process theory. Galton posed the problem of family name extinction; Watson provided the generating function analysis.
- A. Papoulis and S. U. Pillai, Probability, Random Variables, and Stochastic Processes, McGraw-Hill, 4th ed., 2002
A widely used engineering textbook with clear treatment of characteristic functions and their role in proving the CLT.
- R. Durrett, Probability: Theory and Examples, Cambridge University Press, 5th ed., 2019
Modern graduate-level probability. Chapter 3 provides a rigorous treatment of characteristic functions and weak convergence.
- T. E. Harris, The Theory of Branching Processes, Springer, 1963
The first comprehensive treatment of branching processes. Accessible and still relevant for the classical Galton-Watson theory.
Further Reading
The transform methods in this chapter are the gateway to deeper topics in probability, statistics, and information theory.
Large deviations beyond Cramer's theorem
Dembo and Zeitouni, Large Deviations Techniques and Applications (1998)
Cramer's theorem is just the beginning. The general theory handles non-i.i.d. sequences, Markov chains, and empirical measures — essential for the error exponents in information theory (Book ITA).
Stein's method and rates of convergence in CLT
Chen, Goldstein, and Shao, Normal Approximation by Stein's Method (2011)
The Berry-Esseen theorem gives $O(1/\sqrt{n})$ convergence rate; Stein's method provides sharper, distribution-free bounds with explicit constants.
Multitype and continuous-time branching processes
Athreya and Ney, Branching Processes (1972)
The Galton-Watson process is the simplest branching model. Real applications (epidemic models, population genetics) require multi-type and age-dependent generalizations.
Moment problems and determinacy
Stoyanov, Counterexamples in Probability (2014)
When does a distribution's moment sequence uniquely determine it? This is the question the MGF answers (when it exists), but the answer is subtle for distributions with heavy tails.