The Central Limit Theorem
Why the Gaussian Appears Everywhere
The LLN tells us that for large . The CLT answers the follow-up question: how is distributed around ? The answer β Gaussian, regardless of the underlying distribution β is one of the most remarkable facts in all of mathematics.
For telecommunications, this has a profound operational consequence. Thermal noise is the sum of tiny random contributions from many electrons. Aggregate interference in a dense network is the sum of many weak signals. Channel estimation errors accumulate over many pilot symbols. In all these cases, the CLT explains why Gaussian models work so well: the sum of many small independent effects is approximately Gaussian, no matter what the individual effects look like.
Theorem: Central Limit Theorem (CLT)
Let be i.i.d. with mean , variance . Define the standardized sum:
Then:
that is, for all , where is the standard normal CDF.
The LLN says with fluctuations of order . The CLT says the shape of those fluctuations is Gaussian, regardless of the shape of the original distribution. The characteristic function proof reveals why: the Ch.F of the standardized sum converges to because all higher-order cumulants vanish after division by .
Standardize
Let , so , . Then .
Compute the Ch.F of $Z_n$
Let be the common Ch.F of the . Since they are i.i.d. and :
Taylor-expand the Ch.F
Since and , the Taylor expansion (Theorem 10 from Ch. 10) gives:
Substituting :
Take the $n$-th power
Using the limit :
Identify the limit and conclude
The function is the Ch.F of . It is continuous at . By the Levy continuity theorem:
The Operational Content of the CLT
The CLT gives us a practical approximation: for large ,
This means:
- The 95% confidence interval for is approximately
- The "error" is of order , the universal convergence rate for i.i.d. averaging
The question "how large must be for this approximation to be good?" is answered by the Berry-Esseen theorem below.
Theorem: Berry-Esseen Theorem
Let be i.i.d. with mean , variance , and finite third absolute moment . Then for all and all :
where is a universal constant. The best known value is (Shevtsova, 2011).
The CLT says the CDF converges to the Gaussian CDF, but how fast? Berry-Esseen says the convergence rate is , uniformly over all . The constant depends on , which measures how "non-Gaussian" the original distribution is. For symmetric distributions, the bound is tighter.
CLT Convergence: Histogram of Approaching the Gaussian
For i.i.d. samples from a chosen distribution, observe how the histogram of the standardized sum approaches the standard normal bell curve as grows.
Parameters
Berry-Esseen Rate: vs.
Observe how the maximum CDF error between the standardized sum and the Gaussian decreases as , matching the Berry-Esseen prediction.
Parameters
Example: CLT for Coin Flips: The de Moivre-Laplace Theorem
Let be i.i.d. . Use the CLT to approximate where (number of heads in 100 fair coin flips).
Compute mean and variance
, , so has mean and standard deviation .
Standardize
$
Apply the CLT approximation
0.0284n = 100$ with a discrete distribution.
Example: CLT for Waiting Times
A call center receives calls with i.i.d. exponential inter-arrival times with rate calls/minute. Approximate the probability that the total time for 100 calls exceeds 55 minutes.
Identify parameters
Each inter-arrival time with minutes and .
Apply CLT
has mean and standard deviation . Then:
Common Mistake: The CLT Is an Asymptotic Statement β Small Requires Caution
Mistake:
Applying the CLT with or and trusting the Gaussian approximation for tail probabilities.
Correction:
The CLT guarantees convergence as , but the rate depends on the underlying distribution. For heavy-tailed distributions (large ), the Berry-Esseen bound shows convergence can be slow. In particular:
- Bernoulli with : excellent by
- Exponential: reasonable by
- Chi-squared with 1 d.f. (very skewed): may need
For tail probabilities (), the approximation degrades faster than at the center.
Historical Note: The Central Limit Theorem: 200 Years of Refinement
1733β1942Abraham de Moivre (1733) proved the CLT for fair coin flips. Pierre-Simon Laplace (1812) extended it to general distributions, though his proof lacked rigor by modern standards. The first rigorous proof using characteristic functions was given by Aleksandr Lyapunov (1901). Lindeberg (1922) and Feller (1935) established the definitive necessary and sufficient conditions for the CLT to hold for independent (not necessarily identically distributed) summands. The Berry-Esseen theorem (1941-42) finally quantified the rate of convergence.
The name "central" was coined by George Polya in 1920, reflecting its central importance in probability theory β not any geometric meaning.
Why Gaussian Noise Models Work in Communications
In a communication receiver, the thermal noise at the antenna is the aggregate effect of random electron motion across billions of charge carriers. Each contributes a tiny random voltage, and the CLT guarantees that their sum is approximately Gaussian. This justifies the noise model used throughout signal processing and information theory.
More precisely, the noise in a bandwidth over a time interval is the sum of roughly independent noise "samples" (by the sampling theorem). For typical values ( MHz, ms), this is independent contributions β more than enough for the CLT to provide an excellent approximation.
- β’
The Gaussian model breaks down for impulsive noise (e.g., lightning, power line interference)
- β’
Non-Gaussian interference arises in ultra-dense networks where a few strong interferers dominate
Quick Check
The CLT says that . What convergence mode is this?
Almost sure convergence
Convergence in probability
Convergence in distribution
convergence
Correct. The CLT is a statement about convergence of CDFs: .
Central Limit Theorem
The standardized sum of i.i.d. random variables with finite variance converges in distribution to . The convergence rate is by Berry-Esseen.
Related: Weak Law of Large Numbers, Convergence in Distribution
Key Takeaway
The CLT is the reason Gaussian models dominate communications engineering. Whenever a quantity is the sum of many small independent contributions β noise, interference, estimation errors β it is approximately Gaussian, regardless of the individual distributions. The Berry-Esseen theorem tells us the approximation error is .