Chapter Summary
Chapter Summary
Key Points
- 1.
A random variable is a measurable function . The PMF and CDF fully characterize the distribution of a discrete RV. Once we have these, we can forget the underlying sample space.
- 2.
Expectation is the probability-weighted average. Its most powerful property is linearity: , which holds without any independence assumption. LOTUS lets us compute directly from the PMF of .
- 3.
Variance measures spread around the mean. The shortcut formula is almost always easier than the definition. Variance of a sum equals the sum of variances only for independent (or uncorrelated) RVs.
- 4.
Seven named distributions form a core toolkit. Bernoulli (single trial), Binomial (count of successes), Geometric (wait for first success), Negative Binomial (wait for -th success), Poisson (rare events), Discrete Uniform (equal likelihood), and Hypergeometric (sampling without replacement). Each is characterized by its PMF, mean, variance, and MGF.
- 5.
The Poisson distribution arises as a limit of the binomial (, , ). Its signature property is that mean equals variance. It is the default model for counting rare events and the starting point for queueing theory.
- 6.
The geometric distribution is the only memoryless discrete distribution. Given that you have waited trials without success, the remaining wait has the same distribution as starting fresh.
- 7.
Shannon entropy quantifies average uncertainty. It is bounded by , with the uniform distribution achieving the maximum. Entropy depends only on probabilities, not on numerical values, and it equals the minimum average description length for a source.
Looking Ahead
Chapter 6 extends the theory to continuous random variables, where the PMF is replaced by a probability density function (PDF) and sums become integrals. The key distributions β Gaussian, exponential, gamma β are the continuous counterparts of the discrete distributions studied here. The Gaussian distribution will play a central role in the remainder of this book, just as the Poisson distribution dominates discrete modeling.