Common Discrete Distributions
A Toolkit of Named Distributions
A handful of discrete distributions appear so frequently in applications that they have earned their own names. Each captures a different random mechanism: coin flips (Bernoulli/binomial), waiting times (geometric/negative binomial), rare events (Poisson), equal likelihood (discrete uniform), and sampling without replacement (hypergeometric). Knowing these distributions β and recognizing when a problem maps to one of them β saves enormous effort in both analysis and computation.
Definition: Bernoulli Distribution
Bernoulli Distribution
with :
- Mean: .
- Variance: .
- MGF: .
The Bernoulli distribution models a single trial with two outcomes. It is the building block for the binomial distribution (sum of i.i.d. Bernoulli trials).
Definition: Binomial Distribution
Binomial Distribution
: the number of successes in independent Bernoulli() trials.
- Mean: .
- Variance: .
- MGF: .
Theorem: Mean and Variance of the Binomial via Indicators
If , then and .
Decompose into indicators
Write where are independent. By linearity, .
Variance by independence
Since the are independent, .
Example: Bit Errors in a Transmitted Block
A transmitter sends a block of bits over a binary symmetric channel with bit error probability . Let be the number of errors. Find and , and estimate .
Model
, so and .
Markov bound
(crude).
Poisson approximation
Since is large and is small, . Using the Poisson CDF: .
Binomial PMF Explorer
Explore how the binomial PMF changes shape as and vary. For large and small , the binomial approaches the Poisson. For moderate and large , it approaches the Gaussian.
Parameters
Definition: Geometric Distribution
Geometric Distribution
: the number of trials until the first success.
- Mean: .
- Variance: .
- MGF: for .
Some authors define the geometric distribution as the number of failures before the first success, giving for and mean . We follow the convention of Caire's FSP course, counting the trial on which the first success occurs.
Theorem: Memoryless Property of the Geometric Distribution
If , then for all :
Moreover, the geometric distribution is the only discrete distribution with this memoryless property.
Given that you have already waited trials without success, the remaining wait has the same distribution as if you were starting fresh. This is the discrete analogue of the memoryless property of the exponential distribution.
Compute the tail probability
.
Apply Bayes
$
Geometric Memoryless Property
Visualize the memoryless property: the conditional PMF of given is identical to the original PMF, regardless of .
Parameters
Definition: Negative Binomial Distribution
Negative Binomial Distribution
: the number of trials until the -th success.
- Mean: .
- Variance: .
- MGF: for .
The negative binomial is a sum of independent Geometric() random variables. For , it reduces to the geometric distribution.
Definition: Poisson Distribution
Poisson Distribution
with :
- Mean: .
- Variance: .
- MGF: .
The Poisson distribution is remarkable in that its mean and variance are equal. This "equi-dispersion" property is a quick diagnostic: if the sample mean and variance of a count data set are roughly equal, the Poisson may be a good model.
Theorem: Poisson Limit Theorem (Law of Rare Events)
If with as , then for every fixed :
When the number of trials is large but each trial has a small success probability, the binomial distribution is well approximated by the Poisson. This is the "law of rare events" β the Poisson distribution naturally governs counts of rare occurrences.
Write out the binomial PMF
With :
Take the limit
As :
- .
- .
- .
So the limit is .
Poisson PMF vs
Explore how the Poisson PMF changes as increases. For large , the distribution becomes approximately Gaussian by the CLT.
Parameters
Definition: Discrete Uniform Distribution
Discrete Uniform Distribution
for integers :
- Mean: .
- Variance: .
The fair die (, ) is the prototypical example. The discrete uniform distribution maximizes entropy over its support β we will formalize this in Section 5.5.
Definition: Hypergeometric Distribution
Hypergeometric Distribution
Draw items without replacement from a population of items, of which are "good." Let = number of good items drawn.
- Mean: .
- Variance: .
When is much larger than , sampling without replacement is approximately the same as sampling with replacement, and the hypergeometric approaches the Binomial. The correction factor is the finite-population correction.
Summary of Common Discrete Distributions
| Distribution | PMF | Mean | Variance |
|---|---|---|---|
| Bernoulli | , | ||
| Binomial | |||
| Geometric | , | ||
| NegBin | |||
| Poisson | |||
| Uniform | |||
| Hypergeometric | See definition |
Why This Matters: Poisson Models for Network Traffic
The Poisson distribution is the workhorse model for packet arrivals in telecommunication networks. If users initiate sessions independently at a low individual rate, the total number of arrivals in a time interval is approximately Poisson (by the law of rare events). This is the starting point for queueing theory β the Poisson arrival process feeds into the M/M/1 queue, the simplest model for a network switch.
When Poisson Fails: Overdispersion in Real Networks
Real network traffic often exhibits overdispersion β the variance exceeds the mean, violating the Poisson assumption. This arises from burstiness and long-range dependence. The negative binomial distribution, which allows , is a common alternative. Always check equi-dispersion before blindly applying Poisson models.
Quick Check
A random variable has PMF for What are and ?
,
,
,
,
This is Poisson(), and for Poisson, mean = variance = .
Historical Note: Poisson and the Law of Rare Events
1837SimΓ©on Denis Poisson published his eponymous distribution in 1837 in his work Recherches sur la probabilitΓ© des jugements. But the distribution did not become famous until Ladislaus Bortkiewicz's 1898 monograph, which used it to model the number of Prussian cavalry soldiers killed by horse kicks per year β the classic "law of small numbers" example. Bortkiewicz showed that across 14 Prussian army corps over 20 years, the number of deaths per corps per year followed a Poisson distribution with remarkably well.
Poisson Distribution
A discrete distribution with parameter , PMF , and the property that mean equals variance equals .
Related: Probability Mass Function (PMF)
Moment Generating Function (MGF)
. When it exists in a neighborhood of , it uniquely determines the distribution and generates moments via .
Related: Expectation, Variance
Key Takeaway
The Poisson distribution arises as the limit of the binomial when is large and is small with . Whenever you count rare events among many independent trials, the Poisson is the natural model. Its defining signature is that mean equals variance.