The Hypergeometric Distribution
From Counting to Distribution
In Chapter 2 we derived the hypergeometric coefficient as the probability of drawing exactly "successes" from a finite population. Now we study this formula as a full-fledged probability distribution and compare it to the binomial. The key question is: when does sampling without replacement behave like sampling with replacement?
Definition: Hypergeometric Distribution
Hypergeometric Distribution
Let a population of items contain "successes" and "failures." Draw items without replacement. The number of successes follows the hypergeometric distribution with PMF:
Theorem: Mean and Variance of the Hypergeometric Distribution
If , then:
The mean is the same as for — on average, the fraction of successes in the sample matches the fraction in the population. The variance is smaller by the factor , called the finite population correction. Sampling without replacement reduces variability because each draw constrains the next.
Indicator decomposition
Define for . Then .
Compute the mean
By symmetry, for every (each item is equally likely to be in any draw position). Thus:
Compute the variance
. For : so .
Combine
$
Definition: Finite Population Correction
Finite Population Correction
The factor appearing in the variance of the hypergeometric distribution is called the finite population correction (FPC). It satisfies:
- FPC when (single draw: same as binomial).
- FPC when (full census: no variability).
- FPC as with fixed (large population: approaches binomial).
In survey sampling, the FPC is applied whenever the sample constitutes more than about 5% of the population. For most wireless applications, the "population" of possible events is effectively infinite and the FPC is negligible.
Theorem: Hypergeometric-to-Binomial Approximation
Let where as . Then for every fixed : That is, converges in distribution to .
When the population is much larger than the sample, whether we replace each drawn item or not makes negligible difference — the composition of the population barely changes between draws. This is why polls of 1,000 people can represent 300 million: the finite population correction is essentially 1.
Write the PMF as a ratio of falling factorials
$
Take the limit
As with , each ratio of the form converges to , and similarly . There are factors converging to and factors converging to :
Hypergeometric vs. Binomial
| Property | with | |
|---|---|---|
| Sampling | Without replacement | With replacement |
| Mean | ||
| Variance | ||
| Support | ||
| Independence of draws | No | Yes |
| Approximation | Approaches Bin as | Exact |
Hypergeometric vs. Binomial PMF
Compare the hypergeometric PMF (sampling without replacement) to the binomial approximation (sampling with replacement). As grows with fixed, the two distributions become indistinguishable.
Parameters
Example: Quality Control Inspection
A shipment of components contains defective ones. An inspector draws components without replacement. What is the probability that the sample contains exactly 1 defective component? Compare with the binomial approximation.
Hypergeometric calculation
\binom{5}{1} = 5\mathbb{P}(X = 1) \approx 0.3394$.
Binomial approximation
With : .
Compare
The binomial underestimates the true probability by about 7%. The finite population correction factor is , which is already quite close to 1 but not negligible for the variance. For with (same ), the two answers would agree to four decimal places.
Example: Lottery Probability
In a lottery, 6 numbers are drawn without replacement from . What is the probability of matching exactly 3 of your 6 chosen numbers?
Set up as hypergeometric
The "population" is numbers, of which are "successes" (the winning numbers). We draw , and we want matches.
Compute
$ About 1 in 57 tickets matches exactly 3 numbers.
Why This Matters: Hypergeometric Distribution in Random Access
In grant-free random access for massive IoT, a base station allocates pilot sequences. If devices are active and each selects a pilot uniformly at random, the number of devices selecting a given subset of pilots follows a distribution closely related to the hypergeometric. When is large relative to , the binomial approximation is accurate, but in overloaded regimes (), the finite-population effects become significant and the hypergeometric model is more appropriate.
Common Mistake: Forgetting the Support Constraints
Mistake:
Writing for without checking that and .
Correction:
The hypergeometric PMF is zero outside . Always verify the support before computing — especially when is close to .
Historical Note: Origins of the Hypergeometric Distribution
19th–20th centuryThe term "hypergeometric" dates to the early 19th century and reflects the connection to the hypergeometric series . The PMF of the hypergeometric distribution can be expressed as a terminating hypergeometric series. The distribution itself was studied implicitly by Laplace and explicitly by the statistician Karl Pearson in the early 1900s, who used it as the exact model for Fisher's "Lady Tasting Tea" experiment — one of the founding examples of hypothesis testing.
Quick Check
When is the finite population correction factor exactly equal to 0?
When
When
When
When
Correct. FPC . Drawing the entire population leaves no randomness: .
hypergeometric distribution
The distribution of the number of successes in draws without replacement from a population of items containing successes: .
Related: binomial distribution, finite population correction
finite population correction
The factor by which the variance of the hypergeometric distribution is smaller than that of the corresponding binomial.
Related: hypergeometric distribution
binomial distribution
The distribution of the number of successes in independent Bernoulli trials with success probability : .
Key Takeaway
The hypergeometric distribution models sampling without replacement and has the same mean as the binomial but a smaller variance by the finite population correction factor . When the population is much larger than the sample , the two distributions are practically indistinguishable.