Sigma-Algebras, Measures, and Measurability
The Three Pillars of a Probability Space
In Chapter 1 we introduced the probability space informally: is the set of outcomes, is the collection of events, and assigns probabilities. Now we make this precise. The key insight is that cannot be "all subsets of " when is uncountable — doing so leads to contradictions (Vitali sets, Banach-Tarski). The sigma-algebra is the mathematical device that tells us which subsets are "measurable" and therefore eligible to receive a probability.
Definition: Sigma-Algebra (-Algebra)
Sigma-Algebra (-Algebra)
A collection of subsets of is a -algebra (or -field) if:
- .
- If , then (closed under complementation).
- If , then (closed under countable unions).
Properties 2 and 3 together imply closure under countable intersections (by De Morgan). Also, . The pair is called a measurable space.
Example: Examples of Sigma-Algebras
Identify the sigma-algebras in the following cases:
(a) (coin flip). (b) , the Borel sigma-algebra. (c) The trivial and the discrete sigma-algebras on any .
(a) Coin flip
. When is finite or countable, the power set is a valid sigma-algebra.
(b) Borel sigma-algebra
is the smallest sigma-algebra containing all open intervals . Equivalently, it is generated by all sets of the form for . It contains all open sets, all closed sets, all countable unions and intersections thereof (the Borel hierarchy). It does not contain all subsets of — Vitali's construction shows that non-measurable sets exist (assuming the axiom of choice).
(c) Trivial and discrete
The trivial sigma-algebra is — it contains no information about . The discrete sigma-algebra is — it distinguishes every element. Every sigma-algebra on sits between these two extremes.
Definition: Generated Sigma-Algebra
Generated Sigma-Algebra
Given any collection of subsets of , the sigma-algebra generated by , written , is the smallest sigma-algebra containing : This intersection is well-defined because is always a sigma-algebra containing .
The Borel sigma-algebra is . Equivalently, .
Definition: Measure
Measure
A measure on a measurable space is a function satisfying:
- .
- Countable additivity: If are pairwise disjoint, then .
The triple is called a measure space. If , then is a probability measure and we write instead of .
Definition: Lebesgue Measure on
Lebesgue Measure on
The Lebesgue measure on is the unique measure satisfying for every interval . Its existence is guaranteed by the Carathodory extension theorem.
Key properties:
- Translation invariance: for all .
- Countable sets have measure zero: .
- The Cantor set has measure zero yet is uncountable.
Lebesgue measure is the "right" notion of length/area/volume for measurable subsets of . In probability, a continuous random variable has a density if and only if its distribution is absolutely continuous with respect to Lebesgue measure — and the density is the Radon-Nikodym derivative (Section 22.4).
Theorem: Carathodory Extension Theorem
Let be an algebra of subsets of and let be a pre-measure (finitely additive and countably additive on ). If is -finite (i.e., with ), then extends uniquely to a measure on .
The theorem says: if you know how to assign "lengths" to intervals in a consistent way (the pre-measure on the algebra of finite unions of intervals), then there is exactly one way to extend this assignment to all Borel sets. This is how Lebesgue measure is constructed — start from and extend.
Outer measure construction
Define the outer measure . This is defined on all subsets of but is not necessarily additive.
Identify measurable sets
A set is -measurable (in the Carathodory sense) if for every : . The collection of all -measurable sets forms a sigma-algebra , and restricted to is a measure.
Uniqueness from $\sigma$-finiteness
One shows , so extends . Uniqueness follows from -finiteness and the - theorem.
Definition: Measurable Function (= Random Variable)
Measurable Function (= Random Variable)
Let and be measurable spaces. A function is -measurable if
When is a probability space and , , a measurable function is called a random variable. The condition becomes:
This is the formal version of what we used informally in Chapter 5: a random variable is a function from the sample space to that is "compatible" with the sigma-algebra, so that we can compute probabilities of events like .
Definition: Expectation as Lebesgue Integration
Expectation as Lebesgue Integration
If is a random variable on , its expectation is: where the right-hand side is the Lebesgue integral of with respect to the probability measure . This single definition unifies:
- Discrete case: (integral w.r.t. counting measure).
- Continuous case: (integral w.r.t. Lebesgue measure, via the change-of-variables formula).
- Mixed/singular case: handled naturally.
Example: Expectation of a Mixed Random Variable
Let be a random variable with CDF: Note that has a jump of size at and a continuous part on . Compute .
Decompose the distribution
The distribution of is a mixture: with probability , is uniform on (continuous part with density on ), and with probability , (point mass).
Compute via the Lebesgue integral
P_X = \frac{1}{2}\lambda|_{[0,1)} + \frac{1}{2}\delta_1\mathbb{E}[X] = \int x, dP_X(x)$, which the Lebesgue integral handles seamlessly.
Riemann vs. Lebesgue Integration
| Property | Riemann | Lebesgue |
|---|---|---|
| Partitions | Domain | Range of |
| Handles discontinuous functions | Only if discontinuities have measure zero | All measurable functions |
| Convergence theorems | Uniform convergence required | MCT, DCT (pointwise suffices) |
| Defines probability | Only for continuous and discrete RVs | All RVs (discrete, continuous, mixed, singular) |
| Completeness of | No (Riesz-Fischer fails) | Yes ( is a Banach space) |
Historical Note: Vitali's Non-Measurable Set and the Necessity of Sigma-Algebras
1905In 1905, Giuseppe Vitali constructed a subset of that cannot be assigned a Lebesgue measure in any consistent way. The construction uses the axiom of choice: partition into equivalence classes where iff , and select one representative from each class. The resulting set satisfies , but if then (contradiction), and if then (contradiction).
This is why we need sigma-algebras: we cannot assign probabilities to all subsets of an uncountable set in a translation-invariant way. The Borel (or Lebesgue) sigma-algebra is the largest collection of "well-behaved" sets that admits a consistent measure.
Common Mistake: Not All Subsets of Are Measurable
Mistake:
Assuming that any subset of one can "describe" must be Lebesgue measurable, or that non-measurable sets are merely a logical curiosity with no practical consequence.
Correction:
Vitali's construction shows that non-measurable sets exist (under the axiom of choice). While these sets do not arise in engineering applications, their existence is the reason we need the formalism of sigma-algebras. Without it, the entire theory of probability measures on is inconsistent.
Common Mistake: Borel Sets vs. Lebesgue-Measurable Sets
Mistake:
Conflating the Borel sigma-algebra with the Lebesgue sigma-algebra .
Correction:
The Lebesgue sigma-algebra is strictly larger: . There exist Lebesgue-measurable sets that are not Borel sets (subsets of measure-zero Cantor-type sets). For probability theory, the Borel sigma-algebra is almost always sufficient.
Quick Check
Which of the following is NOT a sigma-algebra on ?
(the power set)
, which is not in the collection. So it fails closure under unions.
Quick Check
In the measure-theoretic definition, a random variable is:
Any function from to
A function such that for all
A function with a PDF
This is the measurability condition: the preimage of every Borel set (equivalently, every half-line) must be in the sigma-algebra.
Why This Matters: Mixed Distributions in Fading Channels
In wireless communications, mixed random variables arise naturally. Consider a Rayleigh fading channel where outage occurs: the effective rate is when the channel is not in outage, and (with positive probability) during outage. The resulting has a point mass at zero plus a continuous part — exactly the type of distribution that the Lebesgue integral handles naturally but the Riemann integral cannot.
Sigma-Algebra
A collection of subsets of that is closed under complementation and countable unions, and contains itself. It defines which events can be assigned a probability.
Related: Borel Sigma-Algebra, Measurable Function (= Random Variable)
Borel Sigma-Algebra
The smallest sigma-algebra on containing all open sets, denoted . Equivalently, generated by intervals for .
Related: Sigma-Algebra (-Algebra), Lebesgue Measure on
Lebesgue Measure
The unique complete, translation-invariant measure on that assigns to every interval. The standard notion of "length" for measurable subsets of .
Related: Borel Sigma-Algebra, Lebesgue Integral
Key Takeaway
A probability space is a measure space with total mass one. The sigma-algebra tells us which questions about the outcome can be answered with a probability; a random variable is a measurable function that maps outcomes to numbers in a way compatible with ; and expectation is the Lebesgue integral with respect to .