Sigma-Algebras, Measures, and Measurability

The Three Pillars of a Probability Space

In Chapter 1 we introduced the probability space (Ω,F,P)(\Omega, \mathcal{F}, P) informally: Ω\Omega is the set of outcomes, F\mathcal{F} is the collection of events, and PP assigns probabilities. Now we make this precise. The key insight is that F\mathcal{F} cannot be "all subsets of Ω\Omega" when Ω\Omega is uncountable — doing so leads to contradictions (Vitali sets, Banach-Tarski). The sigma-algebra is the mathematical device that tells us which subsets are "measurable" and therefore eligible to receive a probability.

Definition:

Sigma-Algebra (σ\sigma-Algebra)

A collection F\mathcal{F} of subsets of Ω\Omega is a σ\sigma-algebra (or σ\sigma-field) if:

  1. ΩF\Omega \in \mathcal{F}.
  2. If AFA \in \mathcal{F}, then AcFA^c \in \mathcal{F} (closed under complementation).
  3. If A1,A2,FA_1, A_2, \ldots \in \mathcal{F}, then n=1AnF\bigcup_{n=1}^{\infty} A_n \in \mathcal{F} (closed under countable unions).

Properties 2 and 3 together imply closure under countable intersections (by De Morgan). Also, =ΩcF\emptyset = \Omega^c \in \mathcal{F}. The pair (Ω,F)(\Omega, \mathcal{F}) is called a measurable space.

Example: Examples of Sigma-Algebras

Identify the sigma-algebras in the following cases:

(a) Ω={H,T}\Omega = \{H, T\} (coin flip). (b) Ω=R\Omega = \mathbb{R}, the Borel sigma-algebra. (c) The trivial and the discrete sigma-algebras on any Ω\Omega.

,

Definition:

Generated Sigma-Algebra

Given any collection C\mathcal{C} of subsets of Ω\Omega, the sigma-algebra generated by C\mathcal{C}, written σ(C)\sigma(\mathcal{C}), is the smallest sigma-algebra containing C\mathcal{C}: σ(C)={F:F is a σ-algebra and CF}.\sigma(\mathcal{C}) = \bigcap \{ \mathcal{F} : \mathcal{F} \text{ is a } \sigma\text{-algebra and } \mathcal{C} \subseteq \mathcal{F} \}. This intersection is well-defined because 2Ω2^{\Omega} is always a sigma-algebra containing C\mathcal{C}.

The Borel sigma-algebra is B(R)=σ({(a,b):a<b})\mathcal{B}(\mathbb{R}) = \sigma(\{(a,b) : a < b\}). Equivalently, B(R)=σ({(,x]:xR})\mathcal{B}(\mathbb{R}) = \sigma(\{(-\infty, x] : x \in \mathbb{R}\}).

Definition:

Measure

A measure on a measurable space (Ω,F)(\Omega, \mathcal{F}) is a function μ:F[0,]\mu : \mathcal{F} \to [0, \infty] satisfying:

  1. μ()=0\mu(\emptyset) = 0.
  2. Countable additivity: If A1,A2,FA_1, A_2, \ldots \in \mathcal{F} are pairwise disjoint, then μ ⁣(n=1An)=n=1μ(An)\mu\!\left(\bigcup_{n=1}^{\infty} A_n\right) = \sum_{n=1}^{\infty} \mu(A_n).

The triple (Ω,F,μ)(\Omega, \mathcal{F}, \mu) is called a measure space. If μ(Ω)=1\mu(\Omega) = 1, then μ\mu is a probability measure and we write PP instead of μ\mu.

Definition:

Lebesgue Measure on R\mathbb{R}

The Lebesgue measure λ\lambda on (R,B(R))(\mathbb{R}, \mathcal{B}(\mathbb{R})) is the unique measure satisfying λ((a,b])=ba\lambda((a, b]) = b - a for every interval (a,b](a, b]. Its existence is guaranteed by the Carathodory extension theorem.

Key properties:

  • Translation invariance: λ(A+x)=λ(A)\lambda(A + x) = \lambda(A) for all xRx \in \mathbb{R}.
  • Countable sets have measure zero: λ(Q)=0\lambda(\mathbb{Q}) = 0.
  • The Cantor set has measure zero yet is uncountable.

Lebesgue measure is the "right" notion of length/area/volume for measurable subsets of Rn\mathbb{R}^n. In probability, a continuous random variable XX has a density fXf_X if and only if its distribution PXP_X is absolutely continuous with respect to Lebesgue measure — and the density is the Radon-Nikodym derivative fX=dPX/dλf_X = dP_X / d\lambda (Section 22.4).

,

Theorem: Carathodory Extension Theorem

Let A\mathcal{A} be an algebra of subsets of Ω\Omega and let μ0:A[0,]\mu_0 : \mathcal{A} \to [0, \infty] be a pre-measure (finitely additive and countably additive on A\mathcal{A}). If μ0\mu_0 is σ\sigma-finite (i.e., Ω=nAn\Omega = \bigcup_{n} A_n with μ0(An)<\mu_0(A_n) < \infty), then μ0\mu_0 extends uniquely to a measure μ\mu on σ(A)\sigma(\mathcal{A}).

The theorem says: if you know how to assign "lengths" to intervals in a consistent way (the pre-measure on the algebra of finite unions of intervals), then there is exactly one way to extend this assignment to all Borel sets. This is how Lebesgue measure is constructed — start from λ((a,b])=ba\lambda((a,b]) = b - a and extend.

,

Definition:

Measurable Function (= Random Variable)

Let (Ω,F)(\Omega, \mathcal{F}) and (S,S)(S, \mathcal{S}) be measurable spaces. A function X:ΩSX : \Omega \to S is (F,S)(\mathcal{F}, \mathcal{S})-measurable if X1(B)={ωΩ:X(ω)B}Ffor all BS.X^{-1}(B) = \{\omega \in \Omega : X(\omega) \in B\} \in \mathcal{F} \quad \text{for all } B \in \mathcal{S}.

When (Ω,F,P)(\Omega, \mathcal{F}, P) is a probability space and S=RS = \mathbb{R}, S=B(R)\mathcal{S} = \mathcal{B}(\mathbb{R}), a measurable function XX is called a random variable. The condition becomes: {ω:X(ω)x}Ffor all xR.\{\omega : X(\omega) \leq x\} \in \mathcal{F} \quad \text{for all } x \in \mathbb{R}.

This is the formal version of what we used informally in Chapter 5: a random variable is a function from the sample space to R\mathbb{R} that is "compatible" with the sigma-algebra, so that we can compute probabilities of events like {Xx}\{X \leq x\}.

,

Definition:

Expectation as Lebesgue Integration

If XX is a random variable on (Ω,F,P)(\Omega, \mathcal{F}, P), its expectation is: E[X]=ΩX(ω)dP(ω),\mathbb{E}[X] = \int_{\Omega} X(\omega)\, dP(\omega), where the right-hand side is the Lebesgue integral of XX with respect to the probability measure PP. This single definition unifies:

  • Discrete case: E[X]=xxP(X=x)\mathbb{E}[X] = \sum_x x \cdot P(X = x) (integral w.r.t. counting measure).
  • Continuous case: E[X]=xfX(x)dx\mathbb{E}[X] = \int_{-\infty}^{\infty} x f_X(x)\, dx (integral w.r.t. Lebesgue measure, via the change-of-variables formula).
  • Mixed/singular case: handled naturally.

Example: Expectation of a Mixed Random Variable

Let XX be a random variable with CDF: FX(x)={0x<0x/20x<11x1F_X(x) = \begin{cases} 0 & x < 0 \\ x/2 & 0 \leq x < 1 \\ 1 & x \geq 1 \end{cases} Note that FXF_X has a jump of size 1/21/2 at x=1x = 1 and a continuous part on [0,1)[0,1). Compute E[X]\mathbb{E}[X].

Riemann vs. Lebesgue Integration

PropertyRiemannLebesgue
PartitionsDomain [a,b][a,b]Range of ff
Handles discontinuous functionsOnly if discontinuities have measure zeroAll measurable functions
Convergence theoremsUniform convergence requiredMCT, DCT (pointwise suffices)
Defines probabilityOnly for continuous and discrete RVsAll RVs (discrete, continuous, mixed, singular)
Completeness of LpL^pNo (Riesz-Fischer fails)Yes (LpL^p is a Banach space)

Historical Note: Vitali's Non-Measurable Set and the Necessity of Sigma-Algebras

1905

In 1905, Giuseppe Vitali constructed a subset of [0,1][0,1] that cannot be assigned a Lebesgue measure in any consistent way. The construction uses the axiom of choice: partition [0,1][0,1] into equivalence classes where xyx \sim y iff xyQx - y \in \mathbb{Q}, and select one representative from each class. The resulting set VV satisfies [0,1]qQ[0,1](V+q)[0,1] \subseteq \bigcup_{q \in \mathbb{Q} \cap [0,1]} (V + q), but if λ(V)=0\lambda(V) = 0 then λ([0,1])=0\lambda([0,1]) = 0 (contradiction), and if λ(V)>0\lambda(V) > 0 then λ([0,1])=\lambda([0,1]) = \infty (contradiction).

This is why we need sigma-algebras: we cannot assign probabilities to all subsets of an uncountable set in a translation-invariant way. The Borel (or Lebesgue) sigma-algebra is the largest collection of "well-behaved" sets that admits a consistent measure.

Common Mistake: Not All Subsets of R\mathbb{R} Are Measurable

Mistake:

Assuming that any subset of R\mathbb{R} one can "describe" must be Lebesgue measurable, or that non-measurable sets are merely a logical curiosity with no practical consequence.

Correction:

Vitali's construction shows that non-measurable sets exist (under the axiom of choice). While these sets do not arise in engineering applications, their existence is the reason we need the formalism of sigma-algebras. Without it, the entire theory of probability measures on R\mathbb{R} is inconsistent.

Common Mistake: Borel Sets vs. Lebesgue-Measurable Sets

Mistake:

Conflating the Borel sigma-algebra B(R)\mathcal{B}(\mathbb{R}) with the Lebesgue sigma-algebra L(R)\mathcal{L}(\mathbb{R}).

Correction:

The Lebesgue sigma-algebra is strictly larger: B(R)L(R)\mathcal{B}(\mathbb{R}) \subsetneq \mathcal{L}(\mathbb{R}). There exist Lebesgue-measurable sets that are not Borel sets (subsets of measure-zero Cantor-type sets). For probability theory, the Borel sigma-algebra is almost always sufficient.

Quick Check

Which of the following is NOT a sigma-algebra on Ω={a,b,c}\Omega = \{a, b, c\}?

{,Ω}\{\emptyset, \Omega\}

{,{a},{b,c},Ω}\{\emptyset, \{a\}, \{b,c\}, \Omega\}

{,{a},{b},Ω}\{\emptyset, \{a\}, \{b\}, \Omega\}

2Ω2^{\Omega} (the power set)

Quick Check

In the measure-theoretic definition, a random variable X:ΩRX : \Omega \to \mathbb{R} is:

Any function from Ω\Omega to R\mathbb{R}

A function such that {omega:X(omega)leqx}inmathcalF\{\\omega : X(\\omega) \\leq x\} \\in \\mathcal{F} for all xinmathbbRx \\in \\mathbb{R}

A function with a PDF

Why This Matters: Mixed Distributions in Fading Channels

In wireless communications, mixed random variables arise naturally. Consider a Rayleigh fading channel where outage occurs: the effective rate is R=log2(1+SNRh2)R = \log_2(1 + \text{SNR} \cdot |h|^2) when the channel is not in outage, and R=0R = 0 (with positive probability) during outage. The resulting RR has a point mass at zero plus a continuous part — exactly the type of distribution that the Lebesgue integral handles naturally but the Riemann integral cannot.

Sigma-Algebra

A collection F\mathcal{F} of subsets of Ω\Omega that is closed under complementation and countable unions, and contains Ω\Omega itself. It defines which events can be assigned a probability.

Related: Borel Sigma-Algebra, Measurable Function (= Random Variable)

Borel Sigma-Algebra

The smallest sigma-algebra on R\mathbb{R} containing all open sets, denoted B(R)\mathcal{B}(\mathbb{R}). Equivalently, generated by intervals (,x](-\infty, x] for xRx \in \mathbb{R}.

Related: Sigma-Algebra (σ\sigma-Algebra), Lebesgue Measure on R\mathbb{R}

Lebesgue Measure

The unique complete, translation-invariant measure on R\mathbb{R} that assigns λ((a,b])=ba\lambda((a,b]) = b - a to every interval. The standard notion of "length" for measurable subsets of R\mathbb{R}.

Related: Borel Sigma-Algebra, Lebesgue Integral

Key Takeaway

A probability space (Ω,F,P)(\Omega, \mathcal{F}, P) is a measure space with total mass one. The sigma-algebra F\mathcal{F} tells us which questions about the outcome ω\omega can be answered with a probability; a random variable is a measurable function that maps outcomes to numbers in a way compatible with F\mathcal{F}; and expectation is the Lebesgue integral with respect to PP.