Sigma-Algebras and Why They Matter

Why Can't We Just Assign Probabilities to All Subsets?

When Ξ©\Omega is finite or countably infinite, we can and do assign probabilities to every subset. The power set 2Ξ©2^\Omega works fine.

When Ξ©\Omega is uncountable β€” say Ξ©=[0,1]\Omega = [0,1] β€” a disturbing fact emerges: it is mathematically impossible to define a function P:2[0,1]β†’[0,1]\mathbb{P}: 2^{[0,1]} \to [0,1] that satisfies even the most basic requirements (translation invariance and countable additivity) simultaneously. The obstruction was made explicit by Giuseppe Vitali in 1905, who constructed a subset of [0,1][0,1] that cannot be assigned a consistent length (the Vitali set β€” see the historical note below).

The resolution is not to assign probabilities to all subsets, but to restrict attention to a carefully chosen collection of subsets β€” the sigma-algebra β€” that is large enough to contain every event we will ever want to discuss, but small enough to exclude the pathological sets where probability breaks down.

For the practicing engineer, this matters the moment you work with continuous-valued random variables. Every CDF, every PDF, every integral of the form ∫Af(x) dx\int_A f(x)\,dx implicitly uses the Borel sigma-algebra on R\mathbb{R}. Understanding sigma-algebras now will prevent confusion later when continuous random variables and stochastic processes enter the picture.

,

Definition:

Sigma-Algebra

A collection F\mathcal{F} of subsets of Ξ©\Omega is called a sigma-algebra (or Οƒ\sigma-field) if:

  1. Contains the empty set: βˆ…βˆˆF\emptyset \in \mathcal{F}.
  2. Closed under complementation: If A∈FA \in \mathcal{F}, then Ac∈FA^c \in \mathcal{F}.
  3. Closed under countable unions: If A1,A2,β€¦βˆˆFA_1, A_2, \ldots \in \mathcal{F}, then ⋃i=1∞Ai∈F\bigcup_{i=1}^{\infty} A_i \in \mathcal{F}.

The elements of F\mathcal{F} are called measurable sets or events.

From the three axioms, we immediately derive:

  • Ξ©=βˆ…c∈F\Omega = \emptyset^c \in \mathcal{F}.
  • Closed under countable intersections (by De Morgan: β‹‚Ai=(⋃Aic)c\bigcap A_i = (\bigcup A_i^c)^c).
  • Closed under set difference: Aβˆ–B=A∩BcA \setminus B = A \cap B^c.

So a sigma-algebra is closed under all the operations we ever apply to events.

,

Example: Four Important Sigma-Algebras

For Ξ©={1,2,3,4,5,6}\Omega = \{1,2,3,4,5,6\} (a single die roll) and for Ξ©=[0,1]\Omega = [0,1], identify the following sigma-algebras: (a) The trivial sigma-algebra. (b) The sigma-algebra generated by the event A={1,2,3}A = \{1,2,3\}. (c) The power set of {1,2,3,4,5,6}\{1,2,3,4,5,6\}. (d) The Borel sigma-algebra on [0,1][0,1].

Historical Note: Vitali Sets and the Need for Sigma-Algebras

1905

In 1905, the Italian mathematician Giuseppe Vitali constructed the first example of a non-measurable set β€” a subset of [0,1][0,1] that cannot be consistently assigned a length (measure). The construction uses the Axiom of Choice to select one representative from each equivalence class of real numbers under the relation x∼yβ€…β€ŠβŸΊβ€…β€Šxβˆ’y∈Qx \sim y \iff x - y \in \mathbb{Q}. The resulting set VV cannot be assigned a value ΞΌ(V)∈[0,∞]\mu(V) \in [0,\infty] that is consistent with the requirement that the measure of [0,1][0,1] is 1 and that measure is translation invariant.

Vitali's example was a shock to early 20th-century mathematics: it showed that "set" and "measurable set" are not the same concept. The sigma-algebra formalizes exactly this distinction. For engineers, the practical import is that probability statements about continuous random variables should always be framed in terms of Borel sets (intervals and their countable combinations) β€” and one never needs to worry about Vitali sets in any physical application.

Definition:

Borel Sigma-Algebra

The Borel sigma-algebra on R\mathbb{R}, denoted B(R)\mathcal{B}(\mathbb{R}), is the smallest sigma-algebra containing all open intervals (a,b)(a, b) for a<b∈Ra < b \in \mathbb{R}.

Equivalently, B(R)\mathcal{B}(\mathbb{R}) is generated by any of the following:

  • All open sets in R\mathbb{R}.
  • All half-open intervals (βˆ’βˆž,x](-\infty, x] for x∈Rx \in \mathbb{R}.
  • All closed sets in R\mathbb{R}.

The restriction to [0,1][0,1] gives B([0,1])\mathcal{B}([0,1]); the generalization to Rd\mathbb{R}^d gives B(Rd)\mathcal{B}(\mathbb{R}^d), generated by open rectangles (hyper-rectangles in dd dimensions).

The equivalence of these three generating families is a non-trivial theorem (proved in any measure theory course). For our purposes, the take-away is that the Borel sigma-algebra on R\mathbb{R} contains every set we will ever encounter in practice: intervals of every type, level sets of continuous functions, countable sets, and so on.

,

Key Takeaway

For finite and countably infinite Ξ©\Omega, use the power set. For uncountable Ξ©\Omega (the continuous case), use the Borel sigma-algebra. In both cases, the sigma-algebra specifies which events can be assigned a probability. From Chapter 5 onward, when random variables map outcomes into R\mathbb{R}, the Borel sigma-algebra is always in the background.

Sigma-Algebras for Different Sample Spaces

Sample Space Ξ©\OmegaNatural F\mathcal{F}Size / StructureExample
Finite: {Ο‰1,…,Ο‰N}\{\omega_1,\ldots,\omega_N\}Power set 2Ξ©2^\Omega2N2^N elementsCoin flip, die roll, BPSK symbol
Countably infinite: N\mathbb{N}Power set 2N2^{\mathbb{N}}Uncountably many setsNumber of retransmissions
Uncountable: R\mathbb{R}Borel B(R)\mathcal{B}(\mathbb{R})Uncountably many; excludes Vitali setsReceived signal amplitude
Uncountable: Rn\mathbb{R}^nBorel B(Rn)\mathcal{B}(\mathbb{R}^n)Generated by open rectanglesMIMO received vector

Sigma-Algebra

A collection F\mathcal{F} of subsets of Ξ©\Omega that contains βˆ…\emptyset and is closed under complementation and countable unions. Specifies which events can be assigned a probability.

Related: Event, Borel Sigma-Algebra, Probability Space

Borel Sigma-Algebra

The smallest sigma-algebra on R\mathbb{R} containing all open intervals. Standard choice for continuous probability spaces.

Related: Sigma-Algebra

Common Mistake: Not Every Subset Is an Event

Mistake:

When Ξ©\Omega is uncountable, it is tempting to assume that every subset can be assigned a probability. After all, what does it even mean for P(A)\mathbb{P}(A) to be undefined for some set AA?

Correction:

In the continuous setting, probability is defined only for measurable sets β€” elements of the sigma-algebra F\mathcal{F}. Non-measurable sets like the Vitali set genuinely exist (under the Axiom of Choice), but they never arise from any physical measurement or engineering calculation. The practical rule: if you specify an event using a finite combination of inequalities involving a continuous random variable ({X≀3}\{X \leq 3\}, {X2+Y2<1}\{X^2 + Y^2 < 1\}, etc.), it is automatically a Borel set and is measurable.