PDF, CDF, and the Relationship Between Them

From Discrete to Continuous

In the discrete setting, every outcome carries a positive probability mass. For a continuous random variable, the probability of any single point is zero β€” and yet the random variable still has a well-defined distribution. The resolution of this apparent paradox is the probability density function: a non-negative function whose integral over any interval gives the probability of falling in that interval. The CDF, which we already defined for discrete RVs, remains valid and becomes the bridge between the discrete and continuous worlds.

Definition:

Continuous Random Variable

A random variable XX is continuous if its CDF F(x)=P(X≀x)F(x) = \mathbb{P}(X \leq x) is a continuous function of xx. Equivalently, P(X=x)=0\mathbb{P}(X = x) = 0 for every x∈Rx \in \mathbb{R}.

Continuity of the CDF is a stronger condition than merely having P(X=x)=0\mathbb{P}(X = x) = 0 for all xx; it requires that the CDF has no jumps anywhere.

Definition:

Probability Density Function (PDF)

Let XX be a continuous random variable with CDF FF. The probability density function (PDF) of XX is the function f:Rβ†’[0,∞)f : \mathbb{R} \to [0, \infty) satisfying

F(x)=βˆ«βˆ’βˆžxf(u) duforΒ allΒ x∈R.F(x) = \int_{-\infty}^{x} f(u)\,du \quad \text{for all } x \in \mathbb{R}.

Equivalently, for any interval (a,b](a, b]:

P(a<X≀b)=∫abf(x) dx.\mathbb{P}(a < X \leq b) = \int_a^b f(x)\,dx.

The PDF is not a probability β€” it can exceed 1. The quantity f(x) dxf(x)\,dx represents the infinitesimal probability mass near xx: P(x<X≀x+dx)β‰ˆf(x) dx\mathbb{P}(x < X \leq x + dx) \approx f(x)\,dx.

Theorem: Properties of the PDF

Let ff be the PDF of a continuous RV XX. Then:

  1. f(x)β‰₯0f(x) \geq 0 for all x∈Rx \in \mathbb{R}.
  2. βˆ«βˆ’βˆžβˆžf(x) dx=1\int_{-\infty}^{\infty} f(x)\,dx = 1.
  3. For any Borel set AβŠ†RA \subseteq \mathbb{R}: P(X∈A)=∫Af(x) dx\mathbb{P}(X \in A) = \int_A f(x)\,dx.

Conversely, any non-negative function satisfying property 2 is a valid PDF.

Theorem: Fundamental Theorem: Differentiating the CDF

If FF is differentiable at xx, then

f(x)=ddxF(x)=Fβ€²(x).f(x) = \frac{d}{dx}F(x) = F'(x).

This is the continuous analog of the relation pX(xi)=F(xi)βˆ’F(xiβˆ’)p_X(x_i) = F(x_i) - F(x_i^-) for discrete RVs.

The CDF accumulates probability from βˆ’βˆž-\infty to xx. Its rate of accumulation at xx is exactly the density at xx.

Theorem: Properties of the CDF

The CDF F(x)=P(X≀x)F(x) = \mathbb{P}(X \leq x) of any random variable XX satisfies:

  1. lim⁑xβ†’βˆ’βˆžF(x)=0\lim_{x \to -\infty} F(x) = 0 and lim⁑xβ†’+∞F(x)=1\lim_{x \to +\infty} F(x) = 1.
  2. FF is non-decreasing: x≀yβ€…β€ŠβŸΉβ€…β€ŠF(x)≀F(y)x \leq y \implies F(x) \leq F(y).
  3. FF is right-continuous: lim⁑h↓0F(x+h)=F(x)\lim_{h \downarrow 0} F(x + h) = F(x).

Moreover, these three properties characterize CDFs: any function satisfying all three is the CDF of some random variable.

Example: Computing Probabilities from the CDF

Let XX have CDF

F(x)={0x<01βˆ’eβˆ’2xxβ‰₯0.F(x) = \begin{cases} 0 & x < 0 \\ 1 - e^{-2x} & x \geq 0. \end{cases}

Compute P(1<X≀3)\mathbb{P}(1 < X \leq 3) and P(X>2)\mathbb{P}(X > 2).

Example: Deriving the PDF from a Given CDF

A random variable XX has CDF

F(x)={0x<0x20≀x<11xβ‰₯1.F(x) = \begin{cases} 0 & x < 0 \\ x^2 & 0 \leq x < 1 \\ 1 & x \geq 1. \end{cases}

Find the PDF f(x)f(x) and compute P(1/4<X≀3/4)\mathbb{P}(1/4 < X \leq 3/4).

Example: The PDF Can Exceed 1

Let X∼Uniform[0,1/3]X \sim \text{Uniform}[0, 1/3]. Show that f(x)f(x) exceeds 1 for all xx in the support, and verify that XX is nonetheless a valid random variable.

Common Mistake: Confusing PDF Values with Probabilities

Mistake:

Interpreting f(x)=3f(x) = 3 as meaning "the probability of X=xX = x is 3."

Correction:

For a continuous RV, P(X=x)=0\mathbb{P}(X = x) = 0 for every xx. The PDF is a density: f(x) dxf(x)\,dx is the infinitesimal probability mass near xx. The density can be arbitrarily large; only its integral over any set must lie in [0,1][0, 1].

Quick Check

If F(x)=1βˆ’eβˆ’3xF(x) = 1 - e^{-3x} for xβ‰₯0x \geq 0 (and 00 for x<0x < 0), what is f(2)f(2)?

1βˆ’eβˆ’61 - e^{-6}

3eβˆ’63e^{-6}

eβˆ’6e^{-6}

6eβˆ’36e^{-3}

Historical Note: The CDF: From Laplace to Kolmogorov

1812–1933

The idea of describing a random variable through its cumulative distribution dates back to Laplace's work on the "generating function of errors" in the early 19th century. However, the modern axiomatic treatment β€” where the CDF is the primary object and the PDF is derived from it β€” was established by Kolmogorov in his 1933 Grundbegriffe der Wahrscheinlichkeitsrechnung. Kolmogorov's key insight was that the CDF, being defined through the probability measure alone, works for discrete, continuous, and mixed distributions alike, without any need to distinguish cases at the foundational level.

Probability Density Function (PDF)

A non-negative function ff such that P(X∈A)=∫Af(x) dx\mathbb{P}(X \in A) = \int_A f(x)\,dx for every Borel set AA. The PDF is the derivative of the CDF wherever it exists.

Related: Cumulative Distribution Function (CDF)

Cumulative Distribution Function (CDF)

The function F(x)=P(X≀x)F(x) = \mathbb{P}(X \leq x). It is non-decreasing, right-continuous, with lim⁑xβ†’βˆ’βˆžF(x)=0\lim_{x \to -\infty} F(x) = 0 and lim⁑xβ†’βˆžF(x)=1\lim_{x \to \infty} F(x) = 1.

Related: Probability Density Function (PDF)

Key Takeaway

The CDF is the universal descriptor of any random variable β€” discrete, continuous, or mixed. The PDF exists only for continuous RVs and is obtained by differentiating the CDF. For any interval, P(a<X≀b)=F(b)βˆ’F(a)=∫abf(x) dx\mathbb{P}(a < X \leq b) = F(b) - F(a) = \int_a^b f(x)\,dx.

PDF and CDF Explorer

Visualize the PDF and CDF of a Gaussian distribution. Adjust ΞΌ\mu and Οƒ\sigma and observe how the area under the PDF curve corresponds to the CDF value.

Parameters
0
1
1

Point at which to evaluate the CDF

Why This Matters: CDF and Outage Probability in Wireless

In wireless communications, the CDF of the received signal-to-noise ratio (SNR) Ξ³\gamma directly gives the outage probability: Pout(Ξ³0)=FΞ³(Ξ³0)=P(γ≀γ0)P_{\text{out}}(\gamma_0) = F_\gamma(\gamma_0) = \mathbb{P}(\gamma \leq \gamma_0). Designing a system to meet a target outage probability is equivalent to ensuring that the CDF of the SNR falls below the required threshold at the target rate. This is why the CDF β€” and the PDF from which it derives β€” is the fundamental tool in fading channel analysis.