Order Statistics

Why Order Statistics?

Given nn i.i.d. random variables, the order statistics are the sorted values. The minimum and maximum are the simplest β€” and the most useful. In wireless communications, the SNR after selection combining is the maximum of LL i.i.d. channel gains, and the outage capacity is governed by the minimum eigenvalue of a random matrix. Understanding the distribution of extremes is essential.

Definition:

Order Statistics

Let X1,X2,…,XnX_1, X_2, \ldots, X_n be i.i.d. random variables with common CDF FXF_{X} and PDF fXf_{X}. The order statistics are the sorted values

X(1)≀X(2)≀⋯≀X(n),X_{(1)} \le X_{(2)} \le \cdots \le X_{(n)},

where X(1)=min⁑(X1,…,Xn)X_{(1)} = \min(X_1, \ldots, X_n) and X(n)=max⁑(X1,…,Xn)X_{(n)} = \max(X_1, \ldots, X_n).

Theorem: CDF of the Minimum and Maximum

For nn i.i.d. RVs with common CDF FXF_{X}:

  1. Maximum: FX(n)(x)=[FX(x)]nF_{X_{(n)}}(x) = [F_{X}(x)]^n.

  2. Minimum: FX(1)(x)=1βˆ’[1βˆ’FX(x)]nF_{X_{(1)}}(x) = 1 - [1 - F_{X}(x)]^n.

Theorem: PDF of the kk-th Order Statistic

The PDF of the kk-th order statistic X(k)X_{(k)} from nn i.i.d. continuous RVs with CDF FXF_{X} and PDF fXf_{X} is

fX(k)(x)=n!(kβˆ’1)!(nβˆ’k)! [FX(x)]kβˆ’1 [1βˆ’FX(x)]nβˆ’k fX(x).f_{X_{(k)}}(x) = \frac{n!}{(k-1)!(n-k)!}\, [F_{X}(x)]^{k-1}\,[1 - F_{X}(x)]^{n-k}\,f_{X}(x).

Example: Maximum of nn Uniform Random Variables

Let X1,…,XnX_1, \ldots, X_n be i.i.d. Uniform[0,1]\text{Uniform}[0,1]. Find the CDF and PDF of M=max⁑(X1,…,Xn)M = \max(X_1, \ldots, X_n) and compute E[M]\mathbb{E}[M].

Order Statistics of i.i.d. Uniforms

Visualize the PDF of the kk-th order statistic from nn i.i.d. Uniform[0,1]\text{Uniform}[0,1] random variables. Adjust nn and kk to see how the distribution shifts.

Parameters
5
5

Why This Matters: Selection Combining and Order Statistics

In a diversity receiver with LL independent branches, selection combining chooses the branch with the highest SNR. If the branch SNRs are i.i.d. Exp(Ξ³Λ‰)\text{Exp}(\bar{\gamma}) (Rayleigh fading), then the post-combining SNR is Ξ³SC=X(L)\gamma_{\text{SC}} = X_{(L)}, the maximum order statistic. Its CDF is FΞ³SC(x)=(1βˆ’eβˆ’x/Ξ³Λ‰)LF_{\gamma_{\text{SC}}}(x) = (1 - e^{-x/\bar{\gamma}})^L. This directly gives the outage probability as a function of the number of diversity branches β€” a fundamental tradeoff in receiver design.

Common Mistake: Order Statistic Index Out of Range

Mistake:

Setting k>nk > n or k<1k < 1 in the kk-th order statistic formula.

Correction:

The order statistic X(k)X_{(k)} is only defined for 1≀k≀n1 \le k \le n. The interactive plot should clamp kk to this range. Attempting k>nk > n produces a meaningless result.

Quick Check

If X1,…,XnX_1, \ldots, X_n are i.i.d. Exp(Ξ»)\text{Exp}(\lambda), what is the distribution of X(1)=min⁑(X1,…,Xn)X_{(1)} = \min(X_1, \ldots, X_n)?

Exp(nΞ»)\text{Exp}(n\lambda)

Exp(Ξ»/n)\text{Exp}(\lambda/n)

Gamma(n,Ξ»)\text{Gamma}(n, \lambda)

Exp(Ξ»)\text{Exp}(\lambda)

Key Takeaway

The CDF of the maximum of nn i.i.d. RVs is [FX(x)]n[F_{X}(x)]^n; for the minimum it is 1βˆ’[1βˆ’FX(x)]n1 - [1 - F_{X}(x)]^n. The minimum of nn i.i.d. exponentials with rate Ξ»\lambda is again exponential with rate nΞ»n\lambda β€” a result used constantly in diversity analysis and reliability engineering.