Distributions and Sampling

Why Statistics Is Central to Wireless Simulation

Every wireless system is inherently stochastic: noise is random, fading is random, user locations are random, and data bits are random. To simulate these systems faithfully, you need to draw samples from the correct distributions, verify your random number generators produce the right statistics, and generate correlated random vectors to model realistic channel conditions.

This section shows you how scipy.stats gives you a unified interface to over 100 distributions, how to fit distributions to data, and how to generate correlated Gaussian vectors via the Cholesky decomposition.

Definition:

Probability Density Function (PDF)

A continuous random variable X has probability density function f_X(x) satisfying:

P(a \le X \le b) = \int_a^b f_X(x)\,dx, \quad f_X(x) \ge 0, \quad \int_{-\infty}^{\infty} f_X(x)\,dx = 1

In scipy.stats, every distribution object provides pdf(x), cdf(x), rvs(size), fit(data), and ppf(q) (quantile function).

from scipy.stats import norm
X = norm(loc=0, scale=1)      # standard normal
X.pdf(0.0)                    # f(0) = 0.3989...
X.cdf(1.96)                   # P(X <= 1.96) = 0.975
samples = X.rvs(size=10000)   # draw 10000 samples

The loc and scale parameters shift and scale any distribution. For norm, loc is the mean and scale is the standard deviation.
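A quick sketch of the loc/scale convention, using expon purely as an illustration: scale stretches the standard form of the distribution, while loc shifts it.

```python
from scipy.stats import expon, norm

# scale stretches the unit-rate exponential: Exp with scale=2 has mean 2
Y = expon(loc=0, scale=2.0)
mean_Y = Y.mean()                      # 2.0

# for norm, loc/scale are exactly the mean and standard deviation
X = norm(loc=3.0, scale=0.5)
mu_X, sigma_X = X.mean(), X.std()      # 3.0, 0.5
```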

Definition:

CDF and Quantile (Percent-Point) Function

The cumulative distribution function (CDF) is:

F_X(x) = P(X \le x) = \int_{-\infty}^{x} f_X(t)\,dt

Its inverse F_X^{-1}(q) is the quantile function (ppf in SciPy): for 0 < q < 1, F_X^{-1}(q) returns the value x such that P(X \le x) = q.

from scipy.stats import norm
norm.ppf(0.975)    # 1.96, the 97.5th percentile
norm.isf(0.025)    # 1.96, inverse survival function

The quantile function is essential for computing confidence intervals and critical values in hypothesis testing.
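A minimal sketch of that use, assuming a normal sample and the known-variance form of the interval: ppf supplies the 1.96 critical value for a two-sided 95% confidence interval on the mean.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=2.0, size=10_000)

z = norm.ppf(0.975)                        # two-sided 95% critical value, 1.9599...
half_width = z * x.std(ddof=1) / np.sqrt(len(x))
ci = (x.mean() - half_width, x.mean() + half_width)
```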

Definition:

Gaussian (Normal) Distribution

The Gaussian distribution X \sim \mathcal{N}(\mu, \sigma^2) has PDF:

f_X(x) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)

The Q-function Q(x) = 1 - \Phi(x) gives the tail probability and appears throughout BER analysis.

import numpy as np
from scipy.stats import norm
from scipy.special import erfc

Q = lambda x: 0.5 * erfc(x / np.sqrt(2))
# Equivalent:
Q_scipy = lambda x: norm.sf(x)           # survival function

For BPSK over AWGN, the BER is exactly Q\!\left(\sqrt{2E_b/N_0}\right). This is the single most important formula in digital communications.
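A Monte Carlo sanity check of this formula is a useful habit. The sketch below assumes unit-energy bits, real AWGN with variance N_0/2, and an arbitrary operating point Eb/N0 = 4 (about 6 dB):

```python
import numpy as np
from scipy.stats import norm

Q = lambda x: norm.sf(x)

ebn0 = 4.0                                  # Eb/N0 on a linear scale (~6 dB)
ber_theory = Q(np.sqrt(2 * ebn0))           # exact BPSK-over-AWGN BER

rng = np.random.default_rng(1)
N = 200_000
bits = rng.integers(0, 2, N)
s = 1.0 - 2.0 * bits                        # BPSK mapping: 0 -> +1, 1 -> -1
noise = rng.standard_normal(N) / np.sqrt(2 * ebn0)   # noise std for unit Eb
bits_hat = (s + noise) < 0                  # threshold detector: decide 1 if r < 0
ber_sim = np.mean(bits_hat != bits.astype(bool))
```

With 200k bits the simulated BER should land within a few percent of the theoretical value.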

Definition:

Circularly Symmetric Complex Gaussian

A circularly symmetric complex Gaussian (CSCG) random variable Z \sim \mathcal{CN}(0, \sigma^2) is defined as:

Z = X + jY, \quad X, Y \sim \mathcal{N}(0, \sigma^2/2) \text{ independent}

The envelope |Z| follows a Rayleigh distribution and the power |Z|^2 follows an exponential distribution with mean \sigma^2.

import numpy as np

rng = np.random.default_rng(42)
sigma2 = 1.0
z = rng.standard_normal(10000) * np.sqrt(sigma2/2) \
  + 1j * rng.standard_normal(10000) * np.sqrt(sigma2/2)
# |z| is Rayleigh, |z|^2 is exponential
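A sketch that checks both claims with a KS test; the scale arguments follow from the sigma2/2 variance per real dimension:

```python
import numpy as np
from scipy.stats import kstest

rng = np.random.default_rng(42)
sigma2 = 1.0
n = 10_000
z = (rng.standard_normal(n) + 1j * rng.standard_normal(n)) * np.sqrt(sigma2 / 2)

# envelope |z| ~ Rayleigh(scale = sqrt(sigma2/2)); power |z|^2 ~ Exp(mean = sigma2)
d_env, p_env = kstest(np.abs(z), 'rayleigh', args=(0, np.sqrt(sigma2 / 2)))
d_pow, p_pow = kstest(np.abs(z)**2, 'expon', args=(0, sigma2))
```

Large p-values (and KS statistics near zero) indicate the samples are consistent with the stated distributions.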

Definition:

Multivariate Gaussian and Covariance Matrix

A random vector \mathbf{x} \sim \mathcal{N}(\boldsymbol{\mu}, \mathbf{R}) has joint PDF:

f(\mathbf{x}) = \frac{1}{(2\pi)^{n/2} |\mathbf{R}|^{1/2}} \exp\!\left(-\frac{1}{2}(\mathbf{x}-\boldsymbol{\mu})^T \mathbf{R}^{-1} (\mathbf{x}-\boldsymbol{\mu})\right)

where \mathbf{R} = E[(\mathbf{x}-\boldsymbol{\mu})(\mathbf{x}-\boldsymbol{\mu})^T] is the covariance matrix (positive semi-definite).

import numpy as np
from scipy.stats import multivariate_normal

R = np.array([[1.0, 0.8], [0.8, 1.0]])
rv = multivariate_normal(mean=[0, 0], cov=R)
samples = rv.rvs(size=5000)
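As a sanity check (sample size bumped up here for a tight estimate), the empirical covariance of the draws should approach R:

```python
import numpy as np
from scipy.stats import multivariate_normal

R = np.array([[1.0, 0.8], [0.8, 1.0]])
rv = multivariate_normal(mean=[0, 0], cov=R, seed=7)
samples = rv.rvs(size=50_000)          # shape (50000, 2): one row per sample

R_hat = np.cov(samples.T)              # empirical 2x2 covariance matrix
max_err = np.abs(R_hat - R).max()      # should shrink like 1/sqrt(N)
```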

Definition:

Generating Correlated Gaussian Samples via Cholesky

To generate \mathbf{x} \sim \mathcal{N}(\boldsymbol{\mu}, \mathbf{R}):

  1. Compute the Cholesky factor: \mathbf{R} = \mathbf{L}\mathbf{L}^T
  2. Draw i.i.d. standard normals: \mathbf{z} \sim \mathcal{N}(\mathbf{0}, \mathbf{I})
  3. Set \mathbf{x} = \boldsymbol{\mu} + \mathbf{L}\mathbf{z}

Then \mathrm{Cov}[\mathbf{x}] = \mathbf{L}\,\mathbf{I}\,\mathbf{L}^T = \mathbf{R}.

L = np.linalg.cholesky(R)    # lower triangular
z = rng.standard_normal((n, N_samples))
x = mu[:, None] + L @ z      # each column is a correlated sample

This is far more efficient than multivariate_normal.rvs() when you need millions of samples: compute \mathbf{L} once (O(n^3)), then each sample costs only O(n^2).
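A self-contained sketch of the three steps, with an empirical check of both mean and covariance (mu and R here are arbitrary illustrative values):

```python
import numpy as np

rng = np.random.default_rng(0)
R = np.array([[1.0, 0.8], [0.8, 1.0]])
mu = np.array([1.0, -1.0])
n, N_samples = 2, 100_000

L = np.linalg.cholesky(R)                # step 1: R = L @ L.T, L lower triangular
z = rng.standard_normal((n, N_samples))  # step 2: i.i.d. N(0, 1) entries
x = mu[:, None] + L @ z                  # step 3: each column ~ N(mu, R)

mean_err = np.abs(x.mean(axis=1) - mu).max()
cov_err = np.abs(np.cov(x) - R).max()    # np.cov treats rows as variables
```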

Theorem: Linear Transformation of Gaussian Vectors

If \mathbf{x} \sim \mathcal{N}(\boldsymbol{\mu}, \mathbf{R}) and \mathbf{y} = \mathbf{A}\mathbf{x} + \mathbf{b}, then:

\mathbf{y} \sim \mathcal{N}(\mathbf{A}\boldsymbol{\mu} + \mathbf{b},\; \mathbf{A}\mathbf{R}\mathbf{A}^T)

In particular, if \mathbf{R} = \mathbf{L}\mathbf{L}^T and \mathbf{z} \sim \mathcal{N}(\mathbf{0}, \mathbf{I}), then \mathbf{x} = \mathbf{L}\mathbf{z} \sim \mathcal{N}(\mathbf{0}, \mathbf{R}).

A linear transformation of a Gaussian is still Gaussian. The mean transforms linearly and the covariance picks up \mathbf{A} on both sides (the "sandwich" formula).
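A numeric check of the sandwich formula, with A, b, mu, and R chosen arbitrarily:

```python
import numpy as np

rng = np.random.default_rng(3)
R = np.array([[2.0, 0.5], [0.5, 1.0]])
mu = np.array([1.0, 2.0])
A = np.array([[1.0, -1.0], [0.5, 2.0]])
b = np.array([0.0, 3.0])

# draw x ~ N(mu, R) via Cholesky, then apply the affine map
x = mu[:, None] + np.linalg.cholesky(R) @ rng.standard_normal((2, 200_000))
y = A @ x + b[:, None]

mean_theory = A @ mu + b                 # mean transforms linearly
cov_theory = A @ R @ A.T                 # "sandwich" covariance
mean_err = np.abs(y.mean(axis=1) - mean_theory).max()
cov_err = np.abs(np.cov(y) - cov_theory).max()
```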

Theorem: Maximum Likelihood Estimation for Gaussian Parameters

Given N i.i.d. samples x_1, \dots, x_N from \mathcal{N}(\mu, \sigma^2), the maximum likelihood estimators are:

\hat{\mu}_{\mathrm{ML}} = \frac{1}{N}\sum_{i=1}^N x_i = \bar{x}, \qquad \hat{\sigma}^2_{\mathrm{ML}} = \frac{1}{N}\sum_{i=1}^N (x_i - \bar{x})^2

The MLE for \sigma^2 is biased; the unbiased estimator divides by N-1 (Bessel's correction), which is what np.var(x, ddof=1) computes.

The log-likelihood is a concave quadratic in \mu and concave in \log\sigma^2. Setting the gradient to zero gives the sample mean and the (biased) sample variance.
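A sketch contrasting the biased MLE with Bessel's correction; since norm.fit returns the MLEs, its fitted scale squared should match np.var with ddof=0:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(5)
x = rng.normal(loc=2.0, scale=3.0, size=100_000)

mu_ml = x.mean()
var_ml = np.var(x, ddof=0)             # MLE: divides by N (biased low)
var_unbiased = np.var(x, ddof=1)       # Bessel's correction: divides by N-1

loc_hat, scale_hat = norm.fit(x)       # closed-form Gaussian MLEs (loc, scale)
```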

Theorem: Goodness of Fit via KS Statistic

The Kolmogorov-Smirnov statistic between the empirical CDF \hat{F}_N(x) and a reference CDF F_0(x) is:

D_N = \sup_x |\hat{F}_N(x) - F_0(x)|

Under H_0: X \sim F_0, \sqrt{N}\,D_N \to K where K has the Kolmogorov distribution. scipy.stats.kstest returns (D_N, p).

The KS statistic measures the worst-case vertical gap between the empirical and theoretical CDFs. A small D_N (large p-value) means the data is consistent with F_0.
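A minimal kstest sketch: data actually drawn from N(0, 1) versus exponential data, both tested against the standard normal reference:

```python
import numpy as np
from scipy.stats import kstest

rng = np.random.default_rng(11)
good = rng.standard_normal(5_000)      # truly N(0, 1)
bad = rng.exponential(size=5_000)      # not N(0, 1)

d_good, p_good = kstest(good, 'norm')  # expect small D, large p
d_bad, p_bad = kstest(bad, 'norm')     # expect large D, tiny p: H0 rejected
```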

Example: Exploring Distributions with scipy.stats

Use scipy.stats to create Rayleigh, Ricean, and Nakagami distributions, compute their PDFs, and verify that rvs() samples match the theoretical PDF.

Example: Generating Correlated Fading Samples

Generate 10000 pairs of correlated Rayleigh fading coefficients with correlation \rho = 0.8 using the Cholesky method, and verify the sample correlation matches the target.
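One way to sketch this, with a subtlety worth flagging: applying the Cholesky factor with coefficient rho to the underlying Gaussians gives the complex coefficients correlation rho, but the resulting power correlation is rho^2, not rho, so hitting a target envelope correlation requires correlating the Gaussians more strongly. This sketch measures the power correlation against the rho^2 prediction:

```python
import numpy as np

rng = np.random.default_rng(9)
rho, N = 0.8, 100_000
L = np.linalg.cholesky(np.array([[1.0, rho], [rho, 1.0]]))

# correlate real and imaginary parts independently with the same factor
zr = L @ rng.standard_normal((2, N))
zi = L @ rng.standard_normal((2, N))
h = (zr + 1j * zi) / np.sqrt(2)        # two CN(0, 1) rows with correlation rho

# |h| rows are correlated Rayleigh envelopes; power correlation should be rho^2
power_corr = np.corrcoef(np.abs(h[0])**2, np.abs(h[1])**2)[0, 1]
```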

Example: Fitting a Distribution to Channel Measurement Data

Given simulated channel power data, fit Rayleigh, Rice, and Nakagami distributions and determine which best fits the data using the KS test.

Distribution Explorer

Compare PDF, CDF, and histogram of scipy.stats distributions. Adjust distribution parameters and sample size to see how well the empirical distribution matches the theoretical one.


Correlated Gaussian Sampling via Cholesky

Visualize how the Cholesky decomposition transforms i.i.d. Gaussian samples into correlated pairs. Adjust the correlation coefficient and observe the scatter plot and marginal histograms.


Central Limit Theorem in Action

Watch the distribution of the sample mean converge to a Gaussian as the number of averaged samples increases. Start with any distribution (uniform, exponential, Rayleigh) and see the CLT at work.


Wireless Distribution Family Tree

Relationships between common distributions in wireless communications. A complex Gaussian envelope yields Rayleigh; adding a LOS component gives Rice; Nakagami generalizes both.

Distributions and Sampling (ch09/python/distributions.py)

Complete scipy.stats distribution exploration, fitting, and Cholesky sampling.

Quick Check

To generate \mathbf{x} \sim \mathcal{N}(\mathbf{0}, \mathbf{R}) via Cholesky, you compute \mathbf{R} = \mathbf{L}\mathbf{L}^T and then set \mathbf{x} = \mathbf{L}\mathbf{z} where \mathbf{z} \sim \mathcal{N}(\mathbf{0}, \mathbf{I}). What happens if \mathbf{R} is not positive definite?

The samples will have the wrong mean

np.linalg.cholesky raises a LinAlgError

The samples will be uncorrelated

The resulting distribution will be Rayleigh instead of Gaussian

Common Mistake: Using Global Seeds Instead of RNG Objects

Mistake:

Using np.random.seed(42) and then np.random.randn(...) for reproducibility. This uses the legacy global state, which is not thread-safe and causes subtle bugs in parallel simulations.

Correction:

Use the new np.random.default_rng(42) and call methods on the returned Generator object. Each parallel worker should get its own RNG with a different seed (or use SeedSequence.spawn()).

Common Mistake: Complex Gaussian Variance Convention

Mistake:

Writing z = rng.standard_normal(N) + 1j * rng.standard_normal(N) to get \mathcal{CN}(0, 1). This produces \mathrm{Var}[z] = 2, not 1.

Correction:

Divide by \sqrt{2}: z = (rng.standard_normal(N) + 1j * rng.standard_normal(N)) / np.sqrt(2). Now E[|z|^2] = 1 as intended.
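A quick numeric illustration of the two conventions:

```python
import numpy as np

rng = np.random.default_rng(2)
N = 100_000

z_wrong = rng.standard_normal(N) + 1j * rng.standard_normal(N)                 # Var ~ 2
z_right = (rng.standard_normal(N) + 1j * rng.standard_normal(N)) / np.sqrt(2)  # Var ~ 1

p_wrong = np.mean(np.abs(z_wrong)**2)
p_right = np.mean(np.abs(z_right)**2)
```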

Why This Matters: From Distributions to Channel Models

The distributions in this section are not abstract mathematics: they are the building blocks of wireless channel models. A Rayleigh fading channel coefficient is literally h \sim \mathcal{CN}(0, 1); a Ricean channel with K-factor adds a deterministic LOS component. The Cholesky method for generating correlated samples is exactly how correlated MIMO channels are simulated in Section 9.4.

See full treatment in Generating Fading Channels

Historical Note: Gauss and the Normal Distribution

19th century

Carl Friedrich Gauss derived the normal distribution in 1809 while studying the distribution of astronomical observation errors. The characteristic bell curve bears his name in many languages ("Gauss-Verteilung" in German). The Central Limit Theorem, which explains why the Gaussian is so ubiquitous, was proven rigorously by Lyapunov in 1901.

Historical Note: Lord Rayleigh and the Distribution of Wave Amplitudes

19th century

John William Strutt, 3rd Baron Rayleigh, derived the Rayleigh distribution in 1880 while studying the intensity of sound waves resulting from many superimposed sources. The same distribution describes the envelope of a narrowband signal that is the sum of many scattered multipath components, the physical basis of Rayleigh fading in wireless communications.

Probability Density Function (PDF)

A function f_X(x) such that P(a \le X \le b) = \int_a^b f_X(x)\,dx.

Related: Cumulative Distribution Function (CDF)

Cumulative Distribution Function (CDF)

FX(x)=P(X≀x)F_X(x) = P(X \le x), a non-decreasing function from 0 to 1.

Related: Probability Density Function (PDF)

Cholesky Decomposition

Factorization of a positive-definite matrix as \mathbf{R} = \mathbf{L}\mathbf{L}^T where \mathbf{L} is lower triangular. Used to generate correlated random samples.

Related: Covariance Matrix

Covariance Matrix

\mathbf{R} = E[(\mathbf{x}-\boldsymbol{\mu})(\mathbf{x}-\boldsymbol{\mu})^T], a positive semi-definite matrix encoding pairwise correlations.

Related: Cholesky Decomposition

Maximum Likelihood Estimation (MLE)

A method that finds parameter values maximizing the likelihood function L(\theta) = \prod_i f(x_i; \theta).

Q-function

The tail probability of the standard normal: Q(x) = 1 - \Phi(x) = \frac{1}{2}\mathrm{erfc}(x/\sqrt{2}). Appears in BER expressions for many modulation schemes.