Functions of Random Variables and Moment-Generating Functions
Why Transformations of Random Variables Matter
In telecommunications we rarely observe a random variable "as is." If represents Gaussian noise at the receiver, the signal power is , the envelope is , and the log-amplitude is . Each of these is a function of , and its distribution is generally different from that of itself.
Consider the complex baseband received signal in a narrowband channel with no line-of-sight component: where are independent (by the central limit theorem applied to many scattered paths). The envelope turns out to follow a Rayleigh distribution; the instantaneous power is exponential; and if a deterministic line-of-sight component is present the envelope becomes Ricean. The sum of squared independent standard normals yields the chi-squared distribution , which appears in hypothesis testing and in the analysis of diversity combiners.
All of these distributions are derived from the Gaussian via elementary transformations. This section develops the general machinery for finding the distribution of given the distribution of , introduces moment-generating and characteristic functions as bookkeeping devices for moments, and catalogues the fading distributions that arise in wireless channels.
Theorem: CDF Method for Functions of a Random Variable
Let be a continuous random variable with known CDF and PDF , and let where is a measurable function. Then the CDF of is and, wherever is differentiable,
The idea is conceptually the simplest possible: translate the event into an equivalent event involving , compute its probability using , and differentiate. The difficulty lies entirely in characterizing the set , which depends on the shape of .
Write and identify the region in -space.
For monotone increasing , the region is .
Proof for strictly increasing $g$
Assume is strictly increasing and differentiable, so that exists and is also strictly increasing. Then and therefore Differentiating with respect to via the chain rule: Since is strictly increasing, , so the absolute value is superfluous here. For strictly decreasing the inequality flips, introducing a minus sign that the absolute value absorbs.
Theorem: PDF of a Monotone Transformation
Let be a continuous random variable with PDF , and let where is strictly monotone (either increasing or decreasing) and differentiable on the support of . Then for in the range of , and otherwise.
The PDF is a density: it measures probability per unit length. When stretches or compresses the -axis, the density must be rescaled by the reciprocal of the local stretching factor . The absolute value ensures the density remains nonnegative regardless of whether is increasing or decreasing.
Apply the CDF method from Theorem thm-cdf-method.
Handle the increasing and decreasing cases separately, then unify with the absolute value.
Case 1: $g$ strictly increasing
From the CDF method, . Differentiating: Since is increasing, is also increasing, so and .
Case 2: $g$ strictly decreasing
When is strictly decreasing, is also strictly decreasing, so . Thus Differentiating: Since is decreasing, , so .
Unified formula
Combining both cases: This completes the proof.
Example: Affine Transformation of a Gaussian
Let and with . Find the distribution of .
Step 1: Identify the transformation
The function is strictly monotone (increasing if , decreasing if ). Its inverse is , and , so .
Step 2: Apply the monotone-transform formula
$
Step 3: Identify the result
This is the PDF of . Therefore The Gaussian family is closed under affine transformations: shifting changes the mean, scaling changes both mean and variance (the variance scales by , not ).
Definition: Chi-Squared Distribution
Chi-Squared Distribution
Let be independent standard normal random variables, . The random variable has the chi-squared distribution with degrees of freedom, written . Its PDF is where is the gamma function.
Key moments:
- .
- .
The chi-squared distribution is a special case of the gamma distribution: .
In detection theory, the test statistic for an energy detector is a sum of squared samples. Under the null hypothesis (noise only), each sample is Gaussian, so the test statistic is distributed. The number of degrees of freedom equals the time-bandwidth product, connecting the chi-squared distribution directly to spectrum sensing in cognitive radio.
Definition: Rayleigh Distribution
Rayleigh Distribution
Let be independent random variables with . The envelope of the complex signal is The random variable has the Rayleigh distribution with parameter , whose PDF is
Key moments:
- .
- .
- .
The Rayleigh distribution models the envelope of the received signal in a non-line-of-sight (NLOS) multipath fading channel. By the central limit theorem, the superposition of many independent scattered paths produces in-phase and quadrature components that are approximately Gaussian with zero mean and equal variance, making the envelope Rayleigh.
The instantaneous power is exponentially distributed with mean : this is the basis of the exponential fading model used throughout wireless link budget analysis.
Definition: Ricean Distribution
Ricean Distribution
Let and be independent, where is the amplitude of a deterministic line-of-sight (LOS) component and is its phase. The envelope follows the Ricean (or Rician) distribution with PDF where is the modified Bessel function of the first kind of order zero.
The Ricean -factor is defined as the ratio of LOS power to scattered (diffuse) power.
Special cases:
- (): the Ricean reduces to the Rayleigh.
- : the channel becomes deterministic (AWGN).
The Ricean distribution models fading in channels with a dominant LOS path plus weaker scattered components β common in satellite links, millimeter-wave urban microcells, and short-range indoor channels. The -factor is often quoted in dB; typical values range from (moderate LOS) to (strong LOS).
Definition: Nakagami- Distribution
Nakagami- Distribution
The Nakagami- distribution has PDF where is the mean power and is the fading severity parameter (or shape parameter), defined by
Key moments:
- .
- .
Relation to gamma distribution: If , then the power .
Special cases:
- : Nakagami- reduces to Rayleigh (with ).
- : one-sided Gaussian.
- : deterministic (no fading).
- The Nakagami- can approximate the Ricean for appropriate .
The Nakagami- distribution was proposed by Minoru Nakagami in 1960 as a flexible model that fits empirical fading data better than Rayleigh or Ricean in many scenarios. Its main advantage is analytic tractability: the gamma-distributed power leads to closed-form expressions for error rates and outage probabilities with maximal-ratio combining (MRC).
Definition: Log-Normal Distribution
Log-Normal Distribution
A random variable is log-normally distributed if , i.e., where . The PDF is
Key moments:
- .
- .
In wireless, the parameter is often expressed in dB: . Typical values of for outdoor shadowing range from 4 to 12 dB.
The log-normal distribution models large-scale shadow fading (also called log-normal shadowing). As a signal propagates through the environment, it encounters many obstructions, each imposing a multiplicative attenuation. In dB, these attenuations become additive, and by the CLT the total path loss (in dB) is approximately Gaussian β hence the received power (in linear scale) is log-normal. This should be contrasted with small-scale fading (Rayleigh/Ricean), which operates on a much shorter spatial scale.
Definition: Moment-Generating Function (MGF)
Moment-Generating Function (MGF)
The moment-generating function of a random variable is defined for all (or ) for which the expectation exists.
Key properties:
- Moment extraction: The -th moment of is obtained by differentiating times at :
- Uniqueness: If exists in an open interval containing , it uniquely determines the distribution of . That is, for all in some open interval implies .
- Normalization: .
The MGF is a powerful tool because it converts sums of independent random variables into products (see the next theorem). However, the MGF may not exist for heavy-tailed distributions (e.g., Cauchy). The characteristic function , defined below, always exists and serves as a universal replacement.
Theorem: MGF of a Sum of Independent Random Variables
Let and be independent random variables whose MGFs and both exist in a neighborhood of . Then has MGF More generally, if are mutually independent,
Sums of independent random variables become products in the MGF domain. This is the probabilistic analog of the convolution theorem: the PDF of a sum is the convolution of the individual PDFs, and the MGF (a two-sided Laplace transform of the PDF) converts convolution to multiplication.
Write .
Use independence to factor the expectation.
Proof
n\blacksquare$
Definition: Characteristic Function
Characteristic Function
The characteristic function (CF) of a random variable is where .
Key properties:
- Always exists: Since , the integral converges absolutely for every and every distribution.
- Relation to MGF: whenever the MGF exists.
- Moment extraction: when the -th moment exists.
- Uniqueness: The CF uniquely determines the distribution (inversion theorem).
- Fourier transform of the PDF: , the Fourier transform of the density. Hence .
- Independence and sums: If and are independent, then .
The characteristic function is the Fourier transform of the PDF and always exists, unlike the MGF. In modern probability theory, the CF is the standard tool for proving limit theorems such as the central limit theorem (the CF of the standardized sum converges pointwise to , the CF of a standard normal).
Example: MGF of the Gaussian and Sum of Independent Gaussians
(a) Derive the MGF of .
(b) Use the result to prove that the sum of independent Gaussian random variables is itself Gaussian.
Part (a): Derive $M_X(s)$
$
Part (b): Sum of independent Gaussians
Let for , mutually independent. Define . By the product rule for MGFs of independent sums: This is the MGF of . By uniqueness of the MGF, The Gaussian family is therefore closed under addition of independents: the sum of independent Gaussians is Gaussian, with the means and variances adding.
Fading Distributions in Wireless Channels
| Property | Rayleigh | Ricean | Nakagami- | Log-Normal |
|---|---|---|---|---|
| Parameters | , (or -factor) | , | , | |
| Physical model | Many scattered paths, no LOS | LOS + scattered paths | General fading severity | Multiplicative obstructions (shadowing) |
| Mean power | ||||
| Power distribution | Exponential | Non-central (2 dof) | Gamma | Log-normal |
| When it applies | Dense urban NLOS, rich scattering | Suburban, satellite, mmWave with LOS | Flexible fit; indoor, land-mobile | Shadowing on dB-scale (large-scale fading) |
| Relation to Gaussian | Envelope of zero-mean complex Gaussian | Envelope of nonzero-mean complex Gaussian | Via gamma; approximates Ricean | (exponentiated Gaussian) |
| Closed-form BER? | Yes (simple) | Series involving | Yes (via gamma integral) | No (numerical / approximation) |
| Key reference | Rayleigh (1880) | Rice (1948) | Nakagami (1960) | Gudmundson (1991) |
From Gaussian to Rayleigh to Ricean: Fading Distribution Genesis
Why This Matters: Why Rayleigh Fading Arises in Non-LOS Channels
Consider a narrowband wireless channel where the transmitted signal reaches the receiver via independent scattered paths, with no dominant line-of-sight component. The received complex baseband signal is where and are the amplitude and phase of the -th path. If is large and the paths are independent with no single dominant term, then by the central limit theorem the real and imaginary parts of are each approximately Gaussian: and they are approximately independent (the phases being uniformly distributed ensures zero cross-correlation, and for Gaussian random variables, uncorrelated implies independent).
The envelope therefore follows the Rayleigh distribution. The instantaneous received power is exponentially distributed with mean , and in dB the power fluctuations are approximately dB around the local mean.
This model is remarkably accurate in dense urban environments (e.g., the classic Jakes model) and forms the baseline against which all wireless system designs are tested.
Historical Note: Lord Rayleigh and the Random Superposition of Waves
In 1880, John William Strutt, 3rd Baron Rayleigh, published "On the resultant of a large number of vibrations of the same pitch and of arbitrary phase" (Philosophical Magazine, vol. 10). He showed that when many sinusoidal waves of equal amplitude but random phases are superimposed, the amplitude of the resultant follows what we now call the Rayleigh distribution, and its intensity (squared amplitude) is exponentially distributed.
Rayleigh's original interest was in acoustics and optics β the scattering of sound and light. Nearly a century later, his result found its most influential application in wireless communications, where it describes the envelope fluctuations caused by multipath propagation. The fact that a model from 19th-century wave physics applies directly to 21st-century 5G channels illustrates the remarkable universality of the Gaussian mechanism: whenever many independent contributions add up, the CLT produces Gaussianity, and the envelope becomes Rayleigh.
Quick Check
Let and . Using the CDF method, what distribution does follow?
Exponential with rate
(chi-squared with 1 degree of freedom)
Rayleigh
By definition, is the distribution of the square of a single standard normal. Alternatively, by the CDF method: for , and differentiating yields , which is the PDF.
Quick Check
In a wireless channel, the Ricean -factor is measured to be . Which fading distribution does the channel follow?
Log-normal
Nakagami- with
Rayleigh
AWGN (no fading)
When , the LOS power , so there is no line-of-sight component. The Ricean distribution with reduces exactly to the Rayleigh distribution. This is the canonical non-LOS fading model.
Quick Check
Let . What is ?
The MGF of a Gaussian is . Setting gives (normalization). The first derivative at gives , and the second derivative gives .
Chi-Squared Distribution
The distribution of the sum of squares of independent standard normal random variables: . A special case of the gamma distribution with shape and scale . Mean , variance .
Rayleigh Distribution
The distribution of the envelope when are independent. PDF: for . Models small-scale fading in non-line-of-sight wireless channels.
Related: Rayleigh Distribution, Why Rayleigh Fading Arises in Non-LOS Channels, Lord Rayleigh and the Random Superposition of Waves
Ricean -Factor
The ratio of line-of-sight (LOS) power to diffuse (scattered) power in a Ricean fading channel. corresponds to Rayleigh fading (no LOS); corresponds to a purely deterministic channel. Typically expressed in dB.
Related: Ricean Distribution, Rayleigh Distribution
Nakagami- Distribution
A two-parameter distribution for fading envelopes, parameterized by the shape (fading severity) and mean power . Reduces to Rayleigh when . The power , yielding closed-form error-rate expressions.
Related: Nakagami- Distribution, Rayleigh Distribution
Moment-Generating Function (MGF)
For a random variable , the MGF is . It encodes all moments via , uniquely determines the distribution (when it exists in a neighborhood of ), and converts sums of independents to products: .
Related: Moment-Generating Function (MGF), MGF of a Sum of Independent Random Variables, Characteristic Function
Characteristic Function
The characteristic function of is , the Fourier transform of the PDF. Unlike the MGF, it always exists for every distribution. It uniquely determines the distribution and shares the product property for sums of independents: .
Related: Characteristic Function, Moment-Generating Function (MGF)
Common Mistake: Rayleigh Distribution vs. Rayleigh Quotient
Mistake:
"The Rayleigh distribution and the Rayleigh quotient are related because they are both named after Lord Rayleigh."
Correction:
Despite sharing a name, these are entirely different concepts:
-
The Rayleigh distribution is a probability distribution for the envelope of complex Gaussian noise: . It arises from Lord Rayleigh's work on random superposition of waves (1880).
-
The Rayleigh quotient is the ratio for a Hermitian matrix . It arises in eigenvalue problems and is used in beamforming optimization and principal component analysis.
Confusing the two β for instance, citing a "Rayleigh fading channel" when one means a "Rayleigh quotient optimization" β can lead to serious miscommunication in technical writing.
Key Takeaway
Every major fading distribution in wireless communications is derived from the Gaussian distribution via simple transformations:
- Rayleigh: envelope of a zero-mean complex Gaussian (, both components i.i.d. ).
- Ricean: envelope of a nonzero-mean complex Gaussian (add a deterministic LOS phasor).
- Chi-squared: sum of squared Gaussians ().
- Nakagami-: generalization whose power is gamma-distributed; encompasses Rayleigh () and approximates Ricean.
- Log-normal: exponentiated Gaussian (); models large-scale shadowing.
The CDF method and the monotone-transform formula are the two workhorses that connect these distributions back to the Gaussian. The moment-generating function and characteristic function then provide algebraic shortcuts β especially the product rule for sums of independents, which lets us prove closure properties (e.g., sum of Gaussians is Gaussian) without computing convolution integrals.
Understanding these derivations β rather than memorizing the PDFs β gives the engineer a unified mental model: Gaussian in, transformation out, fading distribution obtained.