Composite Hypotheses and the GLRT

When the Signal Is Only Partially Known

In practice the received signal is rarely known exactly. A radar return has an unknown amplitude set by the target's radar cross-section; a carrier arrives with an unknown phase; a pulse is delayed by an unknown propagation time. We must detect whether something is there even though what it looks like lies in a parametric family. This is the domain of composite hypothesis testing, for which the generalised likelihood ratio test (GLRT) is the workhorse.

Definition:

Composite Hypothesis Testing

Let {p(y;θ)}θΘ\{p(\mathbf{y};\boldsymbol{\theta})\}_{\boldsymbol{\theta}\in\Theta} be a family of densities indexed by a parameter vector θ\boldsymbol{\theta}. A composite hypothesis testing problem is a pair

H0:θΘ0,H1:θΘ1,\mathcal{H}_0: \boldsymbol{\theta} \in \Theta_0, \qquad \mathcal{H}_1: \boldsymbol{\theta} \in \Theta_1,

where Θ0,Θ1\Theta_0, \Theta_1 are disjoint subsets of Θ\Theta. When at least one Θj\Theta_j contains more than a single point, the hypothesis is called composite; otherwise simple.

The LRT of Chapter 1 is not directly applicable: there is no single density against which to form a ratio. We need a principled way to collapse the parameter uncertainty.

Definition:

Generalised Likelihood Ratio Test (GLRT)

The GLRT replaces the unknown parameters under each hypothesis with their maximum-likelihood estimates and forms the ratio

L~(y)  =  maxθΘ1p(y;θ)maxθΘ0p(y;θ).\widetilde{L}(\mathbf{y}) \;=\; \frac{\max_{\boldsymbol{\theta}\in\Theta_1} p(\mathbf{y};\boldsymbol{\theta})}{\max_{\boldsymbol{\theta}\in\Theta_0} p(\mathbf{y};\boldsymbol{\theta})}.

The GLRT decides H1\mathcal{H}_1 if L~(y)>η\widetilde{L}(\mathbf{y}) > \eta for a threshold η\eta chosen to achieve a prescribed false-alarm rate.

Unlike the Neyman--Pearson LRT, the GLRT is not guaranteed to be uniformly most powerful. It is, however, (i) easily derived whenever the MLE is tractable, (ii) asymptotically optimal under mild regularity, and (iii) invariant under reparametrisation --- which is why it dominates engineering practice.

Definition:

Non-Coherent Detection

Detection of a signal whose carrier phase is unknown and treated as a nuisance parameter is called non-coherent detection. Formally, let s(t;ϕ)=a(t)cos(2πfct+ϕ)s(t; \phi) = a(t) \cos(2\pi f_c t + \phi) with ϕ[0,2π)\phi \in [0, 2\pi) unknown uniform. The GLRT over ϕ\phi yields the envelope detector.

Theorem: GLRT for Unknown-Amplitude Signal in AWGN

Consider the composite hypothesis

H0:y=w,H1:y=As+w,AR{0}\mathcal{H}_0: \mathbf{y} = \mathbf{w}, \qquad \mathcal{H}_1: \mathbf{y} = A\,\mathbf{s} + \mathbf{w}, \quad A \in \mathbb{R}\setminus\{0\}

with wN(0,σ2I)\mathbf{w}\sim\mathcal{N}(\mathbf{0},\sigma^2\mathbf{I}) and known unit-energy template s\mathbf{s}. The GLRT statistic is

~(y)  =  (sTy)22σ2,\widetilde{\ell}(\mathbf{y}) \;=\; \frac{(\mathbf{s}^{\mathsf{T}}\mathbf{y})^2}{2\sigma^2},

and the test decides H1\mathcal{H}_1 iff sTy>γ|\mathbf{s}^{\mathsf{T}}\mathbf{y}| > \gamma. Under H0\mathcal{H}_0, 2~(y)χ122\widetilde{\ell}(\mathbf{y}) \sim \chi_1^2; under H1\mathcal{H}_1 it is non-central χ12\chi_1^2 with non-centrality A2/σ2A^2/\sigma^2.

Without knowing the sign of AA, the detector cannot meaningfully compare the correlator to a single threshold; it compares its magnitude, yielding a two-sided test.

Theorem: GLRT for Unknown-Phase Signal Yields the Envelope Detector

Consider a signal with two known real quadrature components sI,sQRn\mathbf{s}_I, \mathbf{s}_Q \in \mathbb{R}^n of equal norm sI=sQ=Es/2\|\mathbf{s}_I\| = \|\mathbf{s}_Q\| = \sqrt{E_s/2} and sIsQ\mathbf{s}_I \perp \mathbf{s}_Q, parameterised by an unknown phase ϕ[0,2π)\phi \in [0, 2\pi):

H1:y  =  cosϕsI+sinϕsQ+w.\mathcal{H}_1: \mathbf{y} \;=\; \cos\phi\,\mathbf{s}_I + \sin\phi\,\mathbf{s}_Q + \mathbf{w}.

The GLRT statistic is the envelope

R(y)  =  (sITy)2+(sQTy)2.R(\mathbf{y}) \;=\; \sqrt{(\mathbf{s}_I^{\mathsf{T}}\mathbf{y})^2 + (\mathbf{s}_Q^{\mathsf{T}}\mathbf{y})^2}.

Without phase information, the detector cannot distinguish II from QQ branches. The magnitude I2+Q2\sqrt{I^2 + Q^2} is phase-invariant.

Example: Cost of Unknown Amplitude

At PF=102P_F = 10^{-2} and Es/N0=5E_s/N_0 = 5 dB, compare the detection probability of (i) the clairvoyant LRT that knows A=1A = 1 exactly and (ii) the GLRT that allows AA to take any sign.

Example: Matched-Subspace GLRT

Suppose H1:y=Sα+w\mathcal{H}_1: \mathbf{y} = \mathbf{S}\boldsymbol{\alpha} + \mathbf{w} where SRn×k\mathbf{S}\in\mathbb{R}^{n\times k} has known orthonormal columns and αRk\boldsymbol{\alpha}\in\mathbb{R}^k is unknown. Derive the GLRT.

GLRT vs. LRT for Unknown Amplitude

Detection probability of the clairvoyant LRT, a mismatched LRT that assumes the wrong amplitude A0=1A_0 = 1, and the two-sided GLRT, as Es/N0E_s/N_0 varies. Vary β\beta to see the LRT's vulnerability to amplitude mismatch --- something the GLRT avoids by construction.

Parameters
0.01
1
32

Common Mistake: GLRT Is Not Always UMP

Mistake:

"The GLRT is derived from the LRT, so it must be uniformly most powerful for every composite hypothesis."

Correction:

The GLRT is a heuristic. For a two-sided Gaussian mean test (A0A \lessgtr 0), no UMP test even exists, so no procedure can dominate the GLRT uniformly --- but one can sometimes design locally most powerful tests that outperform the GLRT at small signal levels. Still, the GLRT is the engineering default because its derivation is mechanical and its asymptotic efficiency is well understood.

Quick Check

The envelope detector is the GLRT for which problem?

Known signal in AWGN

Signal with unknown carrier phase

Signal with unknown delay only

Signal with unknown amplitude and sign

Quick Check

Under regularity conditions, as nn\to\infty the GLRT statistic 2logL~(y)2\log\widetilde{L}(\mathbf{y}) under H0\mathcal{H}_0 converges in distribution to:

A Gaussian N(0,1)\mathcal{N}(0,1).

A chi-squared χk2\chi_k^2 with k=dimΘ1dimΘ0k = \dim\Theta_1 - \dim\Theta_0.

A non-central chi-squared.

The exponential distribution.

Operational Interpretation

Every practical detector that claims to "adapt" to an unknown nuisance parameter --- adaptive matched filter, CFAR, GPS acquisition with Doppler search, cell-search in LTE --- is a disguised GLRT. The max-over-parameters step is typically implemented as a bank of parallel correlators followed by a maximum operator, or (in the continuous case) a peak-finder on the correlator output. Seen through this lens the detector's structural complexity is proportional to the dimension and fineness of the nuisance-parameter search grid.