Sufficient Statistics for Detection
Why sufficient statistics matter
We have now seen the matched filter three times --- as LRT, as SNR-maximising linear filter, as continuous-time projection. The point is that every one of these derivations collapsed the full observation (or ) into a single scalar statistic . That compression is not accidental: is a sufficient statistic for the detection problem. Once we have it, the raw observation carries no additional information about which hypothesis is true. This section formalises sufficiency, states the Fisher--Neyman factorisation theorem, and uses it to explain why signal-space receivers for digital modulation reduce a waveform to its finite-dimensional projection.
Definition: Sufficient Statistic
Sufficient Statistic
Let be an observation with density depending on a parameter (here indexes the hypothesis). A statistic is sufficient for if the conditional distribution of given does not depend on :
Theorem: Fisher--Neyman Factorisation
A statistic is sufficient for if and only if the density admits the factorisation where depends on only through and does not depend on .
Any dependence on enters only through --- so captures all the parameter-relevant information.
Sufficient direction ($\Leftarrow$): factorisation implies sufficiency
Suppose . The marginal density of is obtained by integrating over the level set : The conditional density is then which is independent of .
Necessary direction ($\Rightarrow$): sufficiency implies factorisation
If is sufficient, write . Set and . By the sufficiency assumption is free of , which gives the factorisation.
Consequence for hypothesis testing
In binary testing the parameter is the hypothesis index . The likelihood ratio is then so depends on only through . Any Bayes, Neyman--Pearson, or ML decision rule based on can equivalently be computed from alone.
Example: The Matched-Filter Output is Sufficient for Detection in AWGN
Show that for the binary problem versus with , the matched-filter statistic is sufficient for the hypothesis index.
Write the density under each hypothesis
for .
Expand the squared norm
. Substitute into the density:
Identify $g$ and $h$
Let . The second exponential depends on only through , and the prefactor does not depend on . The factorisation holds with and .
Conclude via Fisher--Neyman
By the Fisher--Neyman theorem, is sufficient for the hypothesis index. No further processing of can help: the matched-filter output is all that is needed for any Bayes, Neyman--Pearson, or ML decision.
Dimensionality reduction for free
In the preceding example, the observation lives in but the sufficient statistic is a scalar. That collapse --- from dimensions to --- is not a numerical trick; it is a structural fact about the problem. Sufficient statistics pinpoint the minimum dimensionality needed for optimal inference. When we move to -ary hypothesis testing in Chapter 3, the sufficient statistic becomes a vector of projections. When we move to parameter estimation in Part II, sufficient statistics tell us how many numbers we need to keep from a dataset of size .
Theorem: Sufficiency of Signal-Space Projections for Waveform Detection
Consider the -ary detection problem in continuous-time AWGN: , , , with white Gaussian noise of PSD . Let with be an orthonormal basis of in . The vector of projections with components is a sufficient statistic for the hypothesis index.
Decompose $y(t)$ into in-subspace and out-of-subspace parts
Write , where lies in the orthogonal complement of . Since each , the signal contributes zero to : depends only on and is independent of .
Independence of $Y_{\mathrm{proj}}$ and $y_\perp$
For white Gaussian noise, projections onto orthonormal functions are independent Gaussian random variables (see FSP Ch. 14). Thus and are independent, and 's distribution does not depend on .
Conditional density is free of the hypothesis
The density of given is the density of --- independent of . By definition, is sufficient for the hypothesis.
Key Takeaway
The Gram--Schmidt-constructed projections form a sufficient statistic for -ary signal detection in AWGN. This is why every digital receiver in Chapters 8--10 of the telecom book is drawn as correlator bank + minimum distance decoder: correlation extracts the sufficient statistic, and the remaining scalar noise outside the signal subspace is discarded without loss.
Visualising the Sufficient-Statistic Collapse
A high-dimensional observation vector projected onto the signal subspace. The perpendicular component is noise-only and carries no information about the hypothesis.
Parameters
Why This Matters: From Sufficiency to the MIMO Receiver
The sufficiency argument generalises directly to MIMO receivers: the matched-filter bank collects sufficient statistics for detecting the symbol vector, and everything downstream (ZF, MMSE, sphere decoding) operates on these projected observations. Chapter 15 of the telecom book builds on this fact.
Common Mistake: Sufficiency can fail when parameters are unknown
Mistake:
Assuming that the matched-filter output is still sufficient when the signal amplitude is unknown.
Correction:
When is replaced by with an unknown parameter, the sufficient statistic must carry enough information to infer as well --- typically for Gaussian noise. The GLRT from §2 is exactly the construction that uses this larger sufficient statistic correctly.
Quick Check
Why is the inner product sufficient for detecting a known signal in white Gaussian noise?
Because it has maximum variance among all linear statistics.
Because the likelihood ratio depends on only through this inner product (Fisher--Neyman).
Because the component of perpendicular to is always zero.
Because the noise is Gaussian.
The density factorises with depending only on the inner product.
Sufficient statistic
A function of the observations such that the conditional distribution of given is free of the parameter being inferred. Sufficient statistics preserve all information about the parameter while reducing dimensionality.
Related: Fisher--Neyman Factorisation, Minimal Sufficient Statistic
Subspace Matched Filters for Joint Sensing and Communication
The sufficient-statistic view developed here extends naturally to integrated sensing and communication (ISAC) systems, where the same waveform must carry information and probe the environment. The CommIT group has shown that the optimal ISAC receiver decomposes the observation into orthogonal subspaces carrying (i) the communication payload and (ii) the sensing parameters, with each subspace admitting its own matched filter. The Fisher--Neyman machinery from this section is the formal underpinning of that decomposition.
Sufficiency determines ADC and sampling requirements
In practice sufficiency guides system design: you only need to sample and digitise in the signal subspace. For a PAM/QAM receiver with symbols on pulse shape , a single matched-filter output per symbol is sufficient --- you do not need to oversample and then post-process. This is why real receivers use symbol-rate sampling after the matched filter, cutting the ADC data rate to the signal bandwidth.