Wald's Sequential Probability Ratio Test (SPRT)

Why Stop Early?

In the fixed-sample-size detection framework of Chapter 1, the statistician collects $n$ samples, forms the cumulative log-likelihood ratio $\ell^{(n)} = \sum_{i=1}^n \ell(y_i)$, and compares it to a threshold. The sample size $n$ is chosen in advance to meet target false-alarm and miss probabilities $(\alpha, \beta)$.

Two observations motivate a sequential approach. First, when the true hypothesis is "easy" β€” for example, the LLR increments are all strongly positive β€” the first $k$ samples may already settle the question, and the remaining $n - k$ observations are wasted. Second, when the true hypothesis is "hard" β€” the LLR walks near zero β€” any fixed $n$ chosen in advance may prove too small for reliable detection.

Wald's sequential probability ratio test (SPRT) resolves both issues. The sample size $N$ becomes a random variable, determined by the data: observe one sample at a time, update the cumulative LLR, and decide as soon as it crosses an upper or lower threshold. For a given target pair $(\alpha, \beta)$, the SPRT uses on average the smallest possible number of samples among all tests achieving those error levels β€” this is the Wald-Wolfowitz optimality theorem.

This section develops the SPRT from its threshold design through Wald's identity for the average sample number (ASN), and closes with its connection to early stopping in iterative decoders, where the decision statistic is monitored between iterations.

Definition: Sequential Probability Ratio Test (SPRT)

Let $Y_1, Y_2, \ldots$ be i.i.d. observations under either $\mathcal{H}_0$ (density $f_0$) or $\mathcal{H}_1$ (density $f_1$). Define the per-sample log-likelihood ratio $\ell(y) = \log \frac{f_1(y)}{f_0(y)}$ and the cumulative LLR $\ell^{(n)} = \sum_{i=1}^n \ell(Y_i)$.

Fix two thresholds $B < 0 < A$. The SPRT is the pair $(g_{\mathrm{SPRT}}, N)$ defined by the stopping rule

$$N = \inf\{n \geq 1 : \ell^{(n)} \notin (B, A)\}$$

and the terminal decision

$$g_{\mathrm{SPRT}}(Y_1, \ldots, Y_N) = \begin{cases} 1 & \text{if } \ell^{(N)} \geq A, \\ 0 & \text{if } \ell^{(N)} \leq B. \end{cases}$$

The test continues (takes another sample) while $\ell^{(n)} \in (B, A)$.

The thresholds $A$ and $B$ are the only design parameters. Choosing them properly yields a test with prescribed error probabilities β€” the content of Wald's inequalities below.

SPRT (Sequential Probability Ratio Test)

A sequential hypothesis test that accumulates log-likelihood ratios one sample at a time and stops as soon as the cumulative LLR exits the interval $(B, A)$. Optimal in ASN among tests with the same error probabilities.

Related: Average Sample Number (ASN), Wald's Identity

Average Sample Number (ASN)

The expected stopping time $\mathbb{E}_i[N]$ of a sequential test under hypothesis $\mathcal{H}_i$. Also called the expected sample size. For the SPRT it is computed via Wald's identity.

Related: SPRT (Sequential Probability Ratio Test)

Wald's Identity

For a random walk $S_n = \sum_{i=1}^n X_i$ with i.i.d. increments and a stopping time $N$ with $\mathbb{E}[N] < \infty$, $\mathbb{E}[S_N] = \mathbb{E}[N] \cdot \mathbb{E}[X_1]$.

Related: Average Sample Number (ASN)

Theorem: Wald's Threshold Approximation

Let $P_f = \Pr(g_{\mathrm{SPRT}} = 1 \mid \mathcal{H}_0)$ and $P_d = \Pr(g_{\mathrm{SPRT}} = 1 \mid \mathcal{H}_1)$ denote the false-alarm and detection probabilities of the SPRT with thresholds $(B, A)$. Ignoring the excess over the boundary at the stopping time,

$$e^{A} \approx \frac{P_d}{P_f}, \qquad e^{B} \approx \frac{1 - P_d}{1 - P_f}.$$

Consequently, to achieve targets $P_f \leq \alpha$ and $1 - P_d \leq \beta$, Wald's choice is

$$A = \log \frac{1 - \beta}{\alpha}, \qquad B = \log \frac{\beta}{1 - \alpha}.$$

The thresholds depend only on the target error probabilities, not on the densities $f_0, f_1$ or the SNR. Every SPRT targeting false-alarm $\alpha = 10^{-3}$ and miss $\beta = 10^{-3}$ uses the same thresholds $A \approx 6.9$, $B \approx -6.9$ nats, regardless of the signal model.
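In code, the threshold design is a one-liner each way (a minimal Python sketch; the function name is illustrative):

```python
import math

def wald_thresholds(alpha: float, beta: float) -> tuple[float, float]:
    """Wald's model-free SPRT thresholds, in nats."""
    A = math.log((1 - beta) / alpha)   # upper (decide H1) threshold
    B = math.log(beta / (1 - alpha))   # lower (decide H0) threshold
    return A, B

print(wald_thresholds(1e-3, 1e-3))  # (6.907..., -6.907...), as in the text
```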


Key Takeaway

The SPRT thresholds $A = \log\frac{1-\beta}{\alpha}$, $B = \log\frac{\beta}{1-\alpha}$ are model-free: they depend only on the target error probabilities, not on the densities $f_0, f_1$. This is the clean separation between detection requirements and signal structure that makes the SPRT a universal sequential framework.

Theorem: Wald's Identity

Let $X_1, X_2, \ldots$ be i.i.d. with $\mathbb{E}[|X_1|] < \infty$ and mean $\mu = \mathbb{E}[X_1]$, and let $S_n = \sum_{i=1}^n X_i$. If $N$ is a stopping time with $\mathbb{E}[N] < \infty$, then $\mathbb{E}[S_N] = \mu \cdot \mathbb{E}[N]$.

The expected "total distance" a random walk covers by a stopping time is the product of the expected number of steps and the expected step size. It is the stopping-time version of linearity of expectation.
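A quick Monte Carlo illustration of the identity, using a positive-drift Gaussian walk stopped on exit from an interval (an illustrative sketch; the numeric values are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
mu, A, B = 0.5, 6.907, -6.907          # drift and exit boundaries
sums, steps = [], []
for _ in range(2000):
    s, n = 0.0, 0
    while B < s < A:                   # SPRT-style stopping time
        s += rng.normal(mu, 1.0)       # i.i.d. increment with mean mu
        n += 1
    sums.append(s)
    steps.append(n)
print(np.mean(sums), mu * np.mean(steps))  # the two sides nearly coincide
```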


Theorem: Average Sample Number of the SPRT

Under hypothesis $\mathcal{H}_i$, let $D_i = \mathbb{E}_i[\ell(Y_1)]$ denote the expected per-sample LLR, so that $D_1 = D(f_1 \| f_0) > 0$ and $D_0 = -D(f_0 \| f_1) < 0$ (assuming $f_0 \neq f_1$). Neglecting the excess over the boundary, the SPRT with Wald thresholds satisfies

$$\mathbb{E}_0[N] \approx \frac{(1-\alpha) B + \alpha A}{D_0}, \qquad \mathbb{E}_1[N] \approx \frac{\beta B + (1-\beta) A}{D_1}.$$

Under $\mathcal{H}_1$ the LLR random walk drifts upward at rate $D(f_1 \| f_0)$ per step, so reaching $A$ takes about $A / D(f_1 \| f_0)$ steps. The exact formula adjusts for the rare case in which the walk exits through the wrong boundary.
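These two approximations are easy to tabulate (a minimal Python sketch; the drift values in the last line anticipate the Gaussian example below, where $D_1 = \mathrm{SNR}/2$):

```python
import math

def wald_asn(alpha: float, beta: float, D0: float, D1: float):
    """Wald's ASN approximations under H0 and H1 (requires D0 < 0 < D1)."""
    A = math.log((1 - beta) / alpha)
    B = math.log(beta / (1 - alpha))
    E0 = ((1 - alpha) * B + alpha * A) / D0
    E1 = (beta * B + (1 - beta) * A) / D1
    return E0, E1

# Gaussian shift-in-mean at 0 dB: D1 = SNR/2 = 0.5, D0 = -0.5
print(wald_asn(1e-3, 1e-3, -0.5, 0.5))  # about (13.8, 13.8) samples
```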


Example: SPRT for a Gaussian Shift-in-Mean

Under $\mathcal{H}_0$, $Y_i \sim \mathcal{N}(0, \sigma^2)$; under $\mathcal{H}_1$, $Y_i \sim \mathcal{N}(\mu, \sigma^2)$, with $\mu > 0$ known. Derive the SPRT explicitly and compute the ASN for $\alpha = \beta = 10^{-3}$ at $\mathrm{SNR} = \mu^2/\sigma^2 = 1$ (0 dB).
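A worked solution, combining the threshold and ASN theorems above. The per-sample LLR is
$$\ell(y) = \log\frac{f_1(y)}{f_0(y)} = \frac{\mu y}{\sigma^2} - \frac{\mu^2}{2\sigma^2},$$
so the drifts are $D_1 = \mathbb{E}_1[\ell(Y_1)] = \frac{\mu^2}{2\sigma^2} = \mathrm{SNR}/2$ and, by symmetry, $D_0 = -\mathrm{SNR}/2$. With $\alpha = \beta = 10^{-3}$, the Wald thresholds are $A = -B = \log 999 \approx 6.91$ nats, and
$$\mathbb{E}_1[N] \approx \frac{\beta B + (1-\beta)A}{D_1} = \frac{0.998 \times 6.91}{0.5} \approx 13.8$$
samples at 0 dB, with $\mathbb{E}_0[N]$ identical by symmetry.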

SPRT Sample Path and Decision Boundaries

Simulate sample paths of the cumulative LLR $\ell^{(n)}$ under $\mathcal{H}_0$ and $\mathcal{H}_1$ for a Gaussian shift-in-mean problem. The SPRT stops as soon as the path exits the strip $(B, A)$. Vary the SNR and the target error levels $(\alpha, \beta)$ to see how the thresholds move and how the stopping time changes.

Parameters: SNR $\mu^2/\sigma^2$ (default 0 dB), target false-alarm probability $\alpha$ (default $10^{-3}$), target miss probability $\beta$ (default $10^{-3}$).
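A minimal simulation sketch of such a sample path (Python; assumes the Gaussian shift-in-mean model with $\sigma = 1$ and the LLR increment derived in the example above):

```python
import math
import numpy as np

def sprt_path(snr_db: float, alpha: float, beta: float,
              h1: bool = True, seed: int = 0):
    """Simulate one SPRT sample path; return (decision, N, path)."""
    snr = 10 ** (snr_db / 10)
    mu = math.sqrt(snr)                  # take sigma = 1 without loss
    A = math.log((1 - beta) / alpha)
    B = math.log(beta / (1 - alpha))
    rng = np.random.default_rng(seed)
    ell, path = 0.0, [0.0]
    while B < ell < A:
        y = rng.normal(mu if h1 else 0.0, 1.0)
        ell += mu * y - mu**2 / 2        # per-sample LLR increment
        path.append(ell)
    return int(ell >= A), len(path) - 1, path

decision, N, _ = sprt_path(0.0, 1e-3, 1e-3, h1=True)
print(decision, N)  # typically decides H1 after roughly 14 samples at 0 dB
```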

Average Sample Number vs. SNR

Compare the SPRT's ASN to the fixed-sample-size Neyman-Pearson test's required sample count as SNR varies. The ASN curve reveals the Wald-Wolfowitz optimality in operational terms: roughly a factor of two to three reduction at the same error levels.

Parameters: $\alpha = \beta = 10^{-3}$.
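A sketch of the comparison (Python; uses the standard fixed-sample-size formula $n = (Q^{-1}(\alpha) + Q^{-1}(\beta))^2/\mathrm{SNR}$ for the Gaussian shift-in-mean, and Wald's ASN approximation from above):

```python
import math
from scipy.stats import norm

def np_fixed_n(alpha: float, beta: float, snr: float) -> float:
    """Fixed-sample-size Neyman-Pearson sample count, Gaussian shift-in-mean."""
    z = norm.ppf(1 - alpha) + norm.ppf(1 - beta)
    return z**2 / snr

def sprt_asn_h1(alpha: float, beta: float, snr: float) -> float:
    """Wald ASN under H1; drift D1 = SNR/2, excess over boundary ignored."""
    A = math.log((1 - beta) / alpha)
    B = math.log(beta / (1 - alpha))
    return (beta * B + (1 - beta) * A) / (snr / 2)

for snr in (0.25, 0.5, 1.0, 2.0):
    n_fix, n_seq = np_fixed_n(1e-3, 1e-3, snr), sprt_asn_h1(1e-3, 1e-3, snr)
    print(f"SNR={snr:4.2f}: fixed n={n_fix:5.1f}, ASN={n_seq:5.1f}, "
          f"ratio={n_fix / n_seq:.2f}")   # ratio near 2.8 at these targets
```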

Random Walk of the LLR and SPRT Boundaries

Animation of the cumulative LLR under $\mathcal{H}_1$, with the Wald boundaries $A$ and $B$ drawn as horizontal lines. Watch the drift, the boundary hit, and the resulting decision. Gaussian shift-in-mean, $\mathrm{SNR} = 0$ dB, $\alpha = \beta = 10^{-3}$.

Theorem: Wald-Wolfowitz Optimality of the SPRT

Among all sequential tests (stopping rule + terminal decision) of $\mathcal{H}_0$ vs. $\mathcal{H}_1$ based on i.i.d. observations with $\Pr(\text{reject } \mathcal{H}_0 \mid \mathcal{H}_0) \leq \alpha$ and $\Pr(\text{accept } \mathcal{H}_0 \mid \mathcal{H}_1) \leq \beta$, the SPRT with Wald thresholds minimizes both $\mathbb{E}_0[N]$ and $\mathbb{E}_1[N]$ simultaneously.

No other test β€” deterministic or randomized, fixed or variable sample size β€” with the same error guarantees can terminate sooner on average under either hypothesis. The SPRT is a rare example of a doubly-optimal procedure.


SPRT Implementation

Complexity: $O(1)$ per sample; expected total $O(\mathbb{E}[N])$
Input: densities $f_0, f_1$; targets $\alpha, \beta \in (0, 1/2)$
Output: decision $d \in \{0, 1\}$ and stopping time $N$
1. $A \leftarrow \log \frac{1-\beta}{\alpha}$
2. $B \leftarrow \log \frac{\beta}{1-\alpha}$
3. $\ell \leftarrow 0$
4. $n \leftarrow 0$
5. loop
6.   $n \leftarrow n + 1$
7.   observe $Y_n$
8.   $\ell \leftarrow \ell + \log(f_1(Y_n) / f_0(Y_n))$
9.   if $\ell \geq A$ then return $(d, N) \leftarrow (1, n)$
10.  if $\ell \leq B$ then return $(d, N) \leftarrow (0, n)$
11. end loop

Truncation (capping $n$ at $N_{\max}$) is common in practice to bound worst-case latency. The resulting test is no longer strictly optimal but retains the ASN advantage for most sample paths.
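A direct Python transcription of the pseudocode, with optional truncation folded in (a sketch; the iterator-based input interface is an illustrative choice):

```python
import math
from typing import Iterator, Optional, Tuple

import numpy as np

def sprt(llr_stream: Iterator[float], alpha: float, beta: float,
         n_max: Optional[int] = None) -> Tuple[int, int]:
    """Run the SPRT on a stream of per-sample LLR increments.

    Returns (decision, N). With n_max set, the test is truncated:
    at n = n_max the sign of the cumulative LLR forces the decision.
    """
    A = math.log((1 - beta) / alpha)
    B = math.log(beta / (1 - alpha))
    ell, n = 0.0, 0
    for llr in llr_stream:
        n += 1
        ell += llr
        if ell >= A:
            return 1, n
        if ell <= B:
            return 0, n
        if n_max is not None and n >= n_max:
            return int(ell >= 0), n   # midpoint-threshold truncation
    raise RuntimeError("LLR stream exhausted before a decision")

# Example: Gaussian shift-in-mean increments (mu = sigma = 1) under H1,
# so each LLR increment is y - 1/2
rng = np.random.default_rng(0)
stream = (y - 0.5 for y in iter(lambda: rng.normal(1.0, 1.0), None))
print(sprt(stream, 1e-3, 1e-3))  # e.g., (1, ~14)
```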

Common Mistake: Excess Over the Boundary

Mistake:

Students apply the Wald threshold formulas and assume the error probabilities are exactly $\alpha$ and $\beta$.

Correction:

Wald's formulas ignore the excess $\ell^{(N)} - A$ (or $B - \ell^{(N)}$) at the stopping time. The actual errors are typically smaller than $(\alpha, \beta)$ β€” the SPRT over-delivers on the error guarantees. For continuous increments with bounded variance, the overshoot adds an $O(1)$ excess in nats at each boundary, so the effective exit levels lie outside $(B, A)$ and the realized errors fall below target. Siegmund's corrected asymptotic expansions account for this.
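A small simulation makes the over-delivery visible (an illustrative sketch for the Gaussian shift-in-mean with $\mu = \sigma = 1$, with relaxed targets so the false-alarm event is easy to count):

```python
import math
import numpy as np

alpha = beta = 1e-2
A = math.log((1 - beta) / alpha)
B = math.log(beta / (1 - alpha))
rng = np.random.default_rng(1)
trials, false_alarms = 50_000, 0
for _ in range(trials):
    ell = 0.0
    while B < ell < A:
        ell += rng.normal(0.0, 1.0) - 0.5   # LLR increment under H0
    false_alarms += ell >= A
print(false_alarms / trials)  # typically lands below the 1e-2 target
```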

Common Mistake: Non-i.i.d. Observations Break the SPRT

Mistake:

Applying the SPRT to dependent observations (e.g., correlated noise, adaptive measurements) using the same thresholds.

Correction:

The Wald inequalities rely on the martingale property of the likelihood ratio under $\mathcal{H}_0$, which in turn requires the per-sample LLRs to form a martingale difference sequence. For correlated data, the LLR must be computed for the joint density, and the thresholds must be reinterpreted. A mechanical i.i.d. implementation can produce actual errors orders of magnitude above target.

Historical Note: Wald and Wartime Statistics

1943-1947

Abraham Wald developed the SPRT during World War II as part of the Statistical Research Group at Columbia University (1943). The problem that motivated the work was munitions quality control: how to decide whether a batch of shells met specifications while testing as few shells as possible. Sequential testing had been informally proposed by others (notably Dodge and Romig in 1929), but Wald established the rigorous optimality theory.

The SPRT was classified until 1947 because of its potential military applications. When Wald published "Sequential Analysis" that year, it became the founding text of sequential statistical inference. Wald died in a plane crash in 1950 at age 48, cutting short one of the most prolific careers in 20th-century statistics.

Historical Note: The Wald-Wolfowitz Optimality Theorem

1948

In 1948, Wald and his collaborator Jacob Wolfowitz proved that the SPRT minimizes both $\mathbb{E}_0[N]$ and $\mathbb{E}_1[N]$ among all sequential tests with given error probabilities β€” a doubly-optimal property that has no analogue in the fixed-sample-size theory, where the Neyman-Pearson lemma only minimizes one error probability for a fixed $n$.

The proof uses a Bayesian reduction that was considered unusual at the time, since many statisticians viewed Bayes and frequentist methods as philosophically incompatible. Wald's willingness to use Bayesian tools instrumentally anticipated the modern decision-theoretic viewpoint.

Why This Matters: Early Stopping in Iterative Decoders

Turbo and LDPC decoders execute a fixed maximum number of iterations (typically 8-50) regardless of channel quality. In good channels most codewords decode correctly in 3-5 iterations; the remainder of the iteration budget is wasted power and latency.

Early-stopping criteria are sequential tests: monitor a decision statistic between iterations and stop as soon as confidence exceeds a threshold. Common statistics include the minimum absolute extrinsic LLR, the change in the hard-decision vector, and the syndrome check; a sketch of the syndrome-check rule follows below. Each is a proxy for the cumulative LLR of an SPRT on the hypothesis "the current hard decision is the true codeword." Well-designed early stopping achieves 50-70% iteration savings at no BER cost in the waterfall regime.
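A minimal sketch of the syndrome-check rule (Python; the decoder interface `update_fn` and the parity-check matrix `H` are illustrative assumptions, not any standard's API):

```python
import numpy as np

def decode_with_early_stop(H, llr, update_fn, max_iters=50):
    """Iterative soft decoding with a syndrome-check stopping rule.

    H         : binary parity-check matrix, shape (m, n), dtype uint8
    llr       : length-n array of bit LLRs (positive favors bit = 0)
    update_fn : one decoder iteration, mapping llr -> refined llr
    Returns (hard_decision, iterations_used).
    """
    for it in range(1, max_iters + 1):
        llr = update_fn(llr)
        hard = (llr < 0).astype(np.uint8)   # hard decision per bit
        if not ((H @ hard) % 2).any():      # zero syndrome: valid codeword
            return hard, it                 # stop early
    return hard, max_iters
```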

⚠️ Engineering Note

Truncated SPRT in Wireless Systems

Pure SPRT has unbounded stopping time. Any real receiver must impose a maximum sample count $N_{\max}$ β€” a hard deadline driven by latency budgets (e.g., the LDPC decoder must produce a bit within one OFDM symbol). The truncated SPRT forces a decision at $n = N_{\max}$ by comparing $\ell^{(N_{\max})}$ to a midpoint threshold (typically 0) if the walk is still in the continuation region.

The truncation introduces a small additional error term that must be budgeted alongside $(\alpha, \beta)$. For $N_{\max} \geq 2\,\mathbb{E}_1[N]$ the effect is usually negligible. For tight deadlines, the thresholds must be redesigned via direct numerical optimization over the truncated problem.
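One quick way to budget the truncation error is simulation (an illustrative, self-contained sketch for the Gaussian shift-in-mean at 0 dB):

```python
import math
import numpy as np

def truncated_miss_rate(alpha, beta, snr, n_max, trials=20_000, seed=2):
    """Estimate the H1 miss rate of a truncated Gaussian SPRT by simulation."""
    mu = math.sqrt(snr)
    A = math.log((1 - beta) / alpha)
    B = math.log(beta / (1 - alpha))
    rng = np.random.default_rng(seed)
    misses = 0
    for _ in range(trials):
        ell, n = 0.0, 0
        while B < ell < A and n < n_max:
            ell += mu * rng.normal(mu, 1.0) - mu**2 / 2
            n += 1
        # Boundary exit decides normally; otherwise the deadline forces
        # the midpoint-threshold decision
        decision = int(ell >= A) if not (B < ell < A) else int(ell >= 0)
        misses += decision == 0
    return misses / trials

# Deadline of about 2 E_1[N] (~28 samples at 0 dB); compare to the target
print(truncated_miss_rate(1e-3, 1e-3, 1.0, n_max=28))
```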

Practical Constraints
β€’ 3GPP NR turbo/LDPC decoders: $N_{\max}$ = 8-12 iterations per TTI
β€’ LDPC hardware decoders allocate an iteration budget in clock cycles
β€’ Early termination contributes directly to UE battery life

πŸ“‹ Ref: 3GPP TS 38.212, Section 5.3.2 (LDPC base graphs)

Quick Check

For a sequential test with target errors $\alpha = 10^{-2}$, $\beta = 10^{-2}$, what is $A$ (in nats)?

$\log(0.99/0.01) \approx 4.60$

$\log(0.01) \approx -4.60$

$\log(0.01/0.99) \approx -4.60$

$\log(100) \approx 4.60$ (same as option 1, but coincidental)

Quick Check

In the Gaussian shift-in-mean SPRT with $\alpha = \beta$, how does the ASN under $\mathcal{H}_1$ scale with SNR in the low-SNR regime?

$\mathbb{E}_1[N] \propto 1/\mathrm{SNR}$

$\mathbb{E}_1[N] \propto 1/\sqrt{\mathrm{SNR}}$

$\mathbb{E}_1[N] \propto 1/\mathrm{SNR}^2$

$\mathbb{E}_1[N]$ does not depend on SNR

Quick Check

Which statement about the Wald-Wolfowitz theorem is correct?

The SPRT minimizes $\mathbb{E}_0[N]$ only.

The SPRT minimizes $\mathbb{E}_1[N]$ only.

The SPRT minimizes both $\mathbb{E}_0[N]$ and $\mathbb{E}_1[N]$ among tests with the same error levels.

The SPRT minimizes $\max(\mathbb{E}_0[N], \mathbb{E}_1[N])$ only.