Maximum-Likelihood Sequence Estimation
Optimal Detection on ISI Channels
Both linear equalizers and the DFE make decisions symbol by symbol, without considering the entire received sequence. The optimal approach --- maximum-likelihood sequence estimation (MLSE) --- searches over all possible transmitted sequences to find the one most likely to have produced the observed output. MLSE is implemented efficiently using the Viterbi algorithm, the same dynamic programming technique used for convolutional code decoding. The price is exponential complexity in channel memory.
Definition: Maximum-Likelihood Sequence Estimation (MLSE)
Maximum-Likelihood Sequence Estimation (MLSE)
Given a received sequence and channel , the MLSE detector finds
For AWGN, this is equivalent to minimising the sequence metric:
The minimisation is over all possible symbol sequences, where is the alphabet size and is the sequence length. Direct enumeration is infeasible, but the ISI channel has a trellis structure that enables the Viterbi algorithm.
Definition: Branch Metric
Branch Metric
In the Viterbi algorithm for MLSE, the branch metric at time for a state transition from to (corresponding to input symbol ) is
The path metric is the cumulative sum of branch metrics along a path through the trellis. The Viterbi algorithm retains only the minimum-metric survivor path entering each state.
Theorem: Optimality of MLSE
For an ISI channel with AWGN, the MLSE detector minimises the sequence error probability among all detectors. The pairwise error probability between sequences and is
where is the channel convolution matrix. The performance is dominated by the minimum distance between valid sequences:
MLSE is optimal because it considers the entire received sequence jointly, exploiting the trellis structure of ISI. The Viterbi algorithm makes this tractable by discarding suboptimal paths at each trellis stage. The free distance plays the same role as in memoryless detection --- it determines the asymptotic BER.
Likelihood function
Since i.i.d., the log-likelihood is
Maximising this is equivalent to minimising the squared Euclidean distance .
Pairwise error probability
The detector chooses over when . Since , this simplifies to a Gaussian tail probability:
where is the error sequence.
Viterbi Algorithm for ISI Channels
Complexity: Time: per sequence, where is the number of trellis states and transitions leave each state. Space: for storing survivor paths. For binary signalling () and channel memory : states, branch metric computations per time step.The complexity grows exponentially with channel memory . For --, the Viterbi algorithm becomes impractical and reduced-state approaches (RSSE, delayed decision-feedback sequence estimation) are needed.
BER Comparison: Linear EQ vs. DFE vs. MLSE
Compare the BER performance of four receivers on a multipath channel: no equalization, linear MMSE, MMSE-DFE, and MLSE. The matched-filter bound is shown as a reference.
Parameters
Viterbi Algorithm Step by Step
Example: MLSE via Viterbi on a Two-Tap Channel
Apply the Viterbi algorithm to detect the BPSK sequence transmitted over the channel , given the received samples . Assume zero initial state and no noise.
Trellis structure
With and BPSK (), the trellis has states: (previous symbol ) and (previous symbol ). Each state has 2 outgoing transitions.
Time $k = 0$
Starting from a known initial state (assume , so we start in ).
- Transition to (): ,
- Transition to (): ,
,
Time $k = 1$
From ():
- To (): ,
- To (): ,
From ():
- To (): ,
- To (): ,
Survivors: (via ), (via )
Traceback and decision
Continuing similarly for and tracing back from the minimum final metric, the detected sequence is .
The Viterbi algorithm efficiently finds this without checking all possible sequences.
Quick Check
For a channel with memory and 4-QAM modulation (), how many trellis states does the Viterbi MLSE detector have?
states
states
states
states
Correct. The trellis state is defined by the previous symbols, each from an alphabet of size , giving states.
Why This Matters: MLSE in GSM
The GSM cellular standard specifies a channel equalizer based on the Viterbi algorithm. The GSM channel is modelled with a typical memory of -- symbol periods. With GMSK modulation (approximately binary), the trellis has states --- computationally feasible for a mobile handset. Each GSM burst includes a 26-bit training sequence (midamble) used to estimate the channel taps before running the Viterbi equalizer. This combination of training-based channel estimation and MLSE equalization was a key innovation that enabled reliable communication over the highly dispersive urban radio channel.
See full treatment in Chapter 14
Historical Note: Viterbi Algorithm
1967--1974Andrew Viterbi published his algorithm in 1967 for decoding convolutional codes, but it was soon recognised as a general solution for maximum-likelihood detection on any system with a finite-state trellis representation. Forney (1972) showed that the ISI channel has a natural trellis structure and that the Viterbi algorithm provides MLSE. Ungerboeck (1974) combined this with adaptive channel estimation, creating the first practical MLSE receiver for data modems. The Viterbi algorithm remains one of the most widely used algorithms in communications, with applications in equalization, decoding, speech recognition, and bioinformatics.
MLSE in Wireless Standards β Where It Works and Where It Fails
MLSE via the Viterbi algorithm has been deployed in real systems, but only when the state count is manageable:
- GSM (2G): GMSK modulation (), gives states. Feasible on 1990s hardware. Every GSM phone contains a Viterbi equalizer.
- EDGE (2.5G): 8-PSK (), gives states. Reduced-state approaches (M-BCJR, DDFSE) are used to bring complexity to effective states.
- 3G WCDMA: Replaced equalization with RAKE receivers and later chip-level equalizers (linear/DFE), avoiding MLSE entirely.
- 4G LTE / 5G NR: OFDM eliminates the need for time-domain equalization. The only "equalization" is per-subcarrier MMSE.
- Practical limit: states is the rough boundary for real-time MLSE on current hardware. Beyond this, reduced-state or turbo equalization is required.
- β’
Real-time MLSE feasible for trellis states
- β’
GSM: 32 states (GMSK, ); EDGE: reduced from 4096 to ~128 effective states
- β’
OFDM in 4G/5G eliminates the need for MLSE entirely
Key Takeaway
MLSE is the optimal receiver for ISI channels but its complexity grows as β exponential in channel memory. This makes it practical only for low-order modulation on short channels (e.g., GSM). For modern wideband systems, OFDM avoids the problem entirely by converting the frequency-selective channel into flat subchannels.
Maximum-Likelihood Sequence Estimation (MLSE)
An optimal detection strategy that finds the transmitted symbol sequence maximising the likelihood of the entire received sequence, implemented via the Viterbi algorithm on the channel trellis.
Related: Viterbi Algorithm, Equalization
Viterbi Algorithm
A dynamic programming algorithm that finds the minimum-cost path through a trellis by retaining only one survivor path per state at each time step.
Branch Metric
The cost assigned to a single state transition in the Viterbi trellis, typically the squared Euclidean distance between the received sample and the predicted output for that transition.
Related: Viterbi Algorithm, Maximum-Likelihood Sequence Estimation (MLSE)