The Low-Complexity DD (LCD) Detector

Linear Detection With Cross-Domain Insight

MMSE (Section 2) is fast but diversity-1. MP (Section 3) is near-ML but requires iterative computation. The low-complexity DD (LCD) detector bridges the two: linear complexity O(MNlog⁑(MN))O(MN \log(MN)), but with a cross-domain structure that exploits both the DD channel matrix and its TF-domain counterpart. The result is linear-complexity detection with BER close to the ML benchmark β€” particularly at moderate SNR where MMSE fails.

The LCD detector is the practical workhorse of high-efficiency OTFS receivers. It is deployed in recent OTFS modem designs and is the detector of choice when both throughput and latency matter.

Definition:

Low-Complexity DD (LCD) Detector

The LCD detector combines cross-domain linear equalization with iterative soft quantization:

  1. Apply MMSE in the DD domain (Section 2) to obtain a soft estimate x^(0)\hat{\mathbf{x}}^{(0)}.
  2. Iterate TiterT_{\text{iter}} times:
    • Subtract the soft-estimated ISI from the received DD grid: r(t)=yDDβˆ’HDD x^(tβˆ’1)\mathbf{r}^{(t)} = \mathbf{y}_{DD} - \mathbf{H}_{DD}\,\hat{\mathbf{x}}^{(t-1)}.
    • Apply MMSE to the residual: Ξ”x(t)=MMSE(r(t))\Delta\mathbf{x}^{(t)} = \text{MMSE}(\mathbf{r}^{(t)}).
    • Update: x^(t)=x^(tβˆ’1)+Ξ”x(t)\hat{\mathbf{x}}^{(t)} = \hat{\mathbf{x}}^{(t-1)} + \Delta\mathbf{x}^{(t)}.
    • Soft-quantize: x^(t)←Qsoft(x^(t))\hat{\mathbf{x}}^{(t)} \leftarrow Q_{\text{soft}}(\hat{\mathbf{x}}^{(t)}).
  3. Hard-quantize for final output.

The iterative refinement recovers diversity loss of the single-pass MMSE. The complexity per iteration is O(MNlog⁑(MN))O(MN \log(MN)) (same as MMSE), and convergence is typically within 2-3 iterations.

Theorem: LCD Convergence and Performance

The LCD detector converges within O(log⁑(P))O(\log(P)) iterations to a fixed point whose BER at high SNR matches the full-diversity (order PP) slope. The per-iteration complexity is O(MNlog⁑(MN))O(MN \log(MN)) (two 2D FFTs plus element-wise operations), and total complexity is O(Titer MNlog⁑(MN))O(T_{\text{iter}}\,MN\log(MN)), where Titer=3T_{\text{iter}} = 3 suffices in practice.

Each iteration refines the estimate by correcting for the previously-estimated ISI. After a few iterations, the residual is noise-limited rather than interference-limited, and the soft-quantized decisions are near the true symbols. This is essentially a serial interference cancellation (SIC) scheme with MMSE for parallel processing.

Key Takeaway

LCD is the detector sweet spot. Linear complexity per iteration, O(MNlog⁑(MN))O(MN\log(MN)); only 2-3 iterations needed; BER slope = PP (full diversity). For MN=104MN = 10^4 and P=8P = 8, LCD runs in ∼3Γ—105\sim 3 \times 10^5 ops per frame β€” about 3Γ— MMSE and ∼1/30\sim 1/30Γ— the MP cost. For most practical OTFS receivers, LCD is the operational choice.

LCD Algorithm Pseudocode

Complexity: O(Titerβ‹…MNlog⁑(MN))O(T_{\text{iter}}\cdot MN\log(MN))
Input: yDD\mathbf{y}_{DD}, channel {(hi,β„“i,ki)}\{(h_i, \ell_i, k_i)\},
noise variance Οƒ2\sigma^2, QAM alphabet X\mathcal{X},
max iterations Titer=3T_{\text{iter}} = 3
Output: Detected symbols X^DD\hat{X}_{DD}
1. Compute 2D DFT of yDD\mathbf{y}_{DD}: y~=F yDD\tilde{\mathbf{y}} = \mathbf{F}\,\mathbf{y}_{DD}.
2. Compute channel eigenvalues Ξ›\boldsymbol{\Lambda} from the
PP path parameters (done once for the frame).
3. Initialize x^(0)=FH((Ξ›βˆ—/βˆ£Ξ›βˆ£2+Οƒ2) y~)\hat{\mathbf{x}}^{(0)} = \mathbf{F}^H((\boldsymbol{\Lambda}^*/|\boldsymbol{\Lambda}|^2 + \sigma^2)\,\tilde{\mathbf{y}}). (MMSE)
4. for t=1,…,Titert = 1, \ldots, T_{\text{iter}} do
5. \quad Soft-quantize: x~(tβˆ’1)=Qsoft(x^(tβˆ’1))\tilde{\mathbf{x}}^{(t-1)} = Q_{\text{soft}}(\hat{\mathbf{x}}^{(t-1)})
(soft QAM demap β€” closest QAM symbol, weighted by likelihood)
6. \quad Compute residual: r(t)=yDDβˆ’HDD x~(tβˆ’1)\mathbf{r}^{(t)} = \mathbf{y}_{DD} - \mathbf{H}_{DD}\,\tilde{\mathbf{x}}^{(t-1)}
(sparse multiply, O(P MN)O(P\,MN))
7. \quad Apply MMSE to residual: Ξ”x(t)=MMSE(r(t))\Delta\mathbf{x}^{(t)} = \text{MMSE}(\mathbf{r}^{(t)})
8. \quad Update: x^(t)=x~(tβˆ’1)+Ξ”x(t)\hat{\mathbf{x}}^{(t)} = \tilde{\mathbf{x}}^{(t-1)} + \Delta\mathbf{x}^{(t)}
9. end for
10. Hard-quantize final: x^=Qhard(x^(Titer))\hat{\mathbf{x}} = Q_{\text{hard}}(\hat{\mathbf{x}}^{(T_{\text{iter}})}).
11. Return X^DD=reshape(x^)\hat{X}_{DD} = \text{reshape}(\hat{\mathbf{x}}).

The soft quantization QsoftQ_{\text{soft}} returns an expected value under the QAM posterior β€” effectively a "soft decision" that preserves uncertainty. Hard quantization is applied only at the end. This preserves diversity across iterations.

Uncoded BER: MMSE, LCD, MP, ML-Bound

Plot uncoded BER vs SNR for all four detectors on the same PP-path OTFS channel. The ML bound is the lower envelope (diversity PP). MP tracks the ML bound within ∼0.5\sim 0.5 dB. LCD is close to MP (within 1-2 dB). MMSE is visibly inferior (diversity 1 slope). This is the master comparison plot for OTFS detection.

Parameters
4
32
16
5
30

OTFS Detector Landscape: Complexity vs Performance

DetectorComplexityDiversitySNR gap to ML (typical)
MMSEO(MNlog⁑(MN))O(MN \log(MN))15–8 dB
LCD (3 iter)O(MNlog⁑(MN))O(MN \log(MN))PP (full)1–2 dB
MP (10 iter)O(Pβ‹…MN)O(P \cdot MN)PP (full)0.5–1 dB
ML (sphere decoding)O(MN3)O(MN^3) averagePP0 dB
ML (brute force)O(∣X∣MN)O(|\mathcal{X}|^{MN})PP0 dB

Example: LCD for QPSK on P=4P = 4, MN = 1024

An OTFS receiver with MN=1024MN = 1024 and P=4P = 4 paths uses LCD with 3 iterations. Channel SNR is 20 dB. Estimate the detection BER and the number of flops.

LCD Detector Flowchart

LCD Detector Flowchart
LCD detector flowchart: start with MMSE output, iterate between residual computation (subtract soft-quantized ISI) and MMSE refinement. Three iterations typically suffice. Soft-quantization preserves diversity across iterations; hard-quantization is applied only at the final output stage.
⚠️Engineering Note

LCD in Practical OTFS Receivers

LCD is favored in deployment because:

  1. Simplicity: two 2D FFTs + element-wise operations. Easy to implement on standard DSP hardware.
  2. Deterministic latency: fixed number of iterations (3) gives predictable computation time β€” critical for URLLC applications.
  3. Full diversity: at moderate QAM orders, LCD's BER slope is indistinguishable from ML within the operating SNR range.
  4. Graceful degradation: at low SNR, LCD's performance is dominated by MMSE but remains better than MMSE alone.
  5. Compatible with existing 5G NR silicon: uses the same 2D FFT kernels as OFDM equalization.

As of 2024, LCD is the detector of choice for pilot OTFS deployments (including the CommIT cell-free OTFS testbed). MP is reserved for research and cases where the 1-2 dB extra gain is decisive.

Practical Constraints
  • β€’

    3 LCD iterations = deterministic latency

  • β€’

    Works on standard OFDM silicon

  • β€’

    Adequate BER performance for QPSK to 64-QAM

Common Mistake: Don't Over-Iterate LCD

Mistake:

Using Titerβ‰₯10T_{\text{iter}} \geq 10 in LCD, expecting better BER at higher iterations. In fact, LCD typically converges in 3 iterations and further iterations slightly degrade BER due to round-off and soft-quantization noise accumulation.

Correction:

Use exactly 3 iterations in LCD. Measure convergence by the norm of the residual; if it plateaus after 2-3 iterations, halt. Spending more iterations is wasteful without benefit. Some implementations use 2 iterations with minor BER loss (< 0.5 dB) and linear compute savings.