Ferkans — Interactive Telecom Tutor

The Joint Algorithm

This section develops the concrete joint estimation-detection algorithm for OTFS-ISAC. The structure follows from the DD-ISAC identity (§2): treat the data grid $X_{DD}$ and the target parameters $\Theta$ as two unknowns in a bilinear model, and alternate between estimating each given the other. The result is a unified ISAC receiver that achieves near-optimal sensing and detection simultaneously.

The point is that this alternating approach is not a heuristic — it is the natural EM-like algorithm for the OTFS-ISAC likelihood, with convergence guarantees to a local MAP optimum.

🎓CommIT Contribution(2020)

DD-Domain Waveform Design for ISAC

L. Gaudio, M. Kobayashi, G. Caire, G. Colavolpe — IEEE Trans. Wireless Communications

Gaudio, Kobayashi, Caire, and Colavolpe (IEEE TWC 2020) established the technical foundation for OTFS-ISAC. Their key contributions:

Quantitative CRLB analysis proving that OTFS meets the information-theoretic limits for joint range-velocity estimation, simultaneously with data communication.
Thumbtack ambiguity established rigorously as a consequence of OTFS's Gabor lattice structure (Chapter 11's foundation originated here).
Comparison with OFDM-ISAC: proof that OTFS requires fewer waveform resources to achieve the same sensing accuracy.
Joint estimation algorithm: alternating between data detection and target estimation, with performance close to ML asymptotically.

This is one of the two main CommIT contributions defining OTFS-ISAC (the other being the Yuan-Schober-Caire 2024 tutorial). Together, they establish OTFS as the 6G ISAC waveform. The algorithm we develop in this section is the concrete implementation of the Gaudio et al. framework.

commitisacgaudio-caire

Joint OTFS-ISAC Estimation-Detection

Complexity:

O(T_{\text{iter}} \cdot MN \log(MN))

Input: Received DD grid

Y_{DD}

, pilot structure, noise

variance

\sigma^2

, max iterations

T_{\text{iter}}

Output: Target estimates

\hat{\Theta}

, data estimates

\hat{X}_{DD}

1. Initial target estimation:

Use embedded pilot (Chapter 7) to find initial

(\hat{\ell}_i, \hat{k}_i, \hat{h}_i)

.

2. Initialize

\hat{X}_{DD}^{(0)} = 0

,

\hat{\Theta}^{(0)}

from step 1.

3. for iteration

t = 1, \ldots, T_{\text{iter}}

do

4.

\quad

E-step (data detection): Given

\hat{\Theta}^{(t-1)}

,

apply LCD or MP detector (Chapter 8) to obtain

\hat{X}_{DD}^{(t)}

.

5.

\quad

Super-resolution step (target refinement):

Given

\hat{X}_{DD}^{(t)}

, refine

\hat{\Theta}^{(t)}

:

a. Subtract data contribution:

\tilde{Y}^{(t)} = Y_{DD} - \mathbf{H}_{DD}(\hat{\Theta}^{(t-1)})\,\hat{X}_{DD}^{(t)}

.

b. Correlate against hypothesized

(\tau, \nu)

offsets near

each detected target: compute local ambiguity surface.

c. Newton iteration on local peak to obtain sub-grid

(\hat{\tau}, \hat{\nu})

accuracy.

6. end for

7. Return

(\hat{\Theta}^{(T_{\text{iter}})}, \hat{X}_{DD}^{(T_{\text{iter}})})

.

The algorithm alternates between:

Data detection (Chapter 8 detector with estimated channel).
Target refinement (Newton / super-resolution on residual). Each iteration improves both estimates. Typical $T_{\text{iter}} = 2$ – $3$ . Total complexity: $\sim 3\times$ data-only OTFS detection.

,

Theorem: Convergence of Joint ISAC Algorithm

The alternating estimation-detection algorithm above converges to a local MAP estimate of $(\Theta, X_{DD})$ under mild conditions:

Initialization close enough to the global MAP that the local basin contains both (i.e., initial channel estimate within the thumbtack main lobe).
Noise variance $\sigma^2$ below a level determined by the target scene's dynamic range.

Convergence speed: $O(\log(MN))$ iterations; typically 2-3 suffice at 10+ dB SNR. The final estimates $(\hat{\Theta}, \hat{X}_{DD})$ achieve the CRLB and BER slopes respectively of the two objectives.

This is the standard EM-algorithm convergence: each iteration improves the joint likelihood by moving in the gradient direction of one variable at a time. The thumbtack ambiguity ensures that the target-refinement step has a well-behaved local landscape (no ambiguity within the main lobe), hence reliable convergence.

Proof

EM interpretation

E-step: maximize likelihood w.r.t. $X_{DD}$ given $\Theta$ . M-step: maximize likelihood w.r.t. $\Theta$ given $X_{DD}$ . Each step increases the log-likelihood monotonically.

Local MAP

The algorithm converges to a fixed point of the EM iterations, which is a local maximum of $p(Y | X, \Theta)$ .

Global optimality

At high SNR and with good initialization, the local maximum is the global maximum — achieving the joint ML estimator. At low SNR or with poor initialization: local-only convergence.

Rate

EM convergence rate: bounded by the matrix condition number of the Hessian at the MAP. For OTFS with thumbtack ambiguity, the condition number is moderate; rate is $O(\log(MN))$ . $\blacksquare$

Key Takeaway

Joint OTFS-ISAC converges reliably. The alternating algorithm of Gaudio et al. converges to a MAP solution in 2-3 iterations at typical SNRs, achieving both the sensing CRLB and the detection BER slope simultaneously. The thumbtack ambiguity is the key enabler — without it, target refinement could converge to false local maxima from ambiguous ridges.

Joint ISAC Algorithm: Data BER and Sensing MSE

Plot the data BER and sensing-MSE (averaged over target scenes) as a function of outer iteration count. Both metrics improve across 3-4 iterations; convergence is rapid. Compare with separate processing (first sense, then detect with known channel) — the joint algorithm achieves both objectives simultaneously with minimal iteration overhead.

Parameters

SNR (dB)15

Target paths P3

Max iterations5

How Far Can Super-Resolution Go?

The CRLB bounds the achievable accuracy: $\sigma_v = c/(2Tf_0\sqrt{\rho})$ . At SNR = 30 dB: $\sigma_v \approx 0.01 \Delta v$ . At SNR = 40 dB: $\sigma_v \approx 0.003 \Delta v$ . At SNR = 50 dB: $\sigma_v \approx 0.001 \Delta v$ .

So 1/1000-th resolution is achievable at high SNR — sub-mm/s velocity at automotive mmWave. This is what Chapters 13-14 exploit for fine-grained target tracking and sensing-assisted beamforming.

Example: Pedestrian Tracking Accuracy

OTFS-ISAC at $W = 100$ MHz, $T = 3$ ms, $f_0 = 77$ GHz. Pedestrian at SNR = 30 dB. Compute position-tracking accuracy per frame.

Solution

Resolution

$\Delta R = 1.5$ m, $\Delta v = 0.65$ m/s.

Super-resolution

CRLB at 30 dB: $\sigma_R = 1.5/\sqrt{2\pi^2 \cdot 1000}$ = 1.1 cm. $\sigma_v = 0.65/\sqrt{2\pi^2 \cdot 1000}$ = 4.6 mm/s.

Frame rate

100 frames/sec. Averaging over 10 frames: further factor $\sqrt{10}$ improvement. 3 mm range, 1.5 mm/s velocity.

Tracking quality

Position tracked to 3 mm, velocity to 1.5 mm/s — sufficient for gait analysis, fall detection, and high-precision positioning. All while the same waveform carries ~200 Mbps data.

Newton Refinement for Fractional Offset

Complexity:

O(1)

per Newton step

Input: Residual

\tilde{Y}

, initial estimate

(\hat{\ell}_0, \hat{k}_0)

,

data

X_{DD}

Output: Refined

(\epsilon^{(\tau)}, \epsilon)

within cell

1. Initialize

(\epsilon^{(\tau)}, \epsilon) = (0, 0)

.

2. for Newton step

s = 1, \ldots, S

do

3.

\quad

Compute local ambiguity surface values on a small grid.

4.

\quad

Fit quadratic to the peak:

\Lambda \approx A - B(\epsilon^{(\tau)}-x_*)^2 - C(\epsilon-y_*)^2

.

5.

\quad

Update:

(\epsilon^{(\tau)}, \epsilon) \leftarrow (x_*, y_*)

.

6. end for

7. Return

(\epsilon^{(\tau)}, \epsilon)

.

Quadratic fit is a second-order approximation of the thumbtack near its peak (reliable within the main lobe). After 2-3 Newton iterations, the refinement converges to the peak to machine precision. Total cost per target: $\sim 10$ multiplies.

⚠️Engineering Note

Compute Budget for OTFS-ISAC

Total OTFS-ISAC receiver compute budget at 5G NR-aligned parameters ( $MN = 10^4$ , 100 frames/sec):

OTFS demodulation (Wigner + SFFT): $\sim 10^5$ ops.
Channel/target estimation (embedded pilot + detection): $\sim 10^5$ ops.
Data detection (LCD, 3 iterations): $\sim 3 \times 10^5$ ops.
Target refinement (Newton, 10 targets × 3 iters): $\sim 10^4$ ops.
Iteration outer loop (2-3 joint iterations): 2-3× above.

Total: $\sim 10^6$ - $10^7$ ops per frame = $10^8$ - $10^9$ ops/sec. Readily handled by modern SoC. For reference: OTFS data-only receiver is $\sim 10^6$ ops/frame — ISAC is $2$ - $3\times$ this budget.

Memory: ISAC needs additional buffers for the residual $\tilde{Y}$ and target parameter tables — $\sim 10\times MN$ bytes, typically $100$ KB. Negligible.

Practical Constraints

•
ISAC compute: 2-3× OTFS-communications
•
Memory: 10× more than pure communications
•
All fits in 5G SoC hardware

Joint Delay-Doppler Estimation and Data Detection

The Joint Algorithm

DD-Domain Waveform Design for ISAC

Joint OTFS-ISAC Estimation-Detection

Theorem: Convergence of Joint ISAC Algorithm

EM interpretation

Local MAP

Global optimality

Rate

Key Takeaway

Joint ISAC Algorithm: Data BER and Sensing MSE

Parameters

Definition:
Super-Resolution Target Refinement

How Far Can Super-Resolution Go?

Example: Pedestrian Tracking Accuracy

Resolution

Super-resolution

Frame rate

Tracking quality

Newton Refinement for Fractional Offset

Compute Budget for OTFS-ISAC

Joint Delay-Doppler Estimation and Data Detection

The Joint Algorithm

DD-Domain Waveform Design for ISAC

Joint OTFS-ISAC Estimation-Detection

Theorem: Convergence of Joint ISAC Algorithm

EM interpretation

Local MAP

Global optimality

Rate

Key Takeaway

Joint ISAC Algorithm: Data BER and Sensing MSE

Parameters

Definition: Super-Resolution Target Refinement

How Far Can Super-Resolution Go?

Example: Pedestrian Tracking Accuracy

Resolution

Super-resolution

Frame rate

Tracking quality

Newton Refinement for Fractional Offset

Compute Budget for OTFS-ISAC

Definition:
Super-Resolution Target Refinement