Multi-Target Tracking on the DD Grid
From Snapshot to Track
§§1-3 treated a single frame of MIMO-OTFS-ISAC: estimate the target scene once, design the beamformer, run the comms and sensing tasks in parallel. But the scene evolves: vehicles move, pedestrians cross, new scatterers appear. This section lifts the snapshot analysis to the tracking problem — estimating target trajectories over frames, exploiting their continuity in the DD-angle domain. The DD representation is especially convenient for tracking because each target is a point in the DD plane whose coordinates change smoothly frame to frame.
Definition: Target State Model
Target State Model
At frame , target has state — range, radial velocity, angle, angular velocity, complex reflectivity.
State evolution (linear constant-velocity model): with frame duration and process noise .
Observation model (from MIMO-OTFS-ISAC): where maps the state to the observation (delay, Doppler, angle), and is the estimation error with covariance given by the CRB.
Theorem: Extended Kalman Tracking on the DD-Angle Grid
For a target with linear state evolution and nonlinear observation (the mapping is nonlinear in ), the extended Kalman filter (EKF) tracks the target state with covariance where is the Kalman gain, is the observation Jacobian, and is the predicted covariance.
Under steady-state tracking with process noise and observation noise , the steady-state filter MSE is Consequence. Sensing-optimal beamforming ( illuminating target directions) reduces tracking MSE by vs. uniform illumination. This is the quantitative gain from beam-aware tracking.
Tracking a moving target is like solving a noisy linear regression — the data noise is the CRB, the process noise is how erratically the target maneuvers. Lower CRB (better sensing) compounds over time via the Kalman update, giving a multiplicative improvement in steady-state MSE. This is why even a small sensing gain per frame matters: it compounds into a large tracking gain over many frames.
Innovation
Innovation ; its covariance is .
Kalman gain
.
Update
State: . Covariance: .
Steady state
Solving the Riccati equation yields the scaling above when is identity (stationary state) and scales as .
Multi-Target EKF on the DD-Angle Grid
Example: Highway Multi-Vehicle Tracking
A roadside BS at 77 GHz tracks vehicles on a highway. Frame rate ms. Vehicle speeds 60-120 km/h. Range resolution m (from MHz), velocity resolution m/s (from ms at 77 GHz).
(a) Predict tracking MSE in steady state. (b) Evaluate association reliability for two vehicles at similar range. (c) Discuss birth/death handling at highway entrances.
Steady-state MSE
Process noise: m². Observation noise: m². Riccati: m = 1.5 cm.
Association with close vehicles
Two vehicles at ranges 80 m, 81.5 m: separation 1.5 m > resolution 1.5 m. Resolvable per frame. Joint tracking across frames further reduces MSE — after 10 frames, cm MSE. Clear discrimination.
Birth/death at entrances
New vehicle appears at 150 m range. Unassociated observation for 2-3 frames before track initiates (confirmation window). Then tracked at 1.5 cm MSE after ~1 s.
Summary
Multi-target tracking on the DD-angle grid achieves cm-level positional accuracy with 10-ms update. Highway-scale deployments operate reliably. Algorithm: EKF + JPDA.
Steady-State Tracking MSE vs SNR
Plot the steady-state Kalman tracking MSE (position) as a function of receive SNR, comparing single-snapshot CRB (no tracking) with steady-state EKF. Sliders: frame rate, process noise, beam-aware vs uniform illumination.
Parameters
Theorem: Predictive Beamforming Gain
Suppose the BS knows the predicted target states for frame with covariance . Using this prediction to pre-steer the sensing beam at frame yields improvement in tracking MSE of where the denominator is the CRB with uniform illumination. For well-tracked targets, this ratio is — predictive beamforming provides order-of-magnitude MSE improvement vs. blind (uniform) illumination.
Once a target is being tracked, the system knows where it is likely to be at the next frame — within a beamwidth. Concentrating the sensing beam there improves observation SNR and therefore reduces tracking noise. This creates a positive feedback loop: good tracking leads to good prediction leads to focused sensing leads to better tracking. The loop is stable as long as predictions do not diverge — the topic of §5.
Beam pattern
Sensing covariance concentrates energy in directions.
CRB improvement
whereas (uniform). Gain: , typically 4-10x.
Kalman update
Lower CRB directly reduces Kalman update noise, and the reduction propagates to steady-state via the Riccati equation.
Predictive Tracking with MIMO-OTFS-ISAC
The CommIT contribution on predictive MIMO-OTFS-ISAC tracking establishes two key results: (1) the steady-state tracking MSE scales as for a Kalman-filtered target, with explicit closed-form expressions for the multi-target multi-user scenario; (2) sensing-aware beamforming (pre-steering based on predictions) reduces steady-state MSE by the beamforming gain , a multiplicative improvement over blind illumination.
Combined with the DD-domain channel sparsity of §1, this result makes cm-level multi-target tracking feasible at highway frame rates (100 Hz). Without the DD framework, the same sensing gain would be nullified by channel estimation errors on the order of the target spacing. The DD domain's sparsity is what allows the predictive feedback loop to remain stable under realistic CSI uncertainty.
Historical Note: From Classical Radar Tracking to DD-Angle EKF
Classical radar tracking (PDA, IMM, JPDA) dates to Bar-Shalom's 1970s work on multi-target estimation. Classical algorithms operate in Cartesian position-velocity space and assume a known measurement likelihood. The DD-angle framework here gives a principled prior distribution for the measurements (from the DD structure of OTFS), not an ad-hoc choice. This is the main advance: the same Kalman and JPDA machinery, but with measurement noise and innovation covariances derived from the waveform, not guessed.
In automotive applications, this integration eliminates the "sensor fusion layer" that classical designs use to reconcile radar and camera tracks — OTFS-ISAC provides both modalities simultaneously, with coherent measurement models.
Common Mistake: Don't Track Ghosts
Mistake:
Associating every observed DD-angle peak with a target track. Spurious peaks — from sidelobes of nearby targets, ground clutter, or random noise — create ghost tracks that persist if not actively pruned.
Correction:
Use confirmation windows: a track is confirmed only after 2-3 frames of consistent observations. Use track quality metrics (cumulative innovation, likelihood ratio) to terminate low-quality tracks. In high-clutter environments (urban, forest), operate with higher birth thresholds (). Cross-modal confirmation with camera or lidar is a standard robustification technique in automotive.