Sparsity and Its Consequences

Sparsity Is a Resource, Not a Curiosity

We have said repeatedly that the DD channel is "sparse." This section turns that qualitative observation into quantitative advantages β€” concrete reductions in pilot overhead, detection complexity, and estimation error β€” that motivate the detailed transceiver design of Chapters 6-8.

The point is that sparsity is not merely an aesthetic property. It determines whether OTFS can be deployed in 5G/6G: a channel with PP parameters can be estimated from O(P)O(P) pilots instead of O(MN)O(MN), detected with O(PMN)O(PMN) operations instead of O(MN)2O(MN)^2, and tracked robustly under noise that would wash out a dense MNMN-parameter estimate.

Definition:

Effective Channel Support

The effective support of the DD channel on the (M,N)(M, N) grid is the rectangle Sβ€…β€Š=β€…β€Š{(β„“,k):0≀ℓ≀lmax⁑,β€‰βˆ’kmax⁑≀k≀kmax⁑},\mathcal{S} \;=\; \{(\ell, k) : 0 \leq \ell \leq l_{\max},\,-k_{\max} \leq k \leq k_{\max}\}, with lmax⁑=βŒˆΟ„max⁑WβŒ‰l_{\max} = \lceil \tau_{\max} W \rceil and kmax⁑=⌈fDTβŒ‰k_{\max} = \lceil f_D T \rceil. The actual PP paths lie within S\mathcal{S}, often covering only a small fraction of it. The effective support cardinality ∣S∣=(lmax⁑+1)(2kmax⁑+1)|\mathcal{S}| = (l_{\max} + 1)(2 k_{\max} + 1) is the natural upper bound for the number of "active" channel taps on the grid.

Theorem: Minimum Pilot Overhead for DD Channel Estimation

Consider a single pilot impulse placed at (β„“p,kp)(\ell_p, k_p) on the DD grid, surrounded by a guard region G={(β„“p+β„“,kp+k):0≀ℓ≀lmax⁑,β€‰βˆ’kmax⁑≀k≀kmax⁑}\mathcal{G} = \{(\ell_p + \ell, k_p + k) : 0 \leq \ell \leq l_{\max},\,-k_{\max} \leq k \leq k_{\max}\} in which no data is transmitted. After passing through a PP-path channel with effective support within S\mathcal{S}, the received DD grid contains in G\mathcal{G} a scaled-and-shifted copy of the pilot with amplitudes {hi}i=1P\{h_i\}_{i=1}^P at positions {(β„“i,ki)}\{(\ell_i, k_i)\}. All PP parameters can be recovered from the ∣G∣|\mathcal{G}| guard-region observations provided SNRpilotβ‹…βˆ£Gβˆ£β€…β€Šβ‰₯β€…β€ŠP,\mathrm{SNR}_{\text{pilot}} \cdot |\mathcal{G}| \;\geq\; P, i.e., the pilot SNR is large enough that each path's peak stands out above the noise floor.

The pilot overhead is ∣G∣/(MN)=(lmax⁑+1)(2kmax⁑+1)/(MN)|\mathcal{G}|/(MN) = (l_{\max} + 1)(2k_{\max} + 1)/(MN).

The pilot is "delta-like" in DD; the channel's action is to produce PP copies of it in the guard region. Each copy reveals one (hi,β„“i,ki)(h_i, \ell_i, k_i) triple β€” all the information needed to reconstruct h(Ο„,Ξ½)h(\tau, \nu). The guard region must be large enough to contain all copies without collision with data cells; the pilot SNR must be high enough for reliable peak detection.

Embedded Pilot in a Guard Region (Preview)

Place a single pilot impulse on the DD grid surrounded by a guard region sized for the channel's maximum delay and Doppler. After passing through the channel, the guard region contains PP copies of the pilot at the path coordinates β€” directly readable as (hi,β„“i,ki)(h_i, \ell_i, k_i). Adjust pilot power, guard size, and channel complexity to see how the peaks stand out against the noise floor. This is a preview of the embedded pilot estimator of Chapter 7.

Parameters
32
16
3
4
1
25
5

Example: Pilot Overhead at 5G Numerology

At 5G NR numerology-1 (Ξ”f=30\Delta f = 30 kHz, Ts=33.3 μT_s = 33.3\,\mus), an OTFS frame uses M=512M = 512, N=16N = 16. The channel has Ο„max⁑=2 μ\tau_{\max} = 2\,\mus, fD=500f_D = 500 Hz. Compute the pilot overhead using an embedded pilot with guard region of minimum size.

Theorem: MSE Scaling of Least-Squares DD Channel Estimation

Given a pilot of power PpilP_{\text{pil}} and a guard region large enough to fit all paths, the least-squares estimate h^i\hat{h}_i of each path gain has variance Var(h^iβˆ’hi)β€…β€Š=β€…β€ŠΟƒ2Ppil.\mathrm{Var}(\hat{h}_i - h_i) \;=\; \frac{\sigma^2}{P_{\text{pil}}}. The total MSE of the channel estimate is MSE=Pβ‹…Οƒ2/Ppil\mathrm{MSE} = P \cdot \sigma^2/P_{\text{pil}}, independent of the grid size MNMN.

Because the DD channel has only PP unknown parameters, the estimation error scales with PP, not with MNMN. In the TF domain, estimating H(f,t)H(f, t) at the Nyquist rate requires O(Ο„max⁑fDTW)O(\tau_{\max} f_D T W) pilot samples; the MSE scales with this larger number. Sparsity is the reason OTFS estimation is far cleaner than OFDM estimation at high mobility.

Key Takeaway

Sparsity converts an MNMN-dimensional channel estimation problem into a PP-dimensional one. The consequences are: (i) pilot overhead drops from ∼10%\sim 10\% (OFDM) to ∼1%\sim 1\% (OTFS) under similar conditions; (ii) estimation MSE scales with PP, not MNMN; (iii) detection complexity scales with Pβ‹…MNP \cdot MN rather than (MN)2(MN)^2. All three concrete advantages flow from the single structural fact that h(Ο„,Ξ½)h(\tau, \nu) has support of cardinality PP on the physical channel, and the DD grid resolves that support cleanly.

⚠️Engineering Note

When Sparsity Breaks Down

The sparsity argument requires PP to be small and the paths to be resolvable. Three regimes stress this assumption:

  • Rich scattering (dense urban NLOS): when the number of significant paths P∼50P \sim 50 or more (rare but possible in certain deep-scatter environments), the "sparsity" becomes marginal. The DD channel has many entries but each is weak, and the advantage over OFDM shrinks. In the extreme limit Pβ†’βˆžP \to \infty, OTFS has no essential advantage over OFDM.
  • Unresolvable clusters: when multiple physical reflectors are closer than Δτ=1/W\Delta\tau = 1/W in delay or Δν=1/T\Delta\nu = 1/T in Doppler, they merge into a single grid cell. The effective sparsity is determined by the number of resolvable clusters, not raw reflectors. For narrowband systems (W<1W < 1 MHz), the delay resolution can be too coarse to resolve close scatterers.
  • Diffuse scattering: some channel models (3GPP TR 38.901 indoor hotspot) include a diffuse component beyond the discrete paths. On the DD grid this manifests as a low-level "background" across many cells, reducing effective sparsity.

For realistic terrestrial mobile channels (urban macro, vehicular), measurements (COST 2100, METIS) consistently find P≀20P \leq 20 with good cluster resolution β€” the regime where OTFS sparsity is decisive.

Practical Constraints
  • β€’

    Discrete-path model valid for P≀20P \leq 20 typical

  • β€’

    Resolution requires Wβ‰₯1/Δτmin⁑W \geq 1/\Delta\tau_{\min} (bandwidth covers tap separation)

  • β€’

    Diffuse component adds ∼5%\sim 5\% noise floor on DD cells

πŸ“‹ Ref: 3GPP TR 38.901

Why This Matters: Embedded Pilot Design in Chapter 7

The pilot-overhead result of this section is the motivation for the embedded pilot scheme developed in Chapter 7. There we show (following the CommIT cell-free OTFS work) that a single pilot impulse with an appropriately sized guard region achieves the minimum overhead established here, with MSE matching the theoretical lower bound. The superimposed pilot approach β€” also treated in Chapter 7 β€” removes the guard region at the cost of additional receiver processing.