Relation to STFT and Gabor Frames

Three Views of the Same Object

The Zak transform can feel abstract because it is defined by a formal infinite sum. It turns out that the Zak transform is intimately connected to two more familiar time-frequency objects: the short-time Fourier transform (STFT) and the Gabor expansion. Understanding these connections grounds the Zak transform in classical signal processing and, more importantly, tells us how to approximate it numerically.

Definition:

Short-Time Fourier Transform

For a signal f(t)f(t) and a window function g(t)g(t), the short-time Fourier transform (STFT) is Vgf(t,ν)  =  f(s)g(st)ej2πνsds.V_g f(t, \nu) \;=\; \int_{-\infty}^{\infty} f(s)\,\overline{g(s - t)}\,e^{-j 2\pi \nu s}\,ds. The STFT measures the "frequency content of ff near time tt" through the window gg.

When the window gg is a unit-height rectangle of width T0T_0, the STFT becomes closely related to the Zak transform — the integral collapses to a sum over a single period, and the full Zak transform emerges by considering all periods.

Gabor Frames and Why They Matter for OTFS

The Gabor system is a lattice of time-frequency shifted copies of a prototype pulse gg: {gm,n(t)=g(tmT0)ej2πnν0t}m,nZ\{g_{m, n}(t) = g(t - m T_0)\,e^{j 2\pi n \nu_0 t}\}_{m, n \in \mathbb{Z}}. A signal can be expanded in this system: f(t)  =  m,ncm,ngm,n(t),f(t) \;=\; \sum_{m, n} c_{m, n}\,g_{m, n}(t), with coefficients cm,nc_{m, n} determined by inner products with a dual window. This is exactly the OTFS modulator structure: the transmit signal is a sum of delay-Doppler shifted copies of a prototype pulse, with coefficients equal to the data symbols placed on the DD grid.

The central question — when is this expansion unique? — is answered by the Zak transform. For the critical lattice T0ν0=1T_0 \cdot \nu_0 = 1, the dual window is a 1:1 inverse of the prototype in Zak coordinates: g~(t)=Z1[1/Zg(t,ν)]\tilde{g}(t) = Z^{-1}[1 / Z_g(t, \nu)]. Good prototypes have ZgZ_g bounded away from zero; "bad" prototypes (like narrow Gaussians with σT0\sigma \ll T_0) have zeros in ZgZ_g, making the Gabor expansion ill-conditioned.

Theorem: Existence of Gabor Frames (Critical Lattice)

Consider the Gabor lattice {g(tmT0)ej2πnν0t}m,nZ\{g(t - m T_0)\,e^{j 2\pi n \nu_0 t}\}_{m, n \in \mathbb{Z}} with T0ν0=1T_0 \nu_0 = 1 (critical density). The system is a Riesz basis of L2(R)L^2(\mathbb{R}) if and only if 0<ess inf(t,ν)T2Zg(t,ν)    ess sup(t,ν)T2Zg(t,ν)<.0 < \underset{(t, \nu) \in \mathbb{T}^2}{\operatorname{ess\,inf}} |Z_g(t, \nu)| \;\leq\; \underset{(t, \nu) \in \mathbb{T}^2}{\operatorname{ess\,sup}} |Z_g(t, \nu)| < \infty. The dual window is g~=Z1[1/Zg]\tilde{g} = Z^{-1}[1/Z_g].

The Zak transform of the prototype pulse gg directly determines whether the Gabor system is well-conditioned. Zeros in ZgZ_g correspond to "holes" in the tiling — the Gabor system fails to span certain directions of the signal space. For OTFS design, this is a concrete pulse-shaping criterion: choose gg so that Zg|Z_g| is bounded away from zero on T2\mathbb{T}^2.

,

Zak Transform of a Chirp

A chirp ϕ(t)=ejπαt2\phi(t) = e^{j \pi \alpha t^2} with sweep rate α\alpha has a Zak transform that is localized along a diagonal of the fundamental domain — the DD-plane signature of a linear frequency sweep. As α\alpha varies, the diagonal tilts. Notice that at critical sweep rates (where αT02\alpha T_0^2 is an integer), the Zak transform "folds" onto itself due to the Heisenberg–Weyl group's metaplectic action.

Parameters
1
4
4

Gabor-Friendly vs Gabor-Hostile Pulses

Plot Zg(t,ν)|Z_g(t, \nu)| for three prototype pulses: (1) a rectangle of width T0T_0 (flat Zak, ideal Gabor), (2) a Gaussian with σ=0.3T0\sigma = 0.3\,T_0 (narrow Gaussian — zeros in Zak, breaks Gabor expansion at critical lattice), and (3) a raised-cosine with roll-off 0.5 (smooth, no zeros). Observe how the lowest value of Zg|Z_g| over the fundamental domain predicts the condition number of the Gabor expansion.

Parameters
0.25
0.4

Historical Note: Joshua Zak and the 1967 Paper

1967

Joshua Zak, then at the Technion in Haifa, published "Finite Translations in Solid-State Physics" in the October 1967 issue of Physical Review Letters. His motivation had nothing to do with signal processing — he was studying the spectrum of electrons in a crystal under an applied magnetic field (the integer quantum Hall effect, decades before its discovery). To handle the two conflicting periodicities (crystal lattice in space, magnetic phase in momentum), he introduced what is now called the Zak transform.

Two decades later, Augustus Janssen (Philips Research Labs, Eindhoven) recognized that Zak's construction solved a central problem in Gabor analysis: characterizing when the Gabor system at critical density is a Riesz basis. Janssen's 1988 paper "The Zak Transform: a Signal Transform for Sampled Time-Continuous Signals" brought Zak's idea into signal processing.

A further three decades later, Hadani and Rakib realized that the Zak transform was exactly the tool needed to build a waveform on the delay-Doppler plane. OTFS was born. Solid-state physics, harmonic analysis, wireless communications — the same mathematical object serves all three.

The Gabor Expansion Is the OTFS Modulator

Looking ahead: in Chapter 6 we will write the OTFS transmit waveform as x(t)  =  =0M1k=0N1X[,k]g(tΔτ)ej2πkΔνt,x(t) \;=\; \sum_{\ell = 0}^{M-1}\sum_{k=0}^{N-1} X[\ell, k]\,g(t - \ell\,\Delta\tau)\,e^{j 2\pi k\,\Delta\nu\, t}, where g(t)g(t) is a prototype pulse and X[,k]X[\ell, k] are the data symbols on the DD grid. This is exactly a Gabor expansion. The theorems of this section tell us when this expansion is invertible (reconstruction at the receiver) and how to choose the pulse gg (bounded Zg|Z_g| from above and below). Chapter 20 revisits pulse shaping with this in mind.

Common Mistake: The "Ideal OTFS Pulse" Is a Fiction

Mistake:

Early OTFS papers (Hadani 2017, Raviteja 2018) assume a prototype pulse gg that is simultaneously rectangular in time and rectangular in frequency — i.e., a "bi-orthogonal" pulse with flat Zak transform on both sides. This is not achievable by any finite-energy signal.

Correction:

No L2L^2 function can be compactly supported in both time and frequency (this is the uncertainty principle). The "ideal pulse" is a pedagogical device. Practical OTFS uses a raised-cosine or truncated Gaussian pulse, and the non-ideality introduces second-order correction terms to the DD input-output relation. These corrections are captured by off-diagonal entries in the DD channel matrix (Chapter 4) and are small for typical system parameters — but they are never zero. A clean treatment is Mohammed–Hadani–Chockalingam–Caire (2022).

Gabor frame

A Gabor frame is a collection of time-frequency shifted copies of a prototype pulse gg, {g(tmT0)ej2πnν0t}m,nZ\{g(t - m T_0)\,e^{j 2\pi n \nu_0 t}\}_{m, n \in \mathbb{Z}}, that spans L2(R)L^2(\mathbb{R}). When T0ν0=1T_0 \nu_0 = 1 (critical density) and the Zak transform ZgZ_g is bounded away from zero, the Gabor system is a Riesz basis. The OTFS modulator is a Gabor expansion at critical density.

Related: Zak Transform, Otfs Modulation