Ferkans — Interactive Telecom Tutor

Definition:
Score Function and Score Matching

The score function is the gradient of the log-density:

$\mathbf{s}_\theta(\mathbf{x}) \approx \nabla_\mathbf{x} \log p(\mathbf{x})$

Score matching trains $\mathbf{s}_\theta$ without knowing $p(\mathbf{x})$ . Denoising score matching adds noise and learns:

$\mathbf{s}_\theta(\tilde{\mathbf{x}}, \sigma) \approx \nabla_{\tilde{\mathbf{x}}} \log p_\sigma(\tilde{\mathbf{x}})$

Definition:
Flow Matching

Flow matching learns a velocity field $\mathbf{v}_\theta(\mathbf{x}_t, t)$ that transports noise $\mathbf{x}_0 \sim \mathcal{N}(0, I)$ to data $\mathbf{x}_1 \sim p_{\text{data}}$ along straight paths:

$\mathbf{x}_t = (1-t)\mathbf{x}_0 + t\mathbf{x}_1$ $L = \mathbb{E}_{t, \mathbf{x}_0, \mathbf{x}_1}\left[\|\mathbf{v}_\theta(\mathbf{x}_t, t) - (\mathbf{x}_1 - \mathbf{x}_0)\|^2\right]$

Sampling: solve the ODE $d\mathbf{x}/dt = \mathbf{v}_\theta(\mathbf{x}, t)$ from $t=0$ to $t=1$ .

Flow matching is simpler to train than score-based SDEs and often requires fewer sampling steps.

Example: Flow Matching Training

Implement flow matching training for 2D data.

Solution

Implementation

for x1 in data_loader:  # target data
    t = torch.rand(x1.size(0), 1)
    x0 = torch.randn_like(x1)  # noise
    xt = (1 - t) * x0 + t * x1
    v_target = x1 - x0
    v_pred = model(xt, t)
    loss = F.mse_loss(v_pred, v_target)

Score-Based Models and Flow Matching

Definition: Score Function and Score Matching

Definition: Flow Matching

Example: Flow Matching Training

Implementation

Definition:
Score Function and Score Matching

Definition:
Flow Matching