Bridging the Sim-to-Real Gap
The Central Challenge
The sim-to-real gap -- the performance degradation when deploying simulation-trained models on real data -- is arguably the most pressing open problem in learned RF imaging. Models trained on point-scatterer simulations routinely lose 10--15 dB PSNR on measured data. Until this gap is closed, learned imaging methods remain laboratory curiosities rather than deployed systems. We do not yet know how to reliably quantify this gap before collecting real data, nor how to guarantee that a domain-adapted model will not fail catastrophically on unseen environments.
Definition: The Sim-to-Real Gap
The Sim-to-Real Gap
The sim-to-real gap arises from the distribution shift between simulated and real data:
where is a model trained on . Sources of the gap include:
- Model mismatch: Born vs multipath, point vs extended targets.
- Hardware impairments: IQ imbalance, phase noise, mutual coupling not modelled in simulation.
- Scene complexity: real scenes have materials, textures, and clutter absent from simple simulations.
- Environmental factors: interference, temperature drift, multipath from unmapped objects.
Definition: Domain Adaptation Approaches
Domain Adaptation Approaches
Several strategies address the sim-to-real gap:
-
Domain randomisation: randomise simulator parameters (noise level, material properties, array errors) during training to cover the real distribution.
-
Fine-tuning: pre-train on simulation, fine-tune on a small real dataset (10--100 examples).
-
Adversarial domain adaptation: train a domain discriminator alongside the imaging network, minimising to learn domain-invariant features.
-
Physics-based simulation: use high-fidelity simulators (ray tracing via Sionna RT, FDTD) that model the dominant real-world effects.
-
Self-supervised learning: train on real data without labels using self-consistency losses (predict held-out measurements from the reconstruction).
Visualises the distribution shift between simulated and real RF data in a 2D feature space. The overlap region determines how well a simulation-trained model transfers.
Parameters
Definition: Meta-Learning for Few-Shot Adaptation
Meta-Learning for Few-Shot Adaptation
Meta-learning trains a model to adapt quickly from few examples. For RF imaging, model-agnostic meta-learning (MAML) optimises initial parameters such that a few gradient steps on real-data examples yield good performance:
where denotes gradient steps on support set . Each "task" is a different environment (room, hardware platform). The meta-learned initialisation captures scene priors that transfer across environments while allowing rapid adaptation to each.
Example: Sim-to-Real Budget Analysis
A learned indoor imaging system at 5 GHz is trained on 50,000 Born-model simulations. Real deployment on a TI IWR6843 in an office loses 12 dB PSNR. Decompose the gap into sources and propose a mitigation strategy with a budget of 20 real scenes.
Gap decomposition
Estimate each source by ablation: (1) Model mismatch (Born vs multipath): dB. (2) Off-grid targets: dB. (3) Phase noise / mutual coupling: dB. (4) Clutter not in training: dB. Total: dB, consistent with the observed gap.
Mitigation with 20 real scenes
(1) Add ray-tracing simulations to cover multipath (recovers dB). (2) Fine-tune with 10 real scenes, held-out 10 for validation (recovers dB). (3) Domain randomisation over hardware impairments (recovers dB). Expected residual gap: dB, within practical tolerance.
Practical Sim-to-Real Transfer Pipeline
A production sim-to-real pipeline for RF imaging deployment: (1) Tier 1: Born-model simulation with domain randomisation (-- scenes). (2) Tier 2: ray-tracing simulation with calibrated material models ( scenes). (3) Tier 3: fine-tuning on 10--50 real measured scenes with self-supervised measurement-consistency loss. (4) Monitoring: track the normalised residual at deployment; if it exceeds a calibrated threshold, flag for re-adaptation. This tiered approach is the current best practice, though the optimal allocation across tiers remains an open problem.
Common Mistake: Overfitting to Few Real Scenes
Mistake:
Fine-tuning a high-capacity network (e.g., U-Net with parameters) on 10 real scenes without regularisation. The network memorises the 10 scenes and fails on scene 11.
Correction:
Use strong regularisation: freeze early layers, reduce learning rate by --, apply weight decay, and use early stopping on a held-out validation scene. Alternatively, use self-supervised loss (measurement consistency) which does not require ground-truth labels and thus avoids label-based overfitting.
Open Questions in Sim-to-Real Transfer
-
A priori gap prediction: given a simulator and target scenario, can we predict without real data? This would enable informed decisions about simulation fidelity investment.
-
Cross-hardware transfer: can a model trained on TI IWR6843 data transfer to Infineon BGT60 with different antenna arrays and waveforms?
-
Worst-case guarantees: domain adaptation minimises the average gap, but can we guarantee the worst-case performance on any real scene?
-
Continual adaptation: can the model improve continuously as it encounters new environments, without forgetting previously learned ones?
These questions define a rich research programme that connects RF imaging to the broader machine learning literature on robustness and distribution shift.
Quick Check
Which sim-to-real mitigation strategy is most effective when only 5 real measured scenes are available?
Adversarial domain adaptation
Self-supervised measurement consistency
Increasing simulation dataset size to 500,000
Training only on the 5 real scenes from scratch
Self-supervised loss uses all measurements without requiring ground-truth labels, maximising information from each scene and avoiding label-based overfitting.
Sim-to-Real Gap
The performance degradation (typically 10--15 dB PSNR) observed when deploying a simulation-trained model on real measured data. Caused by distribution shift in forward model, hardware, and environment.
Related: Domain Adaptation, Inverse Crime
Domain Adaptation
Machine learning techniques that reduce the distribution shift between source (simulation) and target (real) domains. Includes domain randomisation, fine-tuning, adversarial methods, and self-supervised approaches.
Related: Sim-to-Real Gap
Key Takeaway
The sim-to-real gap (10--15 dB) is the primary barrier to deploying learned RF imaging. The most effective current approach is tiered simulation (Born + ray tracing) combined with self-supervised fine-tuning on a small real dataset. Predicting the gap a priori and guaranteeing worst-case robustness remain fundamental open problems.