NeRF-Squared (Zhao et al.)
The First NeRF for Wireless Propagation
NeRF (Zhao et al., 2023) was among the first works to adapt neural radiance fields for radio-frequency propagation modelling. Rather than rendering images, NeRF predicts received signal strength (RSS) and channel state information (CSI) between arbitrary transmitter--receiver pairs by learning a volumetric representation of the RF environment. The key question: can a single neural scene function, trained on sparse measurements, predict RF propagation more accurately than ray tracing or empirical models?
Definition: NeRF Architecture
NeRF Architecture
NeRF represents the RF environment as two MLPs:
-
Geometry MLP : maps position (with positional encoding ) to volume density and an intermediate feature vector .
-
Signal MLP : maps the feature vector , ray direction , and frequency to a complex signal contribution .
The predicted received power between a transmitter at and receiver at is:
with the sum taken over samples along the direct ray from to .
The architecture mirrors optical NeRF: the geometry MLP captures where material exists (walls, furniture), while the signal MLP captures how that material interacts with RF waves (reflection/absorption).
Theorem: NeRF Training Objective
The training loss combines received power prediction error and a regularisation term:
where is the set of measured transmitter--receiver pairs, , and controls weight decay.
The dB-domain loss is essential because received power spans orders of magnitude ( dB variation from path loss). Operating in dB normalises gradient magnitudes across near and far measurements, preventing nearby Tx--Rx pairs from dominating the training.
Why dB-domain loss
In linear domain, a measurement at 10 m ( dBm) and one at 1 m ( dBm) differ by a factor of . The loss in linear scale would be dominated by the near measurement. In dB, both contribute comparably to the gradient.
Regularisation role
The regulariser prevents overfitting when the measurement set is sparse. RF datasets have -- measurements, versus -- pixels in optical NeRF. Without regularisation, the MLP memorises training points and fails on novel positions.
Definition: Positional Encoding for RF Scenes
Positional Encoding for RF Scenes
NeRF uses the standard positional encoding
with for position and for direction. The maximum encoded frequency is rad/m, corresponding to a spatial scale of mm.
For indoor RF scenes, the relevant spatial scales range from cm (wall boundaries) to m (room dimensions). provides adequate bandwidth. Over-parameterising causes the network to fit measurement noise; under-parameterising produces over-smooth reconstructions that miss shadow boundaries.
Example: NeRF for Indoor RSS Prediction
An indoor office ( m) has 4 Wi-Fi access points at known positions. A site survey collects RSS at 500 receiver locations. Train NeRF and predict RSS at a new receiver location m.
Data preparation
Training set: 500 receiver locations 4 APs (Tx, Rx, RSS) triples. Positional encoding: (position), (direction). Normalise RSS to range.
Training
MLP: 8 layers, 256 hidden units, ReLU activations. Optimiser: Adam, learning rate , batch size 256 rays. Train for 50,000 iterations ( min on a single GPU). Validation: hold out 100 measurements for early stopping.
Prediction
For each AP, cast a ray from to , sample 128 points, evaluate the MLPs, and render received power via the discrete volume rendering sum. Typical accuracy: 3--5 dB RMSE, which is competitive with site-specific ray tracing.
The Multipath Limitation
NeRF integrates along the direct ray only. This captures line-of-sight attenuation and through-wall propagation but misses reflected and diffracted paths. In indoor environments, multipath contributes 30--50% of received power.
Why this matters: In a room with a concrete wall blocking the direct path, NeRF predicts near-zero received power. In reality, reflections off the ceiling and floor may provide usable signal strength.
Extensions address this via:
- Casting multiple rays (reflected rays via learned or geometric reflection models) --- see WiNeRT in Section 24.4;
- Adding a learned multipath correction network;
- Using the geometry MLP to identify reflective surfaces and trace secondary rays.
Common Mistake: Overfitting with Sparse RF Measurements
Mistake:
Training a large NeRF model (8 layers 256 units 530k parameters) on a few hundred RSS measurements without regularisation, and expecting it to generalise.
Correction:
RF datasets are orders of magnitude smaller than optical datasets. Mitigation strategies:
- Weight decay (--);
- Smaller MLPs (4 layers 128 units);
- Hash encoding (Instant-NGP) with smaller tables ( instead of );
- Early stopping on a validation set;
- Data augmentation: add noise to Tx/Rx positions.
RF NeRF Volume Rendering
Visualise the RF NeRF volume rendering process along a single ray. The top panel shows the volume density profile (walls appear as density peaks). The bottom panel shows the accumulated complex signal magnitude and the final rendered received power. Observe how increasing the number of samples improves accuracy near sharp density transitions (walls), and how changing the frequency shifts the interference pattern.
Parameters
NeRF for Network Planning
NeRF is most practical for offline network planning where a site survey collects RSS measurements, the model trains overnight, and the trained model predicts coverage maps for AP placement optimisation. Real-time channel prediction (e.g., for beam management) requires faster inference methods.
Practical numbers: Training on 2,000 measurements takes minutes. Inference for 10,000 grid points takes seconds (batched GPU evaluation). Accuracy: 3--5 dB RMSE, comparable to calibrated ray tracing but without requiring a detailed 3D building model.
- β’
Requires known Tx/Rx positions (GPS or floor plan coordinates)
- β’
Per-scene training: a new model is needed for each environment
- β’
Single-frequency training; multi-frequency requires architecture changes
Quick Check
Why does NeRF use a dB-domain loss function rather than a linear-domain MSE?
dB is more intuitive for engineers
To normalise gradients across measurements with widely varying power levels
To make the loss function convex
To enable closed-form optimisation
Correct. Received power varies by orders of magnitude ( dB range). The dB-domain loss ensures that measurements at all distances contribute comparably to the gradient.
NeRF for RF Propagation Modelling
Zhao et al. introduced the first neural radiance field for RF propagation modelling. By replacing the colour output with a complex signal MLP and training on sparse RSS measurements, NeRF achieves 3--5 dB RSS prediction accuracy in indoor environments without requiring a detailed 3D building model. The key insight is that the volume density MLP implicitly learns the scene geometry (walls, furniture) from signal measurements alone. Limitations include single-ray integration (no multipath) and per-scene training.
Key Takeaway
NeRF learns a volumetric RF scene representation from sparse RSS measurements, predicting received power between arbitrary Tx--Rx pairs. The dB-domain loss normalises gradients across the large dynamic range of RF signals. The primary limitation is single-ray integration, which misses multipath contributions significant in indoor environments. Extensions addressing multipath, channel estimation, and material properties are surveyed in Section 24.4.