NeRF-Squared (Zhao et al.)

The First NeRF for Wireless Propagation

NeRF2^2 (Zhao et al., 2023) was among the first works to adapt neural radiance fields for radio-frequency propagation modelling. Rather than rendering images, NeRF2^2 predicts received signal strength (RSS) and channel state information (CSI) between arbitrary transmitter--receiver pairs by learning a volumetric representation of the RF environment. The key question: can a single neural scene function, trained on sparse measurements, predict RF propagation more accurately than ray tracing or empirical models?

Definition:

NeRF2^2 Architecture

NeRF2^2 represents the RF environment as two MLPs:

  1. Geometry MLP fΟƒf_\sigma: maps position x∈R3\mathbf{x} \in \mathbb{R}^3 (with positional encoding Ξ³(x)\gamma(\mathbf{x})) to volume density Οƒ(x)β‰₯0\sigma(\mathbf{x}) \geq 0 and an intermediate feature vector h∈R256\mathbf{h} \in \mathbb{R}^{256}.

  2. Signal MLP fsf_s: maps the feature vector h\mathbf{h}, ray direction d\mathbf{d}, and frequency ff to a complex signal contribution s(x,d,f)∈Cs(\mathbf{x}, \mathbf{d}, f) \in \mathbb{C}.

The predicted received power between a transmitter at xtx\mathbf{x}_{\mathrm{tx}} and receiver at xrx\mathbf{x}_{\mathrm{rx}} is:

P^rx=βˆ£βˆ‘i=1NTiΞ±i si eβˆ’j2Ο€fβˆ₯xiβˆ’xtxβˆ₯/c∣2,\hat{P}_{\mathrm{rx}} = \Bigl|\sum_{i=1}^{N} T_i \alpha_i\, s_i\,e^{-j 2\pi f \|\mathbf{x}_i - \mathbf{x}_{\mathrm{tx}}\|/c}\Bigr|^2,

with the sum taken over NN samples along the direct ray from xtx\mathbf{x}_{\mathrm{tx}} to xrx\mathbf{x}_{\mathrm{rx}}.

The architecture mirrors optical NeRF: the geometry MLP captures where material exists (walls, furniture), while the signal MLP captures how that material interacts with RF waves (reflection/absorption).

Theorem: NeRF2^2 Training Objective

The training loss combines received power prediction error and a regularisation term:

L(ΞΈ)=1∣Dβˆ£βˆ‘(xtx,xrx)∈D∣P^rx(dB)βˆ’Prx(dB)∣2+Ξ»βˆ₯ΞΈβˆ₯22,\mathcal{L}(\theta) = \frac{1}{|\mathcal{D}|}\sum_{(\mathbf{x}_{\mathrm{tx}}, \mathbf{x}_{\mathrm{rx}}) \in \mathcal{D}} \bigl|\hat{P}_{\mathrm{rx}}^{(\mathrm{dB})} - P_{\mathrm{rx}}^{(\mathrm{dB})}\bigr|^2 + \lambda \|\theta\|_2^2,

where D\mathcal{D} is the set of measured transmitter--receiver pairs, P^rx(dB)=10log⁑10P^rx\hat{P}_{\mathrm{rx}}^{(\mathrm{dB})} = 10\log_{10}\hat{P}_{\mathrm{rx}}, and λ\lambda controls weight decay.

The dB-domain loss is essential because received power spans orders of magnitude (∼100\sim 100 dB variation from path loss). Operating in dB normalises gradient magnitudes across near and far measurements, preventing nearby Tx--Rx pairs from dominating the training.

Definition:

Positional Encoding for RF Scenes

NeRF2^2 uses the standard positional encoding

Ξ³(x)=[sin⁑(20Ο€x),β€…β€Šcos⁑(20Ο€x),β€…β€Šβ€¦,β€…β€Šsin⁑(2Lβˆ’1Ο€x),β€…β€Šcos⁑(2Lβˆ’1Ο€x)]\gamma(\mathbf{x}) = \bigl[\sin(2^0 \pi \mathbf{x}),\; \cos(2^0 \pi \mathbf{x}),\;\ldots,\; \sin(2^{L-1} \pi \mathbf{x}),\; \cos(2^{L-1} \pi \mathbf{x})\bigr]

with L=10L = 10 for position and L=4L = 4 for direction. The maximum encoded frequency is 2Lβˆ’1Ο€β‰ˆ16082^{L-1}\pi \approx 1608 rad/m, corresponding to a spatial scale of ∼4\sim 4 mm.

For indoor RF scenes, the relevant spatial scales range from ∼1\sim 1 cm (wall boundaries) to ∼10\sim 10 m (room dimensions). L=10L = 10 provides adequate bandwidth. Over-parameterising LL causes the network to fit measurement noise; under-parameterising produces over-smooth reconstructions that miss shadow boundaries.

,

Example: NeRF2^2 for Indoor RSS Prediction

An indoor office (20Γ—1520 \times 15 m) has 4 Wi-Fi access points at known positions. A site survey collects RSS at 500 receiver locations. Train NeRF2^2 and predict RSS at a new receiver location xrx=(8,6,1.5)\mathbf{x}_{\mathrm{rx}} = (8, 6, 1.5) m.

The Multipath Limitation

NeRF2^2 integrates along the direct ray only. This captures line-of-sight attenuation and through-wall propagation but misses reflected and diffracted paths. In indoor environments, multipath contributes 30--50% of received power.

Why this matters: In a room with a concrete wall blocking the direct path, NeRF2^2 predicts near-zero received power. In reality, reflections off the ceiling and floor may provide usable signal strength.

Extensions address this via:

  • Casting multiple rays (reflected rays via learned or geometric reflection models) --- see WiNeRT in Section 24.4;
  • Adding a learned multipath correction network;
  • Using the geometry MLP to identify reflective surfaces and trace secondary rays.
,

Common Mistake: Overfitting with Sparse RF Measurements

Mistake:

Training a large NeRF2^2 model (8 layers Γ—\times 256 units β‰ˆ\approx 530k parameters) on a few hundred RSS measurements without regularisation, and expecting it to generalise.

Correction:

RF datasets are orders of magnitude smaller than optical datasets. Mitigation strategies:

  • Weight decay (Ξ»=10βˆ’4\lambda = 10^{-4}--10βˆ’310^{-3});
  • Smaller MLPs (4 layers Γ—\times 128 units);
  • Hash encoding (Instant-NGP) with smaller tables (T=216T = 2^{16} instead of 2192^{19});
  • Early stopping on a validation set;
  • Data augmentation: add noise to Tx/Rx positions.

RF NeRF Volume Rendering

Visualise the RF NeRF volume rendering process along a single ray. The top panel shows the volume density Οƒ(x)\sigma(\mathbf{x}) profile (walls appear as density peaks). The bottom panel shows the accumulated complex signal magnitude and the final rendered received power. Observe how increasing the number of samples improves accuracy near sharp density transitions (walls), and how changing the frequency shifts the interference pattern.

Parameters
128
5
πŸ”§Engineering Note

NeRF2^2 for Network Planning

NeRF2^2 is most practical for offline network planning where a site survey collects RSS measurements, the model trains overnight, and the trained model predicts coverage maps for AP placement optimisation. Real-time channel prediction (e.g., for beam management) requires faster inference methods.

Practical numbers: Training on 2,000 measurements takes ∼30\sim 30 minutes. Inference for 10,000 grid points takes ∼10\sim 10 seconds (batched GPU evaluation). Accuracy: 3--5 dB RMSE, comparable to calibrated ray tracing but without requiring a detailed 3D building model.

Practical Constraints
  • β€’

    Requires known Tx/Rx positions (GPS or floor plan coordinates)

  • β€’

    Per-scene training: a new model is needed for each environment

  • β€’

    Single-frequency training; multi-frequency requires architecture changes

Quick Check

Why does NeRF2^2 use a dB-domain loss function rather than a linear-domain MSE?

dB is more intuitive for engineers

To normalise gradients across measurements with widely varying power levels

To make the loss function convex

To enable closed-form optimisation

πŸŽ“CommIT Contribution(2023)

NeRF2^2 for RF Propagation Modelling

Y. Zhao, X. Zhu, Y. C. Eldar β€” arXiv:2305.06118

Zhao et al. introduced the first neural radiance field for RF propagation modelling. By replacing the colour output with a complex signal MLP and training on sparse RSS measurements, NeRF2^2 achieves 3--5 dB RSS prediction accuracy in indoor environments without requiring a detailed 3D building model. The key insight is that the volume density MLP implicitly learns the scene geometry (walls, furniture) from signal measurements alone. Limitations include single-ray integration (no multipath) and per-scene training.

nerfrf-propagationrss-predictionneural-fields

Key Takeaway

NeRF2^2 learns a volumetric RF scene representation from sparse RSS measurements, predicting received power between arbitrary Tx--Rx pairs. The dB-domain loss normalises gradients across the large dynamic range of RF signals. The primary limitation is single-ray integration, which misses multipath contributions significant in indoor environments. Extensions addressing multipath, channel estimation, and material properties are surveyed in Section 24.4.