Volume Rendering Adapted for RF

The RF Rendering Equation

We now derive the complete RF volume rendering equation from first principles, connecting the neural NeRF framework to the physics-based forward model of Chapters 5--6. The central question is: under what conditions does neural volumetric rendering agree with the Born-approximation forward model y=Ac+w\mathbf{y} = \mathbf{A}\mathbf{c} + \mathbf{w}?

The answer reveals both the power and the limitations of the NeRF approach for RF imaging.

,

Theorem: RF Volume Rendering Equation

For a monochromatic RF signal at frequency ff, the received complex baseband signal along a ray r(t)=o+td\mathbf{r}(t) = \mathbf{o} + t\mathbf{d} from the transmitter at origin o\mathbf{o} is:

S^(r,f)=∫tntfT(t,f) σ(r(t)) ρ(r(t),d,f) eβˆ’jΞΊβ‹…2t dt,\hat{S}(\mathbf{r}, f) = \int_{t_n}^{t_f} T(t, f)\,\sigma(\mathbf{r}(t))\, \rho(\mathbf{r}(t), \mathbf{d}, f)\, e^{-j\kappa \cdot 2t}\,dt,

where ρ(x,d,f)∈C\rho(\mathbf{x}, \mathbf{d}, f) \in \mathbb{C} is the complex reflectivity, the factor eβˆ’jΞΊβ‹…2te^{-j\kappa \cdot 2t} captures round-trip propagation phase (ΞΊ=2Ο€f/c\kappa = 2\pi f/c), and the RF transmittance is:

T(t,f)=exp⁑ ⁣(βˆ’βˆ«tntΞ±(r(s),f) ds),T(t, f) = \exp\!\left(-\int_{t_n}^{t} \alpha(\mathbf{r}(s), f)\,ds\right),

with Ξ±(x,f)\alpha(\mathbf{x}, f) the frequency-dependent attenuation coefficient.

Each point along the ray contributes a complex phasor to the received signal. The transmittance TT accounts for cumulative absorption (energy lost traversing walls). The reflectivity ρ\rho determines how much and at what phase the material scatters. The exponential carries the round-trip propagation delay.

Definition:

Frequency-Dependent RF Transmittance

Unlike optical transmittance, the RF transmittance depends on frequency:

T(t,f)=exp⁑ ⁣(βˆ’βˆ«tntΞ±(r(s),f) ds).T(t, f) = \exp\!\left(-\int_{t_n}^{t} \alpha(\mathbf{r}(s), f)\,ds\right).

For a material with linear frequency dependence Ξ±(x,f)=Ξ±0(x)+Ξ±1(x)f\alpha(\mathbf{x}, f) = \alpha_0(\mathbf{x}) + \alpha_1(\mathbf{x}) f, the transmittance factors as:

T(t,f)=T0(t)β‹…T1(t,f),T0(t)=eβˆ’βˆ«Ξ±0 ds,T1(t,f)=eβˆ’f∫α1 ds.T(t, f) = T_0(t) \cdot T_1(t, f), \qquad T_0(t) = e^{-\int \alpha_0\,ds}, \quad T_1(t, f) = e^{-f\int \alpha_1\,ds}.

This factorisation is useful for multi-frequency training: the geometry MLP learns Ξ±0\alpha_0 (frequency-independent loss), while the signal MLP learns Ξ±1\alpha_1 (frequency-dependent dispersion).

Definition:

View-Dependent RF Reflectivity

The complex reflectivity ρ(x,d,f)\rho(\mathbf{x}, \mathbf{d}, f) depends on position, viewing direction, and frequency. For a planar surface with normal n^\hat{\mathbf{n}} at x\mathbf{x}, the Fresnel reflection coefficient gives:

ρ(x,d,f)=Ξ“(ΞΈi,Ξ΅r(f)) δ(dβˆ’dr),\rho(\mathbf{x}, \mathbf{d}, f) = \Gamma(\theta_i, \varepsilon_r(f))\, \delta(\mathbf{d} - \mathbf{d}_r),

where ΞΈi=arccos⁑(∣dβ‹…n^∣)\theta_i = \arccos(|\mathbf{d} \cdot \hat{\mathbf{n}}|) is the incidence angle, dr=dβˆ’2(dβ‹…n^)n^\mathbf{d}_r = \mathbf{d} - 2(\mathbf{d} \cdot \hat{\mathbf{n}})\hat{\mathbf{n}} is the specular reflection direction, and Ξ“\Gamma is the Fresnel coefficient.

In the neural field, the MLP implicitly learns an approximation to this directional dependence. The stronger the specular component, the more capacity the MLP must allocate to directional features.

Theorem: Connection to the Born Forward Model

Under the Born approximation (weak scattering, single scattering), the RF volume rendering equation reduces to the linear forward model:

S^(r,f)β‰ˆβˆ«Ξ©Ο(p) eβˆ’jΞΊ(βˆ₯pβˆ’sβˆ₯+βˆ₯pβˆ’rβˆ₯) dp,\hat{S}(\mathbf{r}, f) \approx \int_\Omega \rho(\mathbf{p})\, e^{-j\kappa(\|\mathbf{p} - \mathbf{s}\| + \|\mathbf{p} - \mathbf{r}\|)}\, d\mathbf{p},

which is the continuous form of y=Ac\mathbf{y} = \mathbf{A}\mathbf{c}. Specifically, when:

  1. T(t,f)β‰ˆ1T(t, f) \approx 1 everywhere (negligible attenuation --- weak scattering);
  2. Οƒ(x)\sigma(\mathbf{x}) is an indicator function of the scatterer support;
  3. ρ\rho is view-independent (isotropic scattering);

then the neural rendering integral becomes the Born integral with the sensing kernel given by the free-space Green's function.

,

Example: 1D RF Volume Rendering with Two Walls

A ray traverses two walls at distances t1=3t_1 = 3 m and t2=7t_2 = 7 m. Each wall has density Οƒ=5\sigma = 5 mβˆ’1^{-1} and thickness 0.20.2 m. Free space has Οƒ=0\sigma = 0. The reflectivities are ρ1=0.8ejΟ€/4\rho_1 = 0.8 e^{j\pi/4} and ρ2=0.3eβˆ’jΟ€/6\rho_2 = 0.3 e^{-j\pi/6}. At f=5f = 5 GHz, compute the rendered received signal S^\hat{S} using N=4N = 4 representative samples.

Discrete RF Volume Rendering

Complexity: O(N)\mathcal{O}(N) per ray, where NN is the number of samples. Each sample requires one MLP evaluation: O(Nβ‹…CMLP)\mathcal{O}(N \cdot C_{\mathrm{MLP}}) total, where CMLPC_{\mathrm{MLP}} is the cost of a forward pass.
Input: ray samples {(x_i, d)}, MLP F_ΞΈ, frequency f
Output: complex received signal Ŝ
1. For i = 1 to N:
2. (Οƒ_i, ρ_i) ← F_ΞΈ(Ξ³(x_i), Ξ³(d), f)
3. Ξ΄_i ← t_{i+1} - t_i
4. Ξ±_i ← 1 - exp(-Οƒ_i Β· Ξ΄_i)
5. phase_i ← exp(-j Β· 2ΞΊ Β· t_i) // ΞΊ = 2Ο€f/c
6. T_1 ← 1
7. Ŝ ← 0
8. For i = 1 to N:
9. Ŝ ← Ŝ + T_i Β· Ξ±_i Β· ρ_i Β· phase_i
10. T_{i+1} ← T_i Β· (1 - Ξ±_i)
11. Return Ŝ

The algorithm is identical to optical volume rendering except for lines 2 (complex output), 5 (round-trip phase), and 9 (complex accumulation). Backpropagation through this algorithm is straightforward because all operations are differentiable.

,

When NeRF Disagrees with Born

The connection between neural volumetric rendering and the Born forward model breaks down when:

  1. Strong scattering: Attenuation is significant (Tβ‰ͺ1T \ll 1), violating the Born single-scattering assumption. The NeRF transmittance T(t,f)T(t, f) captures this naturally, while the Born model does not.

  2. Multiple scattering: The Born model assumes single scattering; NeRF's single-ray rendering likewise misses multi-bounce paths. Both fail in the same way.

  3. View dependence: The Born model assumes isotropic scattering (ρ\rho independent of d\mathbf{d}). NeRF can model view-dependent reflectivity, which is necessary for specular surfaces but adds model complexity.

The point is that NeRF extends the Born model by adding attenuation and view dependence, but both share the limitation of single-scattering. Multi-bounce extensions (WiNeRT) address this at the cost of training complexity.

,

Example: Born Model as a Special Case of RF-NeRF

Show explicitly that for a 2D scene with QQ point scatterers at positions {pq}q=1Q\{\mathbf{p}_{q}\}_{q=1}^Q with reflectivities {cq}\{c_q\}, and a single Tx at s\mathbf{s} and Rx at r\mathbf{r}, the Born forward model y=βˆ‘qcq eβˆ’jΞΊ(βˆ₯pqβˆ’sβˆ₯+βˆ₯pqβˆ’rβˆ₯)+wy = \sum_q c_q\, e^{-j\kappa(\|\mathbf{p}_{q} - \mathbf{s}\| + \|\mathbf{p}_{q} - \mathbf{r}\|)} + w is recovered from the RF-NeRF rendering equation under the Born assumptions.

⚠️Engineering Note

Computational Cost of RF Volume Rendering

The computational bottleneck in RF-NeRF is the MLP evaluation at each sample point. For a scene with RR Tx-Rx pairs, NN samples per ray, and KK frequency subcarriers, the total number of MLP evaluations per training iteration is:

C=Bβ‹…Nβ‹…K,C = B \cdot N \cdot K,

where BB is the batch size (number of Tx-Rx pairs per batch).

Practical numbers: B=256B = 256, N=128N = 128, K=234K = 234 (80 MHz Wi-Fi CSI) gives Cβ‰ˆ7.7Γ—106C \approx 7.7 \times 10^6 MLP evaluations per iteration. With an 8-layer, 256-unit MLP, this takes ∼50\sim 50 ms on an A100 GPU. For 100k iterations: ∼83\sim 83 minutes of training.

Mitigation: Hash encoding reduces MLP cost by 10Γ—10\times. Frequency-sharing (evaluating the geometry MLP once per spatial point and only the signal MLP per frequency) reduces cost by ∼K/2\sim K/2.

Practical Constraints
  • β€’

    GPU memory limits batch size for large Nβ‹…KN \cdot K products

  • β€’

    Multi-frequency rendering requires KK signal MLP evaluations per sample

  • β€’

    Hash encoding memory: ∼64\sim 64 MB for T=219T = 2^{19}, F=2F = 2, L=16L = 16

,

RF Volume Rendering vs Born Forward Model

Compare the Born forward model (no attenuation, single scattering) with full RF volume rendering (with attenuation) for a 1D scene. When Ξ±0=0\alpha_0 = 0, both agree perfectly. As attenuation increases, the Born model overestimates contributions from distant scatterers (it ignores shadowing by nearer objects), while the volume rendering correctly applies transmittance.

Parameters
3
0
5
,

Common Mistake: Assuming Born Validity for Strong Scatterers

Mistake:

Using the Born forward model y=Ac+w\mathbf{y} = \mathbf{A}\mathbf{c} + \mathbf{w} for scenes with thick concrete walls or metallic objects, where through-wall attenuation exceeds 10 dB.

Correction:

The Born approximation assumes negligible attenuation (Tβ‰ˆ1T \approx 1). For strong scatterers, the NeRF transmittance model naturally handles shadowing, while the Born model produces physically incorrect predictions. Use the full RF volume rendering equation, or at minimum validate Born predictions against measurements.

Open Problems in RF Volume Rendering

Several fundamental challenges remain:

  1. Multi-bounce rendering: Single-ray NeRF captures only direct paths. WiNeRT adds reflections but at 3Γ—3\times cost. Efficient differentiable multi-bounce rendering remains open.

  2. Diffraction: Volume rendering handles absorption and specular reflection but not diffraction around edges. Incorporating the uniform theory of diffraction (UTD) into neural rendering is an active research direction.

  3. Generalisation: Current RF-NeRFs train per-scene. Foundation models that generalise across environments (trained on diverse RF scenes) would mirror the trajectory of optical NeRF generalisation.

  4. Phase coherence: Many practical systems measure only power (no phase). Training from power-only measurements introduces a phase-retrieval problem within the NeRF framework (see Chapter 16).

,

Quick Check

Under which condition does the RF volume rendering equation reduce to the Born forward model?

When the MLP has infinite capacity

When T(t,f)β‰ˆ1T(t, f) \approx 1, scattering is isotropic, and density is supported on the scatterer locations

When the frequency is above 100 GHz

When hash encoding is used instead of positional encoding

RF Reflectivity

The complex-valued function ρ(x,d,f)\rho(\mathbf{x}, \mathbf{d}, f) describing how a material at position x\mathbf{x} scatters an incident RF wave from direction d\mathbf{d} at frequency ff. The magnitude ∣ρ∣|\rho| determines the scattered power; the phase ∠ρ\angle \rho determines the phase shift upon reflection. For specular surfaces, ρ\rho is strongly peaked around the specular direction.

Related: Volume Density

Key Takeaway

The RF volume rendering equation integrates complex-valued reflectivity, frequency-dependent attenuation, and round-trip propagation phase along each ray. Under the Born-approximation assumptions (no attenuation, isotropic scattering), it reduces exactly to the linear forward model of Chapters 5--6. The NeRF framework extends the Born model by naturally handling shadowing (via transmittance) and specular scattering (via view-dependent reflectivity), at the cost of per-scene neural network training.