Joint Communication-Sensing-Imaging Optimization

Beyond CRB: Optimising for Reconstruction Quality

The ISAC literature (Chapter 29) optimises the tradeoff between communication rate and sensing accuracy, measured by the CRB. But the CRB measures estimation accuracy for a single parameter β€” it does not capture the quality of a reconstructed image. A waveform that minimises the CRB for delay estimation may produce a poor image if it concentrates k-space coverage in one direction.

The final step in the convergence of communication, sensing, and imaging is to optimise the entire pipeline end-to-end: from waveform design, through sensing and reconstruction, to the communication feedback. This section develops the imaging-aware ISAC framework, where the objective is reconstruction quality (NMSE, SSIM) rather than CRB.

,

Definition:

Joint Communication-Sensing-Imaging (JCSI) Framework

The JCSI framework optimises the entire pipeline:

max⁑W,Ξ±R(W)s.t.Jimg(Ξ±,W)≀ϡimg,βˆ₯Wβˆ₯F2≀Pt\max_{\mathbf{W}, \boldsymbol{\alpha}} \quad R(\mathbf{W}) \quad \text{s.t.} \quad \mathcal{J}_{\mathrm{img}}(\boldsymbol{\alpha}, \mathbf{W}) \leq \epsilon_{\mathrm{img}}, \quad \|\mathbf{W}\|_F^2 \leq P_t

where:

  • W\mathbf{W} is the joint transmit waveform (precoding + pilots),
  • Ξ±\boldsymbol{\alpha} are the reconstruction algorithm parameters (regularisation weights, denoiser parameters),
  • R(W)R(\mathbf{W}) is the communication sum-rate,
  • Jimg(Ξ±,W)\mathcal{J}_{\mathrm{img}}(\boldsymbol{\alpha}, \mathbf{W}) is the imaging quality metric (e.g., expected NMSE of the reconstructed scene),
  • PtP_t is the transmit power budget.

The key difference from standard ISAC is that Jimg\mathcal{J}_{\mathrm{img}} depends on the reconstruction algorithm, not just on the waveform β€” creating a bilevel optimisation: the outer level designs the waveform, the inner level reconstructs the image.

Standard ISAC uses CRB as the sensing metric, which is algorithm-independent but does not account for the quality of the reconstructed image. Imaging-aware ISAC replaces CRB with a reconstruction-dependent metric, making the design problem harder but more relevant.

,

Definition:

Rate-Imaging Pareto Frontier

The rate-imaging Pareto frontier is the set of achievable (rate, imaging quality) pairs:

P={(R,J):βˆƒβ€‰WΒ s.t.Β R(W)β‰₯R,β€…β€ŠJimg(W)≀J,β€…β€Šβˆ₯Wβˆ₯F2≀Pt}.\mathcal{P} = \{(R, \mathcal{J}) : \exists\, \mathbf{W} \text{ s.t. } R(\mathbf{W}) \geq R,\; \mathcal{J}_{\mathrm{img}}(\mathbf{W}) \leq \mathcal{J},\; \|\mathbf{W}\|_F^2 \leq P_t\}.

Points on the frontier are Pareto-optimal: no waveform can improve one metric without degrading the other.

At the extremes:

  • Communication-only: W\mathbf{W} maximises rate (e.g., SVD precoding), imaging quality is poor.
  • Imaging-only: W\mathbf{W} minimises Jimg\mathcal{J}_{\mathrm{img}} (e.g., orthogonal pilot grid), communication rate is zero.

The frontier between these extremes characterises the fundamental tradeoff.

Theorem: Gap Between CRB-Optimal and Imaging-Optimal Waveforms

For a scene with KK scatterers and an OFDM waveform with NfN_f subcarriers, the waveform that minimises the CRB for delay estimation concentrates power on the edge subcarriers (maximum bandwidth):

WCRB=arg⁑min⁑WCRB(Ο„)β€…β€ŠβŸΉβ€…β€Šβˆ£Wk∣2∝δ(kβˆ’1)+Ξ΄(kβˆ’Nf).\mathbf{W}_{\mathrm{CRB}} = \arg\min_{\mathbf{W}} \mathrm{CRB}(\tau) \implies |W_{k}|^2 \propto \delta(k - 1) + \delta(k - N_f).

The waveform that minimises imaging NMSE distributes power across subcarriers to ensure uniform k-space coverage:

Wimg=arg⁑min⁑WNMSE(Οƒ^)β€…β€ŠβŸΉβ€…β€Šβˆ£Wk∣2=Pt/Nfβˆ€k.\mathbf{W}_{\mathrm{img}} = \arg\min_{\mathbf{W}} \mathrm{NMSE}(\hat{\boldsymbol{\sigma}}) \implies |W_{k}|^2 = P_t / N_f \quad \forall k.

The NMSE gap between the two waveforms is:

NMSE(WCRB)/NMSE(Wimg)=O(Nf/K)\mathrm{NMSE}(\mathbf{W}_{\mathrm{CRB}}) / \mathrm{NMSE}(\mathbf{W}_{\mathrm{img}}) = O(N_f / K)

which can be >10> 10 dB for typical parameters (Nf=1024N_f = 1024, K=10K = 10).

CRB cares about the sharpest possible estimate of a single parameter (delay). Imaging cares about reconstructing a full scene, which requires uniform frequency coverage. Edge-only power gives excellent delay resolution but leaves most of k-space unilluminated, degrading the image.

,

Communication Rate vs. Imaging Quality Pareto Frontier

Explore the Pareto frontier between communication sum-rate and imaging reconstruction NMSE. The curve shows the achievable tradeoff for different waveform designs.

Compare the CRB-optimal point (good delay estimation, poor imaging) with the imaging-optimal point (good reconstruction, reduced rate) and the Pareto-optimal points in between.

Parameters
64
256
4
20

Example: Joint Waveform Design for ISAC with Imaging

A 64-antenna base station with 256 OFDM subcarriers at 28 GHz serves 4 users while imaging a scene with Ks=8K_s = 8 scatterers. The power budget is Pt=30P_t = 30 dBm and SNR=20\text{SNR} = 20 dB. Design the waveform allocation between communication and imaging, and compute the achievable rate and NMSE.

Definition:

End-to-End Learning for JCSI

The JCSI problem can be solved via end-to-end learning: jointly optimise the waveform W\mathbf{W}, the reconstruction algorithm parameters Ξ±\boldsymbol{\alpha}, and the communication precoder V\mathbf{V} by minimising a combined loss:

L=βˆ’ΞΌR(W,V)+(1βˆ’ΞΌ) Jimg(Ξ±,W)\mathcal{L} = -\mu R(\mathbf{W}, \mathbf{V}) + (1-\mu)\,\mathcal{J}_{\mathrm{img}}(\boldsymbol{\alpha}, \mathbf{W})

where μ∈[0,1]\mu \in [0, 1] controls the rate-imaging tradeoff.

The key technical challenge is that the reconstruction algorithm (LASSO, OAMP) contains non-differentiable operations (β„“1\ell_1 prox, hard thresholding). Deep unfolding (Chapter 18) addresses this by unrolling the algorithm into a differentiable computation graph, enabling end-to-end gradient-based optimisation.

This is the natural culmination of the deep unfolding ideas from Chapter 18: the unrolled reconstruction network is not just used for imaging, but is embedded in a larger end-to-end system that jointly optimises communication and imaging.

,

End-to-End JCSI Optimisation

Complexity: O(Eβ‹…Bβ‹…Lβ‹…Q2)O(E \cdot B \cdot L \cdot Q^2) where LL = unrolled iterations, QQ = image size
Input: Channel realisations {Ht}\{\mathbf{H}_t\}, scene ground truth {Οƒt}\{\boldsymbol{\sigma}_t\}, tradeoff ΞΌ\mu
Output: Waveform Wβˆ—\mathbf{W}^*, reconstruction parameters Ξ±βˆ—\boldsymbol{\alpha}^*, precoder Vβˆ—\mathbf{V}^*
1. Initialise: W←\mathbf{W} \leftarrow uniform power, α←\boldsymbol{\alpha} \leftarrow LASSO defaults, V←\mathbf{V} \leftarrow ZF precoder
2. for epoch =1,…,E= 1, \ldots, E do
3. \quad Sample batch {(Hb,Οƒb)}b=1B\{(\mathbf{H}_b, \boldsymbol{\sigma}_b)\}_{b=1}^{B}
4. \quad Forward pass:
5. \quad\quad Compute sensing returns: yb=A(W)Οƒb+wb\mathbf{y}_{b} = \mathbf{A}(\mathbf{W})\boldsymbol{\sigma}_b + \mathbf{w}_{b}
6. \quad\quad Reconstruct: Οƒ^b=fΞ±(yb,A(W))\hat{\boldsymbol{\sigma}}_b = f_{\boldsymbol{\alpha}}(\mathbf{y}_{b}, \mathbf{A}(\mathbf{W})) (unrolled OAMP)
7. \quad\quad Compute rate: Rb=log⁑2det⁑(I+SNR HbVVHHbH)R_b = \log_2\det(\mathbf{I} + \text{SNR}\,\mathbf{H}_b\mathbf{V}\mathbf{V}^H\mathbf{H}_b^H)
8. \quad\quad Compute imaging loss: Jb=βˆ₯Οƒ^bβˆ’Οƒbβˆ₯2/βˆ₯Οƒbβˆ₯2\mathcal{J}_b = \|\hat{\boldsymbol{\sigma}}_b - \boldsymbol{\sigma}_b\|^2/\|\boldsymbol{\sigma}_b\|^2
9. \quad Loss: L=1Bβˆ‘b[βˆ’ΞΌRb+(1βˆ’ΞΌ)Jb]\mathcal{L} = \frac{1}{B}\sum_b [-\mu R_b + (1-\mu)\mathcal{J}_b]
10. \quad Backward pass: βˆ‡W,Ξ±,VL\nabla_{\mathbf{W},\boldsymbol{\alpha},\mathbf{V}} \mathcal{L} via backpropagation through the unrolled graph
11. \quad Update: Adam step on (W,Ξ±,V)(\mathbf{W}, \boldsymbol{\alpha}, \mathbf{V})
12. end for
13. return (Wβˆ—,Ξ±βˆ—,Vβˆ—)(\mathbf{W}^*, \boldsymbol{\alpha}^*, \mathbf{V}^*)

The unrolled OAMP in step 6 must be differentiable. The soft thresholding operator SΞ»(β‹…)\mathcal{S}_\lambda(\cdot) is already differentiable; the OAMP Onsager correction is a linear operation. The key non-trivial gradient is through the sensing matrix A(W)\mathbf{A}(\mathbf{W}), which depends on the waveform.

Sensing Metrics: CRB vs. Imaging Quality

PropertyCRBImaging NMSEImaging SSIM
What it measuresSingle-parameter estimation boundGlobal pixel-wise errorPerceptual structural similarity
Algorithm-dependent?No (Fisher information only)Yes (depends on reconstruction)Yes (depends on reconstruction)
Captures scene structure?NoPartiallyYes
Optimisation difficultyClosed-form (matrix trace)Bilevel (waveform + reconstruction)Bilevel (waveform + reconstruction)
Suitable for imaging?No β€” misses k-space coverageYes β€” standard metricYes β€” correlates with perception

Quick Check

Why can the CRB-optimal waveform produce a poor image even though it achieves the best delay estimation accuracy?

It concentrates power at edge frequencies, leaving most of k-space unilluminated

It uses too much transmit power

It has lower SNR than the imaging-optimal waveform

Common Mistake: Using CRB as an Imaging Objective

Mistake:

Optimising an ISAC waveform for CRB minimisation and expecting good image reconstruction.

Correction:

CRB is a point-estimation metric: it measures how well you can estimate a single parameter (delay, angle). It does not account for the quality of a full scene reconstruction, which depends on k-space coverage, algorithm choice, and scene structure.

For imaging applications, use reconstruction-aware metrics (NMSE, SSIM, LPIPS) as the sensing objective. The CRB-optimal and imaging-optimal waveforms can differ by >10> 10 dB in NMSE (Theorem 30.3).

Historical Note: From Radar-Communication Coexistence to Joint Design

The relationship between radar and communication has evolved through three phases. In the 2000s, the focus was coexistence: ensuring that radar and communication systems sharing the same spectrum do not interfere with each other (spectrum sharing, interference mitigation). In the 2010s, the dual-function paradigm emerged: a single waveform serves both purposes (ISAC). The CRB-rate tradeoff became the standard design criterion.

The 2020s are bringing the third phase: joint design that accounts for the full imaging pipeline, not just point estimation. This reflects the recognition that sensing in 6G is not just "detecting a target at range rr" but "reconstructing a 3D environment map" β€” an imaging problem that requires different design criteria than the CRB.

,

Why This Matters: From ISAC to JCSI in 6G Standards

The 3GPP Release 19 study item on "AI/ML for NR air interface" includes sensing-assisted communication as a use case. Current proposals focus on CRB-based waveform design, but the imaging community's push toward reconstruction-aware metrics is beginning to influence standardisation. The JCSI framework of this section represents the research frontier: optimising waveforms for image quality rather than point estimation. We expect this to mature toward Release 20/21 as digital twin concepts gain traction in 6G discussions.

Joint Communication-Sensing-Imaging (JCSI)

An extension of ISAC that optimises the joint system for image reconstruction quality (NMSE, SSIM) rather than point-estimation accuracy (CRB). The optimisation is bilevel: the outer level designs the waveform, the inner level runs the reconstruction algorithm.

Related: {{Ref:Def Joint Csic}}

Imaging-Aware ISAC

An ISAC design philosophy where the sensing metric accounts for the quality of the reconstructed image, not just the accuracy of individual parameter estimates. This leads to different optimal waveforms than CRB-based ISAC, with more uniform k-space coverage.

Related: {{Ref:Thm Imaging Crb Gap}}

Key Takeaway

The JCSI framework goes beyond ISAC by optimising for reconstruction quality rather than CRB. The CRB-optimal waveform can produce images >10> 10 dB worse than the imaging-optimal waveform. End-to-end learning via deep unfolding jointly optimises the waveform, the reconstruction algorithm, and the communication precoder. The rate-imaging Pareto frontier characterises the fundamental tradeoff, with practical operating points achieving ∼80%\sim 80\% of maximum rate while maintaining good imaging quality.