Comparing SDR, Manifold, and Element-wise

One Problem, Three Answers

We have three algorithms for the same unit-modulus QCQP. This section quantifies the quality and runtime tradeoffs across realistic RIS sizes, helping you pick the right tool for your deployment.

Algorithm Comparison for Unit-Modulus QCQP

PropertySDR (Section 6.2)Manifold (Section 6.3)Element-wise (Section 6.4)
Complexity per callO(N6.5)O(N^{6.5})O(TN2)O(T N^2), T∼50T \sim 50O(SN2)O(S N^2), S∼5S \sim 5
Quality guarantee(Ο€/4)(\pi/4)-optimal, tight in practiceStationary point (typically near-optimal)Coordinate stationary (local)
Single-user optimal?Yes (rank-1 case)YesYes (one sweep)
ScalabilityN≀128N \leq 128 practicalN≀1024N \leq 1024 practicalN≀4096N \leq 4096 practical
Warm-start friendly?YesYesVery yes (1-2 sweeps)
Real-time suitable?No (>1> 1 s)Moderate (∼100\sim 100 ms)Yes (<1< 1 ms)
Typical gap to global<1%< 1\%11-3%3\%11-10%10\%
Implementation effortHigh (SDP solver)Medium (Manopt recommended)Low (closed-form sweep)

Theorem: Algorithm Quality Hierarchy

For the unit-modulus QCQP with PSD A\mathbf{A}, the three algorithms have the quality ordering

fSDRubβ‰₯fSDRrandβ‰₯fmanifoldβ‰₯felementwise.f_{\text{SDR}}^{\text{ub}} \geq f_{\text{SDR}}^{\text{rand}} \geq f_{\text{manifold}} \geq f_{\text{elementwise}}.

The leftmost is an upper bound (not achievable in general); the right three are lower bounds (achieved by the respective algorithms). The gaps are:

  • fSDRubβˆ’fSDRrandf_{\text{SDR}}^{\text{ub}} - f_{\text{SDR}}^{\text{rand}}: Gaussian randomization sub-optimality, typically <1%< 1\% for L=1000L = 1000.
  • fSDRrandβˆ’fmanifoldf_{\text{SDR}}^{\text{rand}} - f_{\text{manifold}}: small, typically <1%< 1\%.
  • fmanifoldβˆ’felementwisef_{\text{manifold}} - f_{\text{elementwise}}: small at rank 1 (zero), larger at high rank.

Equality holds for rank-1 problems: all three algorithms achieve the global optimum.

SDR, Manifold, Element-wise Benchmark

Benchmark all three algorithms on the same problem instance. Plot achieved objective and runtime across NN from 16 to 256. Element-wise has the lowest runtime across the board but the largest gap; SDR becomes impractical at N>128N > 128.

Parameters
64
4
15
πŸŽ“CommIT Contribution(2023)

Fast Manifold Optimization for Array-Fed RIS

G. Caire, I. Atzeni β€” IEEE Trans. Signal Process. (preprint)

Caire and collaborators (2023) adapt manifold optimization to the array-fed RIS architecture, exploiting the low-rank structure of the BS-RIS near-field channel. The key insight: the RIS aperture only needs to support r≀Kr \leq K eigenmodes (one per user), so the manifold optimization can be restricted to an rr-dimensional submanifold of the full torus. This reduces the per-iteration cost from O(N2)O(N^2) to O(Nr)O(Nr) and converges in 10-20 iterations instead of 50-100. The algorithmic contribution enables real-time AO at N=2048N = 2048 and K=8K = 8 with ∼5\sim 5 ms total optimization time β€” a 10Γ—10\times speedup over generic Manopt. The result is directly instantiated in Chapter 11's array-fed architecture.

manifoldarray-fed-risreal-timecaire-2023

Example: Choosing the Algorithm: Three Scenarios

Recommend the best algorithm for each scenario:

(a) Research benchmark: N=64N = 64, K=4K = 4, goal: tightest possible bound on achievable rate. (b) Real-time deployment: N=512N = 512, K=8K = 8, coherence block 20 ms20\,\text{ms}, goal: best rate within 10 ms. (c) Ultra-low-power IoT RIS: N=32N = 32, K=1K = 1, goal: minimal compute, "good enough" rate.

Key Takeaway

The three algorithms form a quality-speed Pareto frontier. SDR for quality, element-wise for speed, manifold for a balanced middle ground. At production scale (Nβ‰₯256N \geq 256), manifold with warm-starting is the typical choice; SDR is reserved for offline benchmarking; element-wise is the fallback for compute-constrained IoT or low-power scenarios. Rank-1 (single-user) problems are special: all three achieve the global optimum, and element-wise's speed wins unambiguously.

Common Mistake: Don't Use SDR Where It Doesn't Scale

Mistake:

"SDR has the best guarantees, so I'll use it even at N=1024N = 1024."

Correction:

SDR's O(N6.5)O(N^{6.5}) complexity makes N=1024N = 1024 absolutely infeasible: 10246.5β‰ˆ10191024^{6.5} \approx 10^{19} flops, essentially intractable. Even N=256N = 256 takes minutes per SDP solve. For N>128N > 128, use manifold. The quality loss vs. SDR is typically <1%< 1\% β€” negligible compared to the 1000Γ—1000\times speedup. Never force SDR onto a problem where it can't run in a reasonable time; the right choice is always the algorithm that fits the time budget.

Historical Note: The Three Traditions

2000s–2020s

The three algorithms trace to different mathematical traditions:

  • SDR: From convex optimization and operations research. Goemans–Williamson (1995) introduced (Ο€/4)(\pi/4)-style approximations for MAX-CUT; Luo, Ma, and others extended to signal processing in the 2000s.
  • Manifold optimization: From differential geometry. Absil, Mahony, Sepulchre (2008) codified the unified framework; Manopt (Boumal et al. 2014) made it accessible to signal-processing researchers.
  • Element-wise BCD: From classical numerical analysis. Gauss-Seidel dates to 1823; its generalization to non-convex problems was matured by Powell (1970s) and Bertsekas (1990s).

All three converged on the RIS QCQP in ∼2019\sim 2019–20202020 as the unified algorithmic toolkit for passive beamforming. The fact that three independent mathematical traditions all land on the same problem is a sign the problem is fundamental β€” and that the answer will generalize to future programmable-environment paradigms.

Quick Check

For an N=512N = 512 RIS panel running in a real-time deployment (10 ms coherence block), which passive-beamforming algorithm is most appropriate?

SDR with Gaussian randomization

Manifold optimization with warm-starting

Grid search over 3-bit phases

Exhaustive brute force

Semidefinite Relaxation (SDR)

A convex relaxation technique: lift the vector variable Ο•\boldsymbol{\phi} to a PSD matrix X=ϕϕH\mathbf{X} = \boldsymbol{\phi}\boldsymbol{\phi}^H, relax the rank-1 constraint, solve the resulting SDP, and recover a feasible solution via Gaussian randomization. Provides a tight upper bound on the non-convex QCQP optimum but has O(N6.5)O(N^{6.5}) complexity, limiting practical use to N≀128N \leq 128.

Related: Manifold Optimization, Element Wise Bcd

Riemannian Manifold Optimization

Iterative optimization that respects the geometry of a constraint manifold (here, the complex unit-modulus torus). The Euclidean gradient is projected onto the tangent space; after a step, a retraction maps back to the manifold. Scales as O(N2)O(N^2) per iteration, making it the workhorse algorithm for large-NN RIS optimization.

Related: SDR Provides a Tight Upper Bound, Element Wise Bcd, Tangent Space

Why This Matters: Same Algorithms, Other Problems

The unit-modulus QCQP reappears in many wireless contexts: radar waveform design (minimize sidelobe level subject to constant envelope), sensor array beamforming, phase retrieval (recover signal from ∣Ax∣2|\mathbf{A}\mathbf{x}|^2), and hybrid analog-digital beamforming (phase shifters in RF chains). The algorithms of this chapter apply unchanged. The RIS community is a major user of these algorithms, but the tools are not RIS-specific β€” understanding them gives you a toolkit usable across wireless signal processing. Chapter 13 (RIS-ISAC) shows a direct transfer of these tools to the joint comm-radar objective.

See full treatment in Joint Sensing-Communication Signal Model