Element-wise Block Coordinate Descent
The Simplest Algorithm That Works
SDR scales as ; manifold as per iteration. Element-wise BCD β updating one at a time in closed form β scales as per coordinate sweep. For single-user problems, element-wise is optimal (gives the global optimum). For multi-user, it plateaus at a local optimum that is sometimes - dB below SDR, but its speed makes it the default for real-time operation. This section derives the update rule, proves monotone convergence, and characterizes its local-optimum structure.
Definition: The Element-wise Update Rule
The Element-wise Update Rule
Consider the single-user QCQP objective for fixed and . Updating only while holding the others fixed, define
The one-coordinate problem is , which has the closed-form optimizer
For the multi-user quadratic (with not rank 1), the single-coordinate problem becomes maximizing a cosine (real sinusoid plus real constant terms), also admitting a closed form:
The key algorithmic fact: for each coordinate, the conditional optimum is phase-matching β align with the sum of influences from the other elements.
Element-wise BCD for Unit-Modulus QCQP
Complexity: total: - sweeps, each for the coordinate updatesEach sweep is because computing for all involves the matrix . For low-rank with , the sweep cost drops to .
Theorem: Monotone Convergence of Element-wise BCD
The element-wise BCD iterates produce a monotone non-decreasing sequence of objective values: . Under compactness of (automatic for the torus), the iterates have a convergent subsequence. Any limit point satisfies the coordinate-stationarity condition
which is a necessary condition for local optimality on . For the unit-modulus QCQP, the limit is a local maximum (not saddle) under generic channels.
Each single-coordinate update is a constrained maximization, producing a value no worse than the previous. Monotone sequence on a bounded domain converges to a limit; the limit must be a stationary point (zero coordinate gradient in every direction), i.e., a local optimum.
Monotonicity
By the definition of the coordinate update, (equality iff the current was already optimal conditional on ).
Boundedness and limit
is bounded on compact by continuity. The monotone sequence converges to some .
Coordinate stationarity
At any limit point , each coordinate satisfies the local optimality condition (the update doesn't change it). For smooth , this implies all partial derivatives (restricted to the tangent circle at each coordinate) are zero β i.e., the Riemannian gradient is zero.
Single-user: Optimal; Multi-user: Local
For rank-1 , element-wise BCD converges to the global optimum in one sweep: each coordinate update is matched-filter alignment with the others, which is the global solution for the rank-1 case.
For rank- with , element-wise converges to a local optimum that can be - dB below SDR. The bigger the rank (more users / streams), the larger the typical gap. For - users, the gap is tolerable; for , SDR or manifold is noticeably better.
Example: Element-wise Sweep: Toy Problem
with . Starting from , perform one sweep of element-wise BCD.
Update $\phi_1$
. , so . , so . . . Update: .
Update $\phi_2$
With updated , compute . (conjugate of ), , so . . . . rad. Update .
Update $\phi_3$
Similarly compute and update . After one sweep, is close to the matched filter solution , which is . Check: , so . The rank-1 case reaches optimality in one sweep.
Element-wise BCD Convergence Trace
Plot per sweep. Observe monotone convergence; plateau at 3-5 sweeps for rank-1 problems; slower convergence and higher gap for higher-rank problems.
Parameters
Common Mistake: Element-wise Gets Stuck in Rank- Multi-User Problems
Mistake:
"Element-wise BCD worked great for single-user; I'll use it for my 8-user sum-rate problem too."
Correction:
For higher-rank objectives, element-wise BCD can plateau at strictly suboptimal local maxima that differ significantly from the global optimum. Empirical benchmarks (Yu et al. 2020) show element-wise is typically within dB of SDR for , but the gap widens to dB at . If the application cares about the last dB, use manifold or SDR. If dB gap is tolerable, element-wise's simplicity and speed are hard to beat.
Element-wise for Real-Time Operation
Element-wise BCD is typically the only algorithm feasible for real-time RIS operation at large :
- , : Element-wise sweep: ms. Manifold trust-region: ms. SDR: s.
- Warm start across coherence blocks with previous : element-wise converges in 1-2 sweeps.
- Approximate low-rank : exploit rank structure for sweeps.
For offline analysis or simulation benchmarking, use SDR as a quality ceiling. In deployment, element-wise at dB gap is the pragmatic choice.
- β’
Per-sweep cost for dense at : ms on a modern CPU in NumPy.
- β’
Warm-start across coherence blocks reduces sweep count from to .
- β’
For 1-bit quantized phase (Ch. 8), element-wise BCD extends directly with projection to the discrete phase grid.