Exercises
ex-ris-ch06-01
EasyShow that the unit-modulus QCQP with rank-1 PSD has the closed-form solution and objective value .
Use the triangle inequality.
Bound
.
Equality
Holds iff all have the same phase. Pick : , all real positive.
ex-ris-ch06-02
MediumDerive the SDR relaxation of the unit-modulus QCQP. Write both the primal SDP and its dual.
Lift to .
Primal
s.t. for all .
Dual
s.t. . (Lagrange multipliers for the diagonal-equality constraints.)
Strong duality
Both primal and dual are bounded convex SDPs with interior solutions (Slater's condition), so strong duality holds and the primal and dual optima coincide.
ex-ris-ch06-03
MediumCompute the -approximation factor by computing for where is unit-modulus, .
with and uniform.
Distribution of $Z$
For (scalar case), . .
Approximation ratio
In the scalar case, the unit-modulus projection is , with objective . The SDR bound is also 1. So the ratio is 1. The factor applies to higher-dimensional cases where projection loses correlation; the scalar case is tight.
Reference
See Zhang and Huang (2006) for the general derivation on the complex sphere.
ex-ris-ch06-04
MediumCompute the SDR complexity for , , . Which are feasible on a modern desktop ( flops/s)?
Interior-point: .
Compute
: . s β 17 min. Feasible offline. : . s β 6 months. Infeasible. : . s β 3000 years. Infeasible.
Interpretation
SDR's scaling is a hard wall. Above , switch to manifold or element-wise. For research at , SDR is usable but slow; for deployment at , off-limits.
ex-ris-ch06-05
MediumCompute the Riemannian gradient of at (element-wise). Show it is zero.
At the optimum, the Euclidean gradient is parallel to .
Euclidean gradient
. At : . So .
Projection to tangent space
. Component-wise: , real. So , and .
Conclusion
Riemannian gradient is zero β is a stationary point, confirming the matched-filter optimum.
ex-ris-ch06-06
MediumDerive the element-wise update rule for the multi-user quadratic with general PSD .
Expand as a function of only.
Separate $\phi_n$
. Separate : . Since , (constant). The -dependent part is where .
Maximize
is maximized at (same phase alignment argument as in Ex. 5.5).
Computational note
Computing is per coordinate, so a full sweep is . For low-rank , tracking the product incrementally reduces the cost to where .
ex-ris-ch06-07
HardConstruct an example where element-wise BCD converges to a strictly suboptimal local maximum (not the global maximum).
Take and a rank-2 chosen so the objective landscape has multiple local maxima.
Construct $\mathbf{A}$
Let β this has eigenvalues , so it's indefinite, not PSD. For a PSD example: β both coordinate-stationary points are . Global max at is the unique maximum; element-wise from would need to move to which is what the update rule gives. No strict local optima for this case.
A better example
Take with (block-diagonal). Then coordinate updates don't interact: each is matched-aligned with the corresponding -diagonal. If , update is trivial: is any unit-modulus (the objective doesn't depend on 's phase for diagonal ).
Higher-rank non-convex case
Try with having off-diagonal terms that create a "saddle landscape": some coordinate moves increase the objective, others decrease. Random initialization can land in different basins, converging to different local maxima. The full analysis uses the Hessian signature; see Yu et al. (2020) for examples with documented local-opt gaps.
ex-ris-ch06-08
MediumWhy does Gaussian randomization with samples effectively achieve the -approximation guarantee?
Consider the CDF of the single-sample rounded objective.
Single-sample distribution
for a random is a random variable with mean .
Max of $L$ samples
By standard concentration arguments, the max of i.i.d. samples from a distribution with mean concentrates near the essential supremum of the distribution. For sub-Gaussian-like distributions, the gap to the supremum is β at , roughly 2.6 standard deviations from the mean.
Practical factor
The empirical peak at is typically within - of , much better than the worst-case . The theoretical bound is a worst case; most problem instances deliver higher.
ex-ris-ch06-09
MediumImplement the element-wise BCD as pseudocode for a sum-rate multi-user problem where the objective is with fixed weights .
The coordinate-wise function is still a quadratic in ; argmax is phase matching.
Coordinate function
For fixed , the dependence on is a sum of squared moduli; each contributes a quadratic in .
Combine
The -dependent part collects to , where are computable from the current state. Under , the term is constant, and .
Write out $B_n$
. Each term is per coordinate pair, so per coordinate, per sweep.
ex-ris-ch06-10
MediumShow that the retraction (element-wise) maps tangent vectors back to the complex unit torus.
Verify that the result has for each .
Component-wise check
is a complex number divided by its own modulus, so whenever .
Existence check
iff . For tangent vectors with , this is impossible. The retraction is well-defined on the tangent bundle restricted to bounded tangent vectors.
First-order accuracy
For small , β the tangent-space projection of . This is the defining property of a first-order retraction.
ex-ris-ch06-11
HardProve that Armijo backtracking on the manifold with step size halving guarantees monotone ascent: for some whenever .
Use the first-order Taylor expansion of along the retraction curve.
Taylor expansion
. (Using the fact that the Riemannian gradient defines the first-order change along the retraction.)
Sufficient decrease
Pick . The Armijo condition asks . By the Taylor expansion, this holds for all small enough : the LHS is , which is when is small.
Halving converges
If the Armijo condition fails at , try . By the Taylor expansion, the condition must hold at some , so halving terminates in finitely many steps.
ex-ris-ch06-12
MediumCompare the number of iterations needed by SDR, manifold, and element-wise to converge to objective precision on a typical problem.
SDR: one solve. Manifold: ~100 iterations. Element-wise: ~5-10 sweeps.
SDR
One SDP solve returns the optimum in precision call. Total: 1 "iteration" (but expensive).
Manifold
Linear convergence; each iteration halves the gap. To get from 100% gap to 0.1% gap: iterations. In practice, 50-200 iterations due to the constant factors.
Element-wise
One sweep reduces the gap to the local-optimum sub-level set. Subsequent sweeps converge geometrically fast. Typically 3-5 sweeps suffice.
ex-ris-ch06-13
HardCompare SDR and manifold quality when rank (e.g., two-user sum rate). Quantify the typical gap.
Empirical observation: manifold within 1% of SDR for rank 2.
Quality argument
Manifold converges to a local stationary point; SDR provides a tight upper bound. For rank-2, the SDR upper bound is typically achieved by a "rank-2 lift" that is not directly feasible, but Gaussian randomization with gets within of the bound.
Manifold quality
Manifold with random multi-start (5 trials) typically matches SDR-randomization. With warm-starting from the previous coherence block, even better: gap.
Conclusion
At rank 2, manifold is effectively as good as SDR at a fraction of the cost. The gap widens at rank but rarely exceeds in practice.
ex-ris-ch06-14
ChallengeOpen-ended: Design a warm-starting protocol for manifold optimization across RIS coherence blocks that accounts for block-to-block channel correlation.
Use the previous as initial point; reduce step size; reduce iteration count.
Warm-start initialization
At coherence block , initialize .
Adaptive parameters
Given the block-to-block channel (a correlation indicator):
- If small (slow motion): use 5 manifold iterations, small step size.
- If large (fast motion): use 30 iterations, bigger step size, re-initialize from random.
Implementation
Monitor the objective improvement per iteration; if improvement slows below a threshold, exit. Typical speedup: - vs. cold-start.
Safeguards
If the channel changes suddenly (blockage event), the warm-started may be far from the new optimum. Detect via sudden drop in ; fall back to cold-start random initialization.
ex-ris-ch06-15
MediumFor , , estimate the achieved rate improvement from SDR vs. element-wise, in bits/s/Hz.
Typical gap at rank 4: - dB in SNR, - bit/s/Hz in rate.
Gap in SNR
Rank 4, moderate : expect dB SDR advantage over element-wise local optimum.
Gap in rate
at high SNR β . 2 dB of SNR is a factor of . Rate difference: bits/s/Hz per user.
Total
Per-user bit/s/Hz; users β sum-rate advantage bits/s/Hz for SDR over element-wise. Significant enough to justify SDR for research, too small to justify the compute cost in deployment β use element-wise with warm-start for production.