The Active-Beamforming Subproblem

Fix the Channel, Choose the Precoder

Once Φ\boldsymbol{\Phi} is fixed, the RIS-aided system looks identical to a standard MU-MIMO downlink with the effective channel Hk,effH=hk,dH+hk,2HΦH1\mathbf{H}_{k,\text{eff}}^H = \mathbf{h}_{k,d}^H + \mathbf{h}_{k,2}^H \boldsymbol{\Phi} \mathbf{H}_1. The active beamforming subproblem is therefore any known MU-MIMO precoder design: MRT for single-user, ZF for interference-nulling, WMMSE for sum-rate maximization, etc. We recap the WMMSE scheme here because it is the default choice inside the AO loop and because it has an elegant closed-form update structure.

Definition:

WMMSE Reformulation of Sum-Rate Maximization

Consider the sum-rate maximization max⁑W:tr(WHW)≀Ptβˆ‘klog⁑2(1+SINRk)\max_{\mathbf{W}: \text{tr}(\mathbf{W}^{H}\mathbf{W}) \leq P_t} \sum_k \log_2(1 + \text{SINR}_k) with SINRk\text{SINR}_k as in DThe Joint Active-Passive Beamforming Problem. Introducing auxiliary variables uk∈CNt\mathbf{u}_k \in \mathbb{C}^{N_t} (combiner weights) and wk>0w_k > 0 (WMMSE weights), the sum-rate problem is equivalent to the WMMSE problem

min⁑W,{uk},{wk}βˆ‘kwk ek(W,uk)βˆ’log⁑2wks.t.Β tr(WHW)≀Pt,\min_{\mathbf{W}, \{\mathbf{u}_k\}, \{w_k\}} \sum_k w_k\, e_k(\mathbf{W}, \mathbf{u}_k) - \log_2 w_k \quad \text{s.t.}\ \text{tr}(\mathbf{W}^{H} \mathbf{W}) \leq P_t,

where the MSE at user kk is ek=E[∣ukHykβˆ’sk∣2]e_k = \mathbb{E}[|\mathbf{u}_k^H y_k - s_k|^2]. The two problems have the same KKT conditions; the WMMSE reformulation is block-convex in (W,{uk},{wk})(\mathbf{W}, \{\mathbf{u}_k\}, \{w_k\}) and admits closed-form coordinate updates.

The WMMSE identity is a classical result from MU-MIMO optimization (Christensen et al. 2008, Shi et al. 2011). It is reused here unchanged: once Ξ¦\boldsymbol{\Phi} fixes the effective channels, the RIS disappears from the active-beamformer problem entirely.

Theorem: Closed-Form WMMSE Coordinate Updates

At block-coordinate optima of the WMMSE problem:

  1. Combiner update (MMSE receiver): uk⋆=(βˆ‘jHk,effHvjvjHHk,eff+Οƒ2I)βˆ’1Hk,effHvk\mathbf{u}_k^\star = (\sum_j \mathbf{H}_{k,\text{eff}}^H \mathbf{v}_{j} \mathbf{v}_{j}^{H} \mathbf{H}_{k,\text{eff}} + \sigma^2 \mathbf{I})^{-1} \mathbf{H}_{k,\text{eff}}^H \mathbf{v}_{k}.
  2. Weight update: wk⋆=1/(1βˆ’uk⋆HHk,effHvk)w_k^\star = 1/(1 - \mathbf{u}_k^{\star H} \mathbf{H}_{k,\text{eff}}^H \mathbf{v}_{k}). Equivalently, wk⋆=1/ek⋆w_k^\star = 1 / e_k^\star.
  3. Precoder update (regularized ZF): vk⋆=wk(βˆ‘jwjHj,effujujHHj,effH+ΞΌI)βˆ’1Hk,effuk\mathbf{v}_{k}^\star = w_k \big(\sum_j w_j \mathbf{H}_{j,\text{eff}} \mathbf{u}_j \mathbf{u}_j^H \mathbf{H}_{j,\text{eff}}^H + \mu \mathbf{I}\big)^{-1} \mathbf{H}_{k,\text{eff}} \mathbf{u}_k, where ΞΌβ‰₯0\mu \geq 0 is the Lagrange multiplier for the power constraint, found by bisection.

The algorithm converges monotonically to a local optimum of the original sum-rate problem.

Each of the three variable blocks (uk\mathbf{u}_k, wkw_k, W\mathbf{W}) has a closed-form optimum given the others: MMSE combiner, rate weight, and a dual-variable-regulated linear precoder.

WMMSE Sum-Rate Maximization (Inner Loop of AO)

Complexity: O(TWMMSEβ‹…KNt3)O(T_{\text{WMMSE}} \cdot K N_t^{3}) per outer AO iteration; TWMMSE∼5T_{\text{WMMSE}} \sim 5-1010 typical
Input: effective channels Hk,eff\mathbf{H}_{k,\text{eff}} (fixed, given Ξ¦\boldsymbol{\Phi});
power budget PtP_t; tolerance Ο΅in\epsilon_{\text{in}}.
Output: precoder W⋆\mathbf{W}^\star.
1. Initialize W(0)\mathbf{W}^{(0)}, e.g., zero-forcing on direct channels, scaled to satisfy power.
2. For t=0,1,2,…t = 0, 1, 2, \ldots:
3. \quad Combiner update: uk(t+1)=\mathbf{u}_k^{(t+1)} = MMSE receiver given W(t)\mathbf{W}^{(t)}.
4. \quad Weight update: wk(t+1)=1/ek(t+1)w_k^{(t+1)} = 1 / e_k^{(t+1)}.
5. \quad Precoder update: Bisection on ΞΌ\mu to satisfy tr(W(t+1),HW(t+1))=Pt\text{tr}(\mathbf{W}^{(t+1),H} \mathbf{W}^{(t+1)}) = P_t; each inner problem is a linear system.
6. \quad If sum-rate change <Ο΅in< \epsilon_{\text{in}}: break.
7. return W(t+1)\mathbf{W}^{(t+1)}.

Each WMMSE inner loop is O(KNt3)\mathcal{O}(KN_t^{3}) per iteration, dominated by the NtΓ—NtN_t \times N_t matrix inversion in the precoder update. For moderate NtN_t (say ≀64\leq 64), this is sub-millisecond on modern hardware. The bisection on ΞΌ\mu converges in ∼10\sim 10 iterations.

Example: Zero-Forcing as a Simplified Alternative

At high SNR, the optimal precoder approaches zero-forcing (ZF), which nulls inter-user interference. Derive the ZF precoder given the RIS-adjusted effective channel matrix HeffMU∈CKΓ—Nt\mathbf{H}_{\text{eff}}^{\text{MU}} \in \mathbb{C}^{K \times N_t} (rows = per-user effective channels), and state the SINR under ZF.

Rate vs. Transmit Power: With/Without RIS

Compare the sum-rate achievable by alternating optimization (WMMSE

  • RIS) against a no-RIS baseline (direct-channel WMMSE only). The RIS curve shows the joint optimization gain. Change NN to see how larger RIS widens the gap.
Parameters
64
8
2
-10
30

Single-User: MRT + Element-Wise = Closed Form

For K=1K = 1, the WMMSE reduces to MRT (matched filter) β€” a closed-form expression with no iteration. The passive subproblem reduces to element-wise matching (Chapter 6). AO converges in 3-5 iterations for single-user, and the rate is within <0.1Β bits/s/Hz< 0.1\text{ bits/s/Hz} of the global optimum. This is the clean case where RIS optimization is essentially solved. Multi-user complications require the full WMMSE machinery and Chapter 6's SDR/manifold methods on the passive side.

Common Mistake: WMMSE Initialization Matters

Mistake:

"Initialize W(0)=0\mathbf{W}^{(0)} = \mathbf{0} to satisfy the power constraint trivially."

Correction:

WMMSE diverges from W(0)=0\mathbf{W}^{(0)} = \mathbf{0}: the MSE is 1 per user, the weights are 1, and the combiner update has no signal to lock onto. Reasonable initializations: (i) ZF scaled to the power budget; (ii) MRT with equal power per user; (iii) random precoder with power constraint satisfied. All converge to the same local optimum within AO, but (i) is typically fastest.