The Joint Beamforming Problem

Two Beamformers, One Channel

The RIS introduces a new kind of beamforming to the wireless toolkit: the passive beamformer, expressed through the phase-shift matrix Ξ¦\boldsymbol{\Phi}. It coexists with the classical active beamformer W\mathbf{W} at the BS. Both aim to shape the same effective channel, but they do so differently β€” active beamforming spends transmit power to focus energy; passive beamforming reshapes the propagation environment at no RF-power cost. The joint design question: how do we choose both simultaneously?

A naive guess might be that the two decouple β€” optimize W\mathbf{W} given the channel, then tune Ξ¦\boldsymbol{\Phi} to make the channel look nicer. This is wrong: the effective channel depends on Ξ¦\boldsymbol{\Phi}, so the optimal W\mathbf{W} depends on Ξ¦\boldsymbol{\Phi} too, and vice versa. The problem is irreducibly coupled.

Definition:

The Joint Active-Passive Beamforming Problem

Consider a KK-user MISO downlink: BS with NtN_t antennas, KK single-antenna UEs, single NN-element RIS. The BS transmits x=βˆ‘k=1Kvksk\mathbf{x} = \sum_{k=1}^K \mathbf{v}_{k} s_k, where vk∈CNt\mathbf{v}_{k} \in \mathbb{C}^{N_t} is user kk's beamformer and sks_k is the data symbol with E∣sk∣2=1\mathbb{E}|s_k|^2 = 1. Stack the beamformers: W=[v1,…,vK]∈CNtΓ—K\mathbf{W} = [\mathbf{v}_{1}, \ldots, \mathbf{v}_{K}] \in \mathbb{C}^{N_t \times K}.

User kk's received signal is yk=hk,effHx+wky_k = \mathbf{h}_{k,\text{eff}}^H \mathbf{x} + w_k, with hk,effH=hk,dH+hk,2HΦH1\mathbf{h}_{k,\text{eff}}^H = \mathbf{h}_{k,d}^H + \mathbf{h}_{k,2}^H \boldsymbol{\Phi} \mathbf{H}_1. The per-user SINR is

SINRk(W,Ξ¦)=∣hk,effHvk∣2βˆ‘jβ‰ k∣hk,effHvj∣2+Οƒ2.\text{SINR}_k(\mathbf{W}, \boldsymbol{\Phi}) = \frac{|\mathbf{h}_{k,\text{eff}}^H \mathbf{v}_{k}|^2}{\sum_{j \neq k} |\mathbf{h}_{k,\text{eff}}^H \mathbf{v}_{j}|^2 + \sigma^2}.

The joint sum-rate problem is

max⁑W,Ξ¦βˆ‘k=1Klog⁑2(1+SINRk)s.t.tr(WHW)≀Pt,Β βˆ£Ο•n∣=1Β βˆ€n.\boxed{ \max_{\mathbf{W}, \boldsymbol{\Phi}} \sum_{k=1}^K \log_2(1 + \text{SINR}_k) \quad \text{s.t.} \quad \text{tr}(\mathbf{W}^{H} \mathbf{W}) \leq P_t,\ |\phi_n| = 1\ \forall n. }

This is the central optimization of the book. Chapters 5–8 solve variants of it (single-user, multi-user, max-min fairness, discrete phases); Chapter 11 solves it under the array-fed architecture. Understanding its structure β€” bilinear objective, two constraint sets, one convex and one not β€” is the foundation for everything that follows.

,

Theorem: The Joint Problem Is Non-Convex

The feasible set {(W,Ξ¦):tr(WHW)≀Pt,Β βˆ£Ο•n∣=1}\{(\mathbf{W}, \boldsymbol{\Phi}) : \text{tr}(\mathbf{W}^{H} \mathbf{W}) \leq P_t,\ |\phi_n| = 1\} is non-convex: any convex combination of two feasible Ξ¦(1),Ξ¦(2)\boldsymbol{\Phi}^{(1)}, \boldsymbol{\Phi}^{(2)} violates the unit-modulus constraint (by Eex-ris-ch01-09). Moreover, for fixed W\mathbf{W}, the objective βˆ‘klog⁑2(1+SINRk)\sum_k \log_2(1 + \text{SINR}_k) is non-concave in Ο•\boldsymbol{\phi} due to the SINR denominator's dependence on inter-user interference.

As a consequence, no polynomial-time algorithm is known to produce the global optimum. All practical algorithms (alternating optimization, SDR, manifold methods) produce local optima and rely on multiple random initializations to find good solutions.

The unit-modulus constraint βˆ£Ο•n∣=1|\phi_n| = 1 defines a circle in C\mathbb{C}, which is non-convex. Its NN-dimensional product is a torus β€” also non-convex. Beyond the feasibility set, even the objective is non-concave in the joint variable. Non-convex problems can have multiple local optima and no efficient way to find the global one.

Sub-problems Can Be Convex

The joint problem is non-convex, but its coordinate sub-problems have better structure:

  1. Fix Ξ¦\boldsymbol{\Phi}, optimize W\mathbf{W}: the effective channel is known, so this reduces to a standard MU-MIMO precoding problem β€” convex when reformulated as WMMSE (weighted MMSE), solvable in closed form or semidefinite programming.
  2. Fix W\mathbf{W}, optimize Ξ¦\boldsymbol{\Phi}: the effective channel depends linearly on Ο•\boldsymbol{\phi}, so the objective is a quadratic in Ο•\boldsymbol{\phi} subject to unit-modulus β€” a quadratic with unit-modulus constraint, still non-convex but amenable to SDR, manifold methods, etc. (Chapter 6).

This alternating-convex structure is what the alternating optimization algorithm of Section 5.2 exploits. It is also the organizing principle of the whole optimization portion of the book.

Example: Single-User MISO-RIS: The Clean Case

Consider a single-user MISO-RIS system with K=1K = 1. The SINR simplifies to SNR=∣heffHv∣2/Οƒ2\text{SNR} = |\mathbf{h}_{\text{eff}}^H \mathbf{v}|^2 / \sigma^2 (no interference term). Derive the optimal (v⋆,Φ⋆)(\mathbf{v}^\star, \boldsymbol{\Phi}^\star).

Key Takeaway

The joint problem has bilinear structure. For any fixed Ξ¦\boldsymbol{\Phi}, the active subproblem reduces to standard MU-MIMO precoding (convex). For any fixed W\mathbf{W}, the passive subproblem reduces to unit-modulus quadratic optimization (non-convex but tractable). This separation is the algorithmic DNA of alternating optimization and of nearly every RIS optimization algorithm in the literature.

The Bilinear Structure of Joint RIS Beamforming

The Bilinear Structure of Joint RIS Beamforming
The two-variable optimization landscape. Black curves are level sets of the sum-rate objective. Red vertical lines show the passive sub-problem (fix W\mathbf{W}, optimize Ξ¦\boldsymbol{\Phi}); blue horizontal lines show the active sub-problem. Alternating optimization walks a staircase through this landscape, taking one conditional optimum at a time.

Common Mistake: Don't Decouple Active and Passive Beamforming

Mistake:

"Set Ξ¦=I\boldsymbol{\Phi} = \mathbf{I} (all zeros), optimize W\mathbf{W} for the direct channel. Then re-optimize Ξ¦\boldsymbol{\Phi} for the resulting beamformer."

Correction:

The active and passive beamformers are coupled through the cascaded channel. Decoupling produces suboptimal solutions that can miss the N2N^2 coherent gain entirely. Specifically, the optimal Ξ¦\boldsymbol{\Phi} depends on W\mathbf{W}, and the optimal W\mathbf{W} depends on Ξ¦\boldsymbol{\Phi}. The alternating approach iterates between the two sub-problems, converging to a joint local optimum β€” the minimum-effort correct approach. A one-shot decoupled design typically loses 33–1010 dB of coherent gain.

Why This Matters: The MU-MIMO Precoder Analogy

If you already understand MU-MIMO precoding (MIMO Ch. 6, Telecom Ch. 17), the RIS joint problem is a generalization: in MU-MIMO, we choose the precoder W\mathbf{W} to shape the transmit signal for fixed channel. In RIS-MU-MIMO, we additionally choose Ξ¦\boldsymbol{\Phi} to shape the channel itself. The WMMSE iteration (the standard MU-MIMO workhorse) becomes one step of an alternating scheme; the other step is the unit-modulus RIS subproblem. Think of RIS beamforming as "MU-MIMO precoding plus one extra variable" β€” the mental model scales.

See full treatment in Sum-Rate Maximization via WMMSE