OMP and Alternating Minimization

The Factorization Problem

Sections 20.1-20.3 argued that the hybrid architecture is the right answer at mmWave. We now face the central algorithmic problem: given a target fully-digital precoder Wopt\mathbf{W}_{\text{opt}} derived from the channel (for example, the top-KK right singular vectors of H\mathbf{H}), find FRF\mathbf{F}_{\text{RF}} (constant modulus) and FBB\mathbf{F}_{\text{BB}} (unconstrained) such that

(FRF,FBB)=arg⁑min⁑FRF,FBBβˆ₯Woptβˆ’FRFFBBβˆ₯F2(\mathbf{F}_{\text{RF}}, \mathbf{F}_{\text{BB}}) = \arg\min_{\mathbf{F}_{\text{RF}}, \mathbf{F}_{\text{BB}}} \|\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}} \mathbf{F}_{\text{BB}}\|_F^2

subject to ∣[FRF]m,n∣=1/Nt|[\mathbf{F}_{\text{RF}}]_{m,n}| = 1/\sqrt{N_t} and βˆ₯FRFFBBβˆ₯F2≀Pt\|\mathbf{F}_{\text{RF}} \mathbf{F}_{\text{BB}}\|_F^2 \leq P_t.

This problem is NP-hard in general because of the constant-modulus constraint. The algorithmic insight of the mmWave literature is to exploit the sparsity of the mmWave channel: since H\mathbf{H} has only LL dominant propagation paths, Wopt\mathbf{W}_{\text{opt}} lives in a low-dimensional subspace spanned by transmit steering vectors. Restricting FRF\mathbf{F}_{\text{RF}} to have columns from a dictionary of steering vectors turns the problem into a sparse recovery problem solvable by OMP - and yields optimal solutions whenever NRFβ‰₯LN_{\text{RF}} \geq L.

Definition:

Sparse mmWave Channel Model

The narrowband mmWave channel from a NtN_t-element transmitter to a NrN_r-element receiver with LL dominant paths is

H=NtNrLβˆ‘β„“=1Lαℓ a^(Ο•β„“r) a(Ο•β„“t)H,\mathbf{H} = \sqrt{\frac{N_tN_r}{L}} \sum_{\ell=1}^{L} \alpha_\ell \, \hat{\mathbf{a}}(\phi_\ell^r) \, \mathbf{a}(\phi_\ell^t)^H,

where Ξ±β„“βˆΌCN(0,1)\alpha_\ell \sim \mathcal{CN}(0, 1) is the complex path gain, Ο•β„“t,Ο•β„“r\phi_\ell^t, \phi_\ell^r are the angles of departure and arrival, and LL is typically 1-5 at 28 GHz in an outdoor LOS/NLOS mix. Stacking the steering vectors as columns of At=[a(Ο•1t),…,a(Ο•Lt)]\mathbf{A}_t = [\mathbf{a}(\phi_1^t), \ldots, \mathbf{a}(\phi_L^t)] and similarly for Ar\mathbf{A}_r, the channel factors as

H=Ar Σ AtH,\mathbf{H} = \mathbf{A}_r \, \mathbf{\Sigma} \, \mathbf{A}_t^H,

with Σ\mathbf{\Sigma} diagonal containing the scaled gains. The effective rank is min⁑(L,Nt,Nr)\min(L, N_t, N_r) and is usually equal to LL.

The key observation is that Wopt\mathbf{W}_{\text{opt}} (the top right singular vectors of H\mathbf{H}) lies in the column span of At\mathbf{A}_t. So a good hybrid precoder should use columns of FRF\mathbf{F}_{\text{RF}} drawn from At\mathbf{A}_t. Discretizing the angles to a grid yields a finite dictionary, and OMP selects the best NRFN_{\text{RF}} columns.

,

Spatially Sparse Precoding via OMP

Complexity: O(NRFβ‹…Gβ‹…Ntβ‹…K)O(N_{\text{RF}} \cdot G \cdot N_t \cdot K), dominated by dictionary correlation
Input: Target precoder Wopt∈CNtΓ—K\mathbf{W}_{\text{opt}} \in \mathbb{C}^{N_t \times K},
Tx steering-vector dictionary At∈CNtΓ—G\mathbf{A}_t \in \mathbb{C}^{N_t \times G},
number of RF chains NRFN_{\text{RF}}.
Output: FRF,FBB\mathbf{F}_{\text{RF}}, \mathbf{F}_{\text{BB}}
1. FRF←[ ]\mathbf{F}_{\text{RF}} \leftarrow [\,] (empty)
2. Wres←Wopt\mathbf{W}_{\text{res}} \leftarrow \mathbf{W}_{\text{opt}}
3. for i=1,2,…,NRFi = 1, 2, \ldots, N_{\text{RF}} do
4. \quad Ψ←AtHWres\boldsymbol{\Psi} \leftarrow \mathbf{A}_t^H \mathbf{W}_{\text{res}}
5. \quad k⋆←arg⁑max⁑k=1,…,Gβˆ₯Ξ¨[k,:]βˆ₯22k^{\star} \leftarrow \arg\max_{k=1,\ldots,G} \|\boldsymbol{\Psi}[k, :]\|_2^2
6. \quad FRF←[FRF,At[:,k⋆]]\mathbf{F}_{\text{RF}} \leftarrow [\mathbf{F}_{\text{RF}}, \mathbf{A}_t[:, k^{\star}]]
7. \quad FBB←(FRFHFRF)βˆ’1FRFHWopt\mathbf{F}_{\text{BB}} \leftarrow (\mathbf{F}_{\text{RF}}^H \mathbf{F}_{\text{RF}})^{-1} \mathbf{F}_{\text{RF}}^H \mathbf{W}_{\text{opt}} (LS fit)
8. \quad Wres←Woptβˆ’FRFFBBβˆ₯Woptβˆ’FRFFBBβˆ₯F\mathbf{W}_{\text{res}} \leftarrow \dfrac{\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}} \mathbf{F}_{\text{BB}}}{\|\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}} \mathbf{F}_{\text{BB}}\|_F}
9. end for
10. FBB←Ptβˆ₯FRFFBBβˆ₯FFBB\mathbf{F}_{\text{BB}} \leftarrow \dfrac{\sqrt{P_t}}{\|\mathbf{F}_{\text{RF}} \mathbf{F}_{\text{BB}}\|_F} \mathbf{F}_{\text{BB}} (power normalization)

OMP is a greedy residual-matching algorithm: at each iteration, it selects the steering vector most correlated with the current residual and absorbs it into FRF\mathbf{F}_{\text{RF}}, then re-fits FBB\mathbf{F}_{\text{BB}} in least-squares sense. The outer power normalization in line 10 ensures the total transmit power does not exceed PtP_t.

Theorem: Optimality of OMP Under Ideal Sparsity

Suppose the target precoder can be written as Wopt=AtZ\mathbf{W}_{\text{opt}} = \mathbf{A}_t \mathbf{Z} for some matrix Z∈CLΓ—K\mathbf{Z} \in \mathbb{C}^{L \times K}, where At\mathbf{A}_t is a dictionary of LL orthonormal (or weakly correlated) transmit steering vectors. If the dictionary is contained in the OMP search set and NRFβ‰₯LN_{\text{RF}} \geq L, then OMP recovers FRF=At\mathbf{F}_{\text{RF}} = \mathbf{A}_t and FBB=Z\mathbf{F}_{\text{BB}} = \mathbf{Z}, achieving βˆ₯Woptβˆ’FRFFBBβˆ₯F=0\|\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}}\mathbf{F}_{\text{BB}}\|_F = 0.

Each OMP iteration reduces the residual by projecting out the best dictionary direction. If the target is exactly a linear combination of LL dictionary vectors and those are orthogonal, LL iterations peel them off one by one; the residual vanishes at iteration LL.

Alternating Minimization (MO-AltMin)

Complexity: O(Titerβ‹…Nt2β‹…NRF)O(T_{\text{iter}} \cdot N_t^{2} \cdot N_{\text{RF}}) per outer loop, with Titer∼20T_{\text{iter}} \sim 20-5050
Input: Target Wopt\mathbf{W}_{\text{opt}}, initial FRF(0)\mathbf{F}_{\text{RF}}^{(0)}
with constant-modulus entries.
Output: Hybrid precoder (FRF,FBB)(\mathbf{F}_{\text{RF}}, \mathbf{F}_{\text{BB}}).
1. t←0t \leftarrow 0
2. repeat
3. \quad FBB(t+1)←(FRF(t) HFRF(t))βˆ’1FRF(t) HWopt\mathbf{F}_{\text{BB}}^{(t+1)} \leftarrow (\mathbf{F}_{\text{RF}}^{(t)\,H} \mathbf{F}_{\text{RF}}^{(t)})^{-1} \mathbf{F}_{\text{RF}}^{(t)\,H} \mathbf{W}_{\text{opt}} # LS step (unconstrained)
4. \quad for m=1,…,Ntm = 1, \ldots, N_t, n=1,…,NRFn = 1, \ldots, N_{\text{RF}} do
5. \qquad [FRF(t+1)]m,n←1Ntβ‹…sign ⁣([WoptFBB(t+1) H]m,n)[\mathbf{F}_{\text{RF}}^{(t+1)}]_{m,n} \leftarrow \dfrac{1}{\sqrt{N_t}} \cdot \text{sign}\!\left([\mathbf{W}_{\text{opt}} \mathbf{F}_{\text{BB}}^{(t+1)\,H}]_{m,n}\right) # phase-only projection
6. \quad end for
7. \quad t←t+1t \leftarrow t + 1
8. until βˆ₯Woptβˆ’FRF(t)FBB(t)βˆ₯F2\|\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}}^{(t)}\mathbf{F}_{\text{BB}}^{(t)}\|_F^2 converges

Alternating minimization (AltMin) alternates between the unconstrained optimum over FBB\mathbf{F}_{\text{BB}} (least squares) and the constrained optimum over FRF\mathbf{F}_{\text{RF}} (entry-wise phase-only projection). Each step is monotonically non-increasing in the objective, so convergence to a stationary point is guaranteed. The fixed point may be a local minimum, so multiple random initializations are used in practice. The "MO-AltMin" variant of Yu et al. (2016) replaces the phase-only projection with a manifold-optimization step that is provably globally convergent.

Alternating Minimization Convergence

Trace the objective function βˆ₯Woptβˆ’FRFFBBβˆ₯F2\|\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}}\mathbf{F}_{\text{BB}}\|_F^2 versus iteration for alternating minimization on a synthetic target precoder. The number of RF chains controls the fundamental approximation error; under NRFβ‰₯LN_{\text{RF}} \geq L the residual drops to machine precision.

Parameters
64
4
3
40

Hybrid vs Digital Energy Efficiency

Compare the energy efficiency (bits/sec/Hz/W) of a fully-digital and a hybrid architecture as the number of antennas grows. The fully-digital architecture scales its power linearly in NtN_t; the hybrid architecture scales only in NRFN_{\text{RF}}. The crossover point is where hybrid becomes the better choice.

Parameters
10
8
1

Example: OMP-Based Hybrid Precoding: A Small Case

Consider a 28 GHz channel with Nt=32N_t = 32, Nr=4N_r = 4, L=2L = 2 paths at azimuths Ο•1t=βˆ’30∘\phi^t_1 = -30^\circ, Ο•2t=20∘\phi^t_2 = 20^\circ. The fully-digital precoder aims to transmit K=2K = 2 streams along these two directions. Design an OMP-based hybrid precoder with NRF=2N_{\text{RF}} = 2 RF chains.

πŸ”§Engineering Note

Dictionary Granularity in Practice

The theoretical optimality of OMP (Theorem TOptimality of OMP Under Ideal Sparsity) assumes the true steering directions lie on the dictionary grid. In practice they do not, and the residual floor is governed by the grid resolution. A dictionary of G=NtG = N_t directions (one per orthogonal DFT beam) is typically too coarse; G=4NtG = 4 N_t or more is standard, trading correlation complexity (O(GNt)O(G N_t) per iteration) for approximation accuracy. For Nt=256N_t = 256, a dictionary of G=1024G = 1024 is a reasonable balance. Atomic norm minimization and grid-free methods (e.g., root-MUSIC-style approaches) avoid the discretization entirely but add optimization complexity; they are rarely used in real-time hybrid precoding.

Practical Constraints
  • β€’

    Dictionary too coarse: loss proportional to sinc2(Nt/G)\text{sinc}^2(N_t/G)

  • β€’

    Dictionary too fine: OMP correlation step becomes the runtime bottleneck

  • β€’

    Recommended: G=2NtG = 2N_t to 4Nt4N_t for typical mmWave arrays

Common Mistake: AltMin Gets Stuck in Local Minima

Mistake:

Because AltMin is a descent method with guaranteed convergence, one might assume it finds the global optimum of the hybrid factorization problem.

Correction:

Alternating minimization converges to a stationary point, not a global optimum. The constant-modulus constraint set is non-convex, and the objective has many local minima. In practice one runs AltMin from 10-50 random initializations and keeps the best result, or uses the manifold-optimization variant (MO-AltMin) which has stronger convergence guarantees. OMP, in contrast, is deterministic given the dictionary and is optimal when the channel is truly sparse - but suffers more from grid mismatch in the off-grid regime.

πŸŽ“CommIT Contribution(2022)

CommIT Multiuser Multibeam Array-Fed Architecture

G. Bartoli, R. Abdolee, G. Caire β€” IEEE Trans. Wireless Communications

A distinctive CommIT contribution to hybrid beamforming is the array-fed reflector architecture introduced by Bartoli, Abdolee, and Caire (2022). Instead of placing the phase-shifter network between the RF chains and a planar active array, a small active array (NRFN_{\text{RF}} elements) illuminates a passive parabolic or lens reflector. The reflector implements a fixed (frequency-independent) angular-to-spatial mapping: each point on the focal plane launches a pencil beam in a distinct direction. Multi-user, multi-beam transmission is then achieved by placing one active element per user beam, with the digital precoder handling inter-beam interference.

The architecture has two striking properties. First, it eliminates the phase-shifter network and its insertion loss, replacing it with a lossless quasi-optical structure. Second, the hardware cost scales with the number of users, not the aperture: a reflector with 1 m diameter at 300 GHz has a beamforming aperture of ∼105\sim 10^5 antennas-equivalent while using only NRF=8N_{\text{RF}} = 8 active chains. This is the same principle behind Starlink's V2 user terminals and is increasingly the design point for sub-THz 6G research. Chapter 21 develops the related array-fed RIS concept.

hybrid-beamformingmmwavesub-thzcommitarray-fedView Paper β†’

Quick Check

Under what condition is OMP-based hybrid precoding provably exact (zero residual) for an LL-path mmWave channel?

NRF=LN_{\text{RF}} = L and the dictionary contains the true steering vectors

NRF=NtN_{\text{RF}} = N_t, regardless of the dictionary

NRFβ‰₯2KN_{\text{RF}} \geq 2K, by Theorem 20.1

L=1L = 1 (rank-1 LOS only)

AltMin Residual Trajectory Animation

Animated version of the AltMin convergence plot. The objective βˆ₯Woptβˆ’FRFFBBβˆ₯F2\|\mathbf{W}_{\text{opt}} - \mathbf{F}_{\text{RF}}\mathbf{F}_{\text{BB}}\|_F^2 is traced iteration by iteration for two configurations side by side: NRFβ‰₯LN_{\text{RF}} \geq L (green, drops to machine precision) and NRF<LN_{\text{RF}} < L (red, plateaus at a positive floor).

Parameters
64
3
3
40

Key Takeaway

Two algorithmic paradigms dominate hybrid precoder design. OMP-based sparse precoding exploits the mmWave channel's angular sparsity and is provably optimal when NRFβ‰₯LN_{\text{RF}} \geq L and the dictionary contains the true steering vectors. Alternating minimization (AltMin / MO-AltMin) works on arbitrary channels but converges to local minima and needs careful initialization. Both are descent methods; the choice between them is governed by whether channel sparsity can be assumed.