OMP and Alternating Minimization
The Factorization Problem
Sections 20.1-20.3 argued that the hybrid architecture is the right answer at mmWave. We now face the central algorithmic problem: given a target fully-digital precoder derived from the channel (for example, the top- right singular vectors of ), find (constant modulus) and (unconstrained) such that
subject to and .
This problem is NP-hard in general because of the constant-modulus constraint. The algorithmic insight of the mmWave literature is to exploit the sparsity of the mmWave channel: since has only dominant propagation paths, lives in a low-dimensional subspace spanned by transmit steering vectors. Restricting to have columns from a dictionary of steering vectors turns the problem into a sparse recovery problem solvable by OMP - and yields optimal solutions whenever .
Definition: Sparse mmWave Channel Model
Sparse mmWave Channel Model
The narrowband mmWave channel from a -element transmitter to a -element receiver with dominant paths is
where is the complex path gain, are the angles of departure and arrival, and is typically 1-5 at 28 GHz in an outdoor LOS/NLOS mix. Stacking the steering vectors as columns of and similarly for , the channel factors as
with diagonal containing the scaled gains. The effective rank is and is usually equal to .
The key observation is that (the top right singular vectors of ) lies in the column span of . So a good hybrid precoder should use columns of drawn from . Discretizing the angles to a grid yields a finite dictionary, and OMP selects the best columns.
Spatially Sparse Precoding via OMP
Complexity: , dominated by dictionary correlationOMP is a greedy residual-matching algorithm: at each iteration, it selects the steering vector most correlated with the current residual and absorbs it into , then re-fits in least-squares sense. The outer power normalization in line 10 ensures the total transmit power does not exceed .
Theorem: Optimality of OMP Under Ideal Sparsity
Suppose the target precoder can be written as for some matrix , where is a dictionary of orthonormal (or weakly correlated) transmit steering vectors. If the dictionary is contained in the OMP search set and , then OMP recovers and , achieving .
Each OMP iteration reduces the residual by projecting out the best dictionary direction. If the target is exactly a linear combination of dictionary vectors and those are orthogonal, iterations peel them off one by one; the residual vanishes at iteration .
In each iteration, compute and show that its rows are zero except at the remaining true support.
Show that the LS fit step exactly recovers the coefficients of the selected dictionary atoms.
Induction on the iteration count: at iteration , the residual lies in the span of the unused dictionary vectors.
Orthogonality of $\mathbf{A}_t$
By hypothesis , so the coefficient matrix satisfies . Non-zero rows of correspond to active dictionary indices, zero rows to inactive ones.
First iteration selects an active index
At iteration 1, . The row-norm argmax selects an index where , which is by assumption an active index .
LS fit and residual orthogonality
After line 7, exactly, and the residual has the -row of its projection equal to zero. The residual lies in the span of the remaining active dictionary vectors.
Induction and termination
By induction, after iterations the residual lies in the span of dictionary vectors. At iteration the residual reaches zero; for the argmax picks an inactive index whose coefficient vanishes. Hence suffices and exactly.
Alternating Minimization (MO-AltMin)
Complexity: per outer loop, with -Alternating minimization (AltMin) alternates between the unconstrained optimum over (least squares) and the constrained optimum over (entry-wise phase-only projection). Each step is monotonically non-increasing in the objective, so convergence to a stationary point is guaranteed. The fixed point may be a local minimum, so multiple random initializations are used in practice. The "MO-AltMin" variant of Yu et al. (2016) replaces the phase-only projection with a manifold-optimization step that is provably globally convergent.
Alternating Minimization Convergence
Trace the objective function versus iteration for alternating minimization on a synthetic target precoder. The number of RF chains controls the fundamental approximation error; under the residual drops to machine precision.
Parameters
Hybrid vs Digital Energy Efficiency
Compare the energy efficiency (bits/sec/Hz/W) of a fully-digital and a hybrid architecture as the number of antennas grows. The fully-digital architecture scales its power linearly in ; the hybrid architecture scales only in . The crossover point is where hybrid becomes the better choice.
Parameters
Example: OMP-Based Hybrid Precoding: A Small Case
Consider a 28 GHz channel with , , paths at azimuths , . The fully-digital precoder aims to transmit streams along these two directions. Design an OMP-based hybrid precoder with RF chains.
Dictionary construction
Build a angular dictionary by sampling to in steps of . Each dictionary column is .
First OMP iteration
Compute correlations ; the row with largest norm corresponds to the dictionary index closest to (or , depending on path power). Add this column to and re-fit .
Second OMP iteration
The residual is now orthogonal to the first selected vector. The argmax selects the dictionary entry near the other angle. Adding it produces up to the dictionary grid resolution.
Final residual
With and the dictionary containing the true steering vectors (exactly or to within grid resolution), the residual is zero modulo the dictionary discretization - typically below in normalized units.
Dictionary Granularity in Practice
The theoretical optimality of OMP (Theorem TOptimality of OMP Under Ideal Sparsity) assumes the true steering directions lie on the dictionary grid. In practice they do not, and the residual floor is governed by the grid resolution. A dictionary of directions (one per orthogonal DFT beam) is typically too coarse; or more is standard, trading correlation complexity ( per iteration) for approximation accuracy. For , a dictionary of is a reasonable balance. Atomic norm minimization and grid-free methods (e.g., root-MUSIC-style approaches) avoid the discretization entirely but add optimization complexity; they are rarely used in real-time hybrid precoding.
- β’
Dictionary too coarse: loss proportional to
- β’
Dictionary too fine: OMP correlation step becomes the runtime bottleneck
- β’
Recommended: to for typical mmWave arrays
Common Mistake: AltMin Gets Stuck in Local Minima
Mistake:
Because AltMin is a descent method with guaranteed convergence, one might assume it finds the global optimum of the hybrid factorization problem.
Correction:
Alternating minimization converges to a stationary point, not a global optimum. The constant-modulus constraint set is non-convex, and the objective has many local minima. In practice one runs AltMin from 10-50 random initializations and keeps the best result, or uses the manifold-optimization variant (MO-AltMin) which has stronger convergence guarantees. OMP, in contrast, is deterministic given the dictionary and is optimal when the channel is truly sparse - but suffers more from grid mismatch in the off-grid regime.
CommIT Multiuser Multibeam Array-Fed Architecture
A distinctive CommIT contribution to hybrid beamforming is the array-fed reflector architecture introduced by Bartoli, Abdolee, and Caire (2022). Instead of placing the phase-shifter network between the RF chains and a planar active array, a small active array ( elements) illuminates a passive parabolic or lens reflector. The reflector implements a fixed (frequency-independent) angular-to-spatial mapping: each point on the focal plane launches a pencil beam in a distinct direction. Multi-user, multi-beam transmission is then achieved by placing one active element per user beam, with the digital precoder handling inter-beam interference.
The architecture has two striking properties. First, it eliminates the phase-shifter network and its insertion loss, replacing it with a lossless quasi-optical structure. Second, the hardware cost scales with the number of users, not the aperture: a reflector with 1 m diameter at 300 GHz has a beamforming aperture of antennas-equivalent while using only active chains. This is the same principle behind Starlink's V2 user terminals and is increasingly the design point for sub-THz 6G research. Chapter 21 develops the related array-fed RIS concept.
Quick Check
Under what condition is OMP-based hybrid precoding provably exact (zero residual) for an -path mmWave channel?
and the dictionary contains the true steering vectors
, regardless of the dictionary
, by Theorem 20.1
(rank-1 LOS only)
Under sparsity, iterations suffice: each OMP step peels off one path from the residual. Dictionary containment is essential to avoid grid mismatch.
AltMin Residual Trajectory Animation
Animated version of the AltMin convergence plot. The objective is traced iteration by iteration for two configurations side by side: (green, drops to machine precision) and (red, plateaus at a positive floor).
Parameters
Key Takeaway
Two algorithmic paradigms dominate hybrid precoder design. OMP-based sparse precoding exploits the mmWave channel's angular sparsity and is provably optimal when and the dictionary contains the true steering vectors. Alternating minimization (AltMin / MO-AltMin) works on arbitrary channels but converges to local minima and needs careful initialization. Both are descent methods; the choice between them is governed by whether channel sparsity can be assumed.