Conjugate Beamforming in the DD Domain

Conjugate Beamforming: Simple and Near-Optimal

The precoder at each AP must steer energy toward its UEs in a distributed way — each AP acts on local channel knowledge, without global coordination at symbol level. Conjugate beamforming (sometimes called "matched filter precoding") applies the conjugate of the channel estimate as the precoder vector. It is locally computed, globally coherent when the APs are synchronized, and asymptotically optimal as LL \to \infty. This section develops it in the DD domain.

,

Definition:

Conjugate Beamforming in the DD Domain

At AP ll, the conjugate beamforming vector for UE kk is v(l,k)[,m]  =  (H(l,k)[,m])HH(l,k)F,\mathbf{v}^{(l, k)}[\ell, m] \;=\; \frac{(\mathbf{H}^{(l, k)}[\ell, m])^H}{\|\mathbf{H}^{(l, k)}\|_F}, applied per DD cell (,m)(\ell, m). The AP transmits x(l)[,m]  =  k=1Kαkv(l,k)[,m]sk[,m],\mathbf{x}^{(l)}[\ell, m] \;=\; \sum_{k=1}^{K} \sqrt{\alpha_k} \mathbf{v}^{(l, k)}[\ell, m] s_k[\ell, m], where αk\alpha_k is the per-UE power allocation (kαk=Pt/L\sum_k \alpha_k = P_t/L) and sks_k is UE kk's DD data symbol.

Aggregate received signal at UE kk: yk[,m]  =  l=1LαkH(l,k)v(l,k)sk[,m]+multi-user interferencefrom kk+wk.y_k[\ell, m] \;=\; \sum_{l=1}^{L} \sqrt{\alpha_k} \mathbf{H}^{(l, k)} \mathbf{v}^{(l, k)} s_k[\ell, m] + \underbrace{\text{multi-user interference}}_{\text{from } k' \neq k} + \mathbf{w}_{k}. The first term is the "channel-hardened" signal: lH(l,k)2sk\sum_l |\mathbf{H}^{(l,k)}|^2 s_k — a real-valued positive sum proportional to the total channel energy.

,

Theorem: Conjugate Beamforming Optimality

As LL \to \infty with KK fixed, conjugate beamforming in cell-free OTFS achieves the asymptotic SINR SINRk    αkLβˉkσw2/βˉk+multi-user interference,\mathrm{SINR}_k \;\to\; \frac{\alpha_k \cdot L \cdot \bar\beta_k}{\sigma_w^2 / \bar\beta_k + \text{multi-user interference}}, where βˉk=El[H(l,k)2]\bar\beta_k = \mathbb{E}_l[\|\mathbf{H}^{(l, k)}\|^2] is the average channel magnitude squared.

Interpretations:

  • Linear SINR scaling: SINRkL\mathrm{SINR}_k \propto L (signal scales with number of APs).
  • Channel hardening: the effective channel becomes deterministic at large LL. Fading variance 0\to 0.
  • MU-MMSE near-optimal: adding multi-user interference cancellation at the CPU recovers a few additional dB.

For L=50L = 50, K=20K = 20, pilot contamination κ=0.3\kappa = 0.3: SINRk17\mathrm{SINR}_k \approx 17 dB. Compare cellular single-BS: 10\sim 10 dB. Cell-free advantage: 7\sim 7 dB — 30-40% in rate.

Conjugate beamforming is the distributed version of matched filtering. At each AP, the precoder is the complex conjugate of the channel — pointing signal energy back along the same path the channel brings it. When APs synchronize, the individual signal contributions add coherently at the UE, while interference averages out by the law of large numbers. At LL \to \infty: perfect beamforming, no interference. Finite LL: interference scales with pilot contamination and user separation.

,

Key Takeaway

Conjugate beamforming is both simple and near-optimal in cell- free. Each AP computes v(l,k)=(H(l,k))H/H(l,k)\mathbf{v}^{(l, k)} = (\mathbf{H}^{(l, k)})^H / \|\mathbf{H}^{(l, k)}\| from its local estimate. No inter-AP coordination at signal level. Asymptotic SINR scales linearly with LL. This simplicity is why cell-free OTFS is deployable — the CPU only aggregates estimates, not per-symbol decisions.

Definition:

Regularized ZF for Finite LL

For small-to-moderate LL (L100L \leq 100), conjugate beamforming suffers from residual multi-user interference. Regularized ZF precoding reduces this: VRZF  =  HDDH(HDDHDDH+μI)1,\mathbf{V}_{\mathrm{RZF}} \;=\; \mathbf{H}_{\mathrm{DD}}^H (\mathbf{H}_{\mathrm{DD}} \mathbf{H}_{\mathrm{DD}}^H + \mu \mathbf{I})^{-1}, where HDD\mathbf{H}_{\mathrm{DD}} stacks all UEs' DD channel vectors, and μ\mu is a regularization parameter (μ=Kσw2/L\mu = K \sigma_w^2 / L near-optimal).

Tradeoff: RZF needs joint channel inversion (LKL \cdot K system), requiring CPU coordination. Conjugate BF is fully distributed; RZF gives 3\sim 3 dB gain at cost of centralization.

Practical rule: Use conjugate for L100L \geq 100; RZF for L50L \leq 50.

,

Theorem: Cell-Free OTFS BER Under Mobility

For cell-free OTFS with LL APs, KK UEs, conjugate BF, and Doppler spread νmax\nu_{\max}, the BER at target SNR γ\gamma is BER    (2P1P)1(Lβˉ/K/(σw2/γ))P,\mathrm{BER} \;\approx\; \binom{2 P - 1}{P} \cdot \frac{1}{\left(L \bar\beta / K / (\sigma_w^2/\gamma) \right)^P}, where PP is the average number of resolvable paths per UE-AP link.

Consequence: The BER exponent is PP (full DD diversity) and the pre-factor is LL (macro-diversity). At high mobility, this vastly outperforms cellular. Example: L=50L = 50, K=20K = 20, P=8P = 8, γ=20\gamma = 20 dB:

  • Cellular (1 BS): BER 105\sim 10^{-5}.
  • Cell-free OTFS: BER 1013\sim 10^{-13}8 orders of magnitude better.

Cell-free macro-diversity compounds with OTFS's DD-diversity (PP). The aggregate diversity is LPL \cdot P, and the BER decay is exponential in this total. For realistic numbers, the effective diversity is so high that BER falls below 101010^{-10} at 15-20 dB SNR — unheard of in classical MIMO. This is the reliability underpinning the 35% throughput gain.

Example: Cell-Free OTFS vs Cellular at High Mobility

Compare BER at 20 dB SNR, 120 km/h mobility, for: (a) Single-BS cellular OFDM. (b) Single-BS cellular OTFS. (c) Cell-free OFDM (L=50L = 50). (d) Cell-free OTFS (L=50L = 50).

Cell-Free OTFS BER vs Mobility

Plot BER vs UE velocity (0-300 km/h) for four configurations. Sliders: LL, KK, NaN_a.

Parameters
50
20
4
🎓CommIT Contribution(2023)

Conjugate Beamforming in the DD Domain for Cell-Free OTFS

M. Mohammadi, H. Q. Ngo, M. Matthaiou, G. CaireIEEE Trans. Wireless Communications

The CommIT contribution extends conjugate beamforming — the workhorse of cellular massive MIMO — to the DD domain for cell- free architectures. Three key results:

  1. Distributed DD-conjugate BF: each AP computes its precoder locally from its DD channel estimate. No symbol-level inter-AP coordination needed.
  2. Asymptotic SINR analysis: derives the exact scaling SINRLβˉ\mathrm{SINR} \propto L \bar\beta for the DD setting, accounting for Doppler phase coherence across APs.
  3. Quantitative performance: at L=50L = 50, K=20K = 20, 120 km/h, 20 dB SNR: 7\sim 7 dB SINR gain over cellular OTFS, 10\sim 10 dB over cellular OFDM.

Combined with the embedded-pilot estimation (§2), this yields the 35% improvement in 95%-likely per-user throughput. The DD-domain framework is essential: without it, conjugate BF at distributed APs cannot maintain Doppler-coherent combining.

commitcell-freeconjugate-bf
🔧Engineering Note

CPU Compute Scaling

CPU processing requirements in cell-free OTFS:

  • Channel aggregation: CPU receives LL per-AP DD estimates per UE per frame. Aggregation: O(LKMN)\mathcal{O}(L K MN) per frame.
  • Precoder computation (RZF if used): O(L3)\mathcal{O}(L^3) for full system, O(Lk3)\mathcal{O}(L_k^3) per UE for user-centric.
  • Resource allocation: O(LK)\mathcal{O}(L K) per frame.
  • Detection coordination: O(LK)\mathcal{O}(L K) per frame.

Total per frame: 106\sim 10^6-10710^7 ops for L=100L = 100, K=200K = 200. At 100 Hz frame rate: 10810^8-10910^9 ops/sec — well within a modern server CPU (2024-era Intel Xeon: 100 GFLOPS per core, 10+ cores).

Scaling to 1000 APs: conjugate BF scales linearly; RZF cubically. At L=1000L = 1000: user-centric clustering (Lk=10L_k = 10) keeps it tractable. Without clustering: need GPU acceleration.

Practical Constraints
  • Conjugate BF: O(LK) per frame

  • RZF: O(L³) — needs user-centric clustering

  • Modern server CPU handles L=100, K=200

  • L=1000+: requires user-centric + GPU

Common Mistake: Conjugate BF Fails Without Phase Sync

Mistake:

Running conjugate beamforming with unsynchronized APs. If AP phases are random, the coherent combining at the UE is lost — signals add non-coherently, and gain drops from LL to L\sqrt{L} (a L\sqrt{L} factor of lost rate).

Correction:

Phase synchronization across APs is mandatory for conjugate BF. Options:

  • GNSS-PPS: ±50\pm 50 ns phase accuracy. Works for sub-6 GHz.
  • PTP-1588v2 over fiber: ±10\pm 10 ns. Works for mmWave.
  • Bi-directional calibration: bootstrap phases at deployment, refresh periodically.

Deployment checklist: verify cross-AP phase lock (coherence) at center frequency before turning on conjugate BF. Automatic fallback to MRT or per-AP-independent beamforming if sync fails.