Rate Scaling and Power Scaling Laws

The Promise of Massive MIMO: How Far Can We Push?

The closed-form rate expressions from the previous sections reveal that massive MIMO performance improves with the number of antennas. But how does it improve? And can we exploit the array gain to reduce transmit power while maintaining the same rate?

These are the questions addressed by scaling law analysis. We study two complementary regimes:

  1. Rate scaling: Fix the transmit power and let NtN_t \to \infty. How does the sum rate grow?

  2. Power scaling: Let Pt=Et/NtαP_t = E_t / N_t^\alpha decrease as NtN_t grows. What is the largest exponent α\alpha that still yields a positive rate?

The answers depend dramatically on the combining scheme and on whether pilot contamination is present. Power scaling is arguably the most practically important result in the massive MIMO literature: it shows that green communication is not just a slogan but a mathematical consequence of the array gain.

,

Theorem: Sum Rate Scaling with MRC

With MRC combining, fixed transmit power PtP_t, and optimal user loading (KK chosen to maximize sum rate), the sum spectral efficiency scales as

SMRC=Θ ⁣(NtlnNt)[bits/s/Hz],S^{\text{MRC}} = \Theta\!\left(\frac{N_t}{\ln N_t}\right) \quad \text{[bits/s/Hz]},

where the optimal number of users is K=Θ(Nt/lnNt)K^* = \Theta(N_t / \ln N_t).

With a fixed ratio K/Nt=αK / N_t = \alpha, the sum rate scales as SMRC=Θ(Klog2Nt)S^{\text{MRC}} = \Theta(K \log_2 N_t).

With MRC, each additional user adds log2(Nt/K)\log_2(N_t / K) to the sum rate but also adds interference to all other users. The balance yields a sum rate that grows slightly slower than linearly in NtN_t.

,

Theorem: Sum Rate Scaling with ZF

With ZF combining, fixed transmit power, perfect estimation (γk=βk\gamma_k = \beta_{k}), and optimal user loading, the sum spectral efficiency scales as

SZF=Θ(Nt)[bits/s/Hz],S^{\text{ZF}} = \Theta(N_t) \quad \text{[bits/s/Hz]},

with the optimal number of users growing as K=Θ(Nt)K^* = \Theta(N_t).

ZF eliminates inter-user interference, so the per-user rate is RkZF=log2(1+(NtK)SNR)R_k^{\text{ZF}} = \log_2(1 + (N_t - K)\text{SNR}). This is interference-free — the only penalty from adding users is the loss of degrees of freedom. The optimal loading uses roughly half the antennas for users (KNt/2K^* \approx N_t/2 at high SNR), and each user achieves log2(Nt/2SNR)\log_2(N_t/2 \cdot \text{SNR}), yielding linear scaling of the sum rate.

,

Rate Scaling: MRC vs. ZF vs. MMSE

The scaling law comparison reveals a fundamental hierarchy:

  • MRC: S=Θ(Nt/lnNt)S = \Theta(N_t / \ln N_t) — near-linear but penalized by interference
  • ZF: S=Θ(Nt)S = \Theta(N_t) — linear in NtN_t
  • MMSE: S=Θ(Nt)S = \Theta(N_t) — same scaling as ZF, but with better constants (lower denominator)

The point is that all three schemes achieve sum rates that grow at least linearly in NtN_t (up to logarithmic factors). This is the spatial multiplexing gain of massive MIMO: the base station can serve Θ(Nt)\Theta(N_t) users simultaneously, each at a positive rate.

Definition:

Power Scaling Regime

In the power scaling regime, the transmit power of each user is reduced as the number of antennas grows:

Pt=EtNtα,P_t = \frac{E_t}{N_t^\alpha},

where Et>0E_t > 0 is a fixed energy parameter and α0\alpha \geq 0 is the power scaling exponent. The goal is to find the largest α\alpha for which the achievable rate remains bounded away from zero as NtN_t \to \infty.

When α=1\alpha = 1, the transmit power scales as 1/Nt1/N_t, meaning the total radiated energy is reduced by a factor of NtN_t compared to the single-antenna case. This is the most aggressive power reduction compatible with nonzero rate.

,

Theorem: Power Scaling with MRC

With MRC combining and i.i.d. Rayleigh fading, if the transmit power scales as Pt=Et/NtP_t = E_t / N_t, then the per-user rate converges to

RkMRClog2 ⁣(1+Etβk2τpEpσ2(βkτpEp+σ2))>0R_k^{\text{MRC}} \to \log_2\!\left(1 + \frac{E_t \, \beta_{k}^{2} \, \tau_p \, E_p}{\sigma^2(\beta_{k} \tau_p E_p + \sigma^2)}\right) > 0

as NtN_t \to \infty, where Ep=PtpNtE_p = {P_t}_{p} N_t is the pilot energy (also scaled). Hence MRC supports power scaling with exponent α=1\alpha = 1.

If α>1\alpha > 1, the rate converges to zero. The maximum power scaling exponent for MRC is α=1\alpha^* = 1.

The MRC SINR has numerator NtPtγkN_t P_t \gamma_k and a denominator independent of NtN_t. Substituting Pt=Et/NtP_t = E_t / N_t makes the numerator Etγk(Nt)E_t \gamma_k(N_t), which converges to a positive constant as NtN_t \to \infty (since γk\gamma_k also adjusts with the pilot power scaling). The key: the array gain from NtN_t antennas exactly compensates the 1/Nt1/N_t power reduction.

,

Theorem: Power Scaling with ZF and MMSE

With ZF or MMSE combining and i.i.d. Rayleigh fading, both schemes support power scaling with exponent α=1\alpha = 1, and the limiting per-user rates are

RkZFlog2 ⁣(1+Etβk2τpEpσ22),RkMMSERkZF.R_k^{\text{ZF}} \to \log_2\!\left(1 + \frac{E_t \, \beta_{k}^{2} \, \tau_p E_p}{{\sigma^2}^{2}}\right), \quad R_k^{\text{MMSE}} \geq R_k^{\text{ZF}}.

The ZF limiting rate is strictly higher than the MRC limiting rate because the denominator contains σ22{\sigma^2}^{2} instead of σ2(βkτpEp+σ2)\sigma^2(\beta_{k} \tau_p E_p + \sigma^2).

With ZF, the interference-from-estimation-error terms in the denominator also scale as 1/Nt1/N_t and vanish, leaving only σ2\sigma^2. But the ZF denominator is jPtj(βjγj)+σ2\sum_j {P_t}_{j}(\beta_{j} - \gamma_j) + \sigma^2, and with power scaling, Ptj(βjγj)0{P_t}_{j}(\beta_{j} - \gamma_j) \to 0. The extra factor of σ2\sigma^2 in the MRC denominator comes from the interference that MRC does not suppress — and that interference vanishes when each user's power is 1/Nt1/N_t.

The Green Communication Implication

The power scaling result is one of the most compelling arguments for massive MIMO. Consider a base station serving K=10K = 10 users. With Nt=100N_t = 100 antennas, each user can reduce its transmit power by a factor of 100 (20 dB) compared to the single-antenna case, while maintaining the same rate.

For a user transmitting at 200 mW in a conventional system, this means only 2 mW is needed with massive MIMO. The battery life improvement for mobile devices is enormous. Equivalently, the base station can serve the same users with 20 dB less total radiated power, reducing electromagnetic exposure and energy consumption.

This is not an asymptotic curiosity — practical 5G massive MIMO deployments with 64 antennas already benefit from significant power reductions.

🎓CommIT Contribution(2013)

Energy and Spectral Efficiency of Very Large Multiuser MIMO Systems

H. Q. Ngo, E. G. Larsson, T. L. Marzetta, G. CaireIEEE Transactions on Communications

Ngo, Larsson, and Marzetta (2013) established the foundational power scaling laws for massive MIMO. Their key results — that transmit power can be reduced as 1/Nt1/N_t with MRC and as 1/Nt1/N_t with ZF, while maintaining nonzero rates — provided the first rigorous quantification of the "green communication" potential of massive arrays.

The paper also showed that with pilot contamination from other cells, the power scaling exponent is limited to α=1/2\alpha = 1/2 rather than α=1\alpha = 1, highlighting the fundamental impact of pilot contamination on energy efficiency. Caire's subsequent work on pilot decontamination via spatial correlation (Chapter 3) restored the full α=1\alpha = 1 scaling by exploiting the structure of realistic channel models.

power-scalingenergy-efficiencymassive-MIMOView Paper →

Theorem: Pilot Contamination Rate Ceiling

In a multi-cell system with LL cells sharing the same pilot sequences, the MRC rate for user kk in cell \ell is upper bounded by

Rk()log2 ⁣(1+βk2lβkl2)as Nt,R_k^{(\ell)} \leq \log_2\!\left(1 + \frac{\beta_{k\ell}^{2}}{\sum_{l \neq \ell} \beta_{kl}^{2}}\right) \quad \text{as } N_t \to \infty,

where βkl\beta_{kl} is the large-scale fading from user kk's pilot-sharing counterpart in cell ll to the base station in cell \ell. This ceiling is finite and independent of NtN_t.

Pilot contamination means the base station's estimate of its own user's channel is corrupted by the channels of pilot-sharing users in other cells. As NtN_t \to \infty, MRC coherently combines both the desired signal and the contaminating interference, so the SINR saturates.

This result, first observed by Marzetta (2010), was initially thought to be a fundamental limitation of massive MIMO. Chapter 3 discusses how spatial correlation can overcome this ceiling.

,

Example: Rate Ceiling with Two Cells

Consider a two-cell system (L=2L = 2) where user kk in cell 1 has path loss βk1=0.1\beta_{k1} = 0.1 to its own base station and βk2=0.01\beta_{k2} = 0.01 to the interfering base station. Both cells use the same pilot sequence for user kk. What is the rate ceiling with MRC as NtN_t \to \infty?

Power Scaling Regimes

Explore how the achievable rate behaves as the transmit power is scaled down with the number of antennas. Adjust the power scaling exponent α\alpha and observe the rate behavior for MRC, ZF, and MMSE.

Parameters
1

Power scaling exponent: $P_t = E_t / N_t^\alpha$

10
20

Fixed energy parameter (dB)

-10
⚠️Engineering Note

Practical Power Scaling in 5G NR

In practice, 5G NR does not implement the theoretical 1/Nt1/N_t power scaling directly. Instead, the power control loop adjusts transmit power based on measured SINR and a target BLER (block error rate). However, the array gain from massive MIMO implicitly enables power reduction: users automatically reduce their transmit power when the uplink SINR target is easily met.

Field measurements from Ericsson's 64-antenna massive MIMO trials show 10-15 dB uplink power reduction compared to 4-antenna base stations — consistent with the theoretical prediction of 10log10(64/4)=1210 \log_{10}(64/4) = 12 dB for the array gain.

Practical Constraints
  • 3GPP power control adjusts in 1 dB steps

  • Maximum power reduction limited by cell-edge coverage requirements

  • Pilot power must also be reduced consistently

⚠️Engineering Note

Pilot Overhead and Net Throughput

The rate expressions derived in this chapter are spectral efficiencies per channel use during the data phase. The net throughput must account for pilot overhead:

Rknet=(1τpτc)Rk,R_k^{\text{net}} = \left(1 - \frac{\tau_p}{\tau_c}\right) R_k,

where τp\tau_p is the number of pilot symbols and τc\tau_c is the coherence interval (in symbols). For τc=200\tau_c = 200 (typical for sub-6 GHz at moderate mobility) and τp=K=10\tau_p = K = 10, the overhead is 5%, which is minor. But if K=100K = 100, the overhead is 50% — half the resource is consumed by pilots.

This overhead is the fundamental bottleneck that limits the number of simultaneously served users, regardless of NtN_t.

Practical Constraints
  • 5G NR: SRS overhead depends on periodicity and number of ports

  • High-mobility scenarios reduce τc\tau_c, increasing relative overhead

  • FDD systems have additional downlink pilot overhead

Common Mistake: Forgetting to Scale Pilot Power

Mistake:

When applying power scaling (Pt=Et/NtP_t = E_t / N_t), only scaling the data power while keeping pilot power fixed. This leads to incorrect (optimistic) rate expressions because the estimation quality γk\gamma_k depends on the pilot power.

Correction:

Both data power and pilot power must be scaled consistently. If Pt=Et/NtP_t = E_t / N_t, then typically Ptp=Ep/Nt{P_t}_{p} = E_p / N_t as well. The estimation quality then becomes γk=O(1/Nt)\gamma_k = O(1/N_t), and the product NtγkN_t \gamma_k converges to a positive constant — which is what makes the rate nonzero in the limit. If pilot power is kept fixed, γk\gamma_k remains constant and the rate expression is different (and more optimistic).

Quick Check

What is the maximum power scaling exponent α\alpha (where Pt=Et/NtαP_t = E_t / N_t^\alpha) that yields a nonzero rate as NtN_t \to \infty with MRC and no pilot contamination?

α=0\alpha = 0 (no power reduction)

α=1/2\alpha = 1/2

α=1\alpha = 1

α=2\alpha = 2

Why This Matters: Power Scaling and IoT Device Lifetime

The power scaling result has transformative implications for Internet-of-Things (IoT) devices. Many IoT sensors operate on batteries that must last years without replacement. If a massive MIMO base station with Nt=256N_t = 256 antennas serves an IoT device, the device can reduce its transmit power by a factor of 256 (24 dB).

A device that would drain its battery in 1 year at 100 mW transmit power could last 256 years at the reduced power level (ignoring circuit power). While circuit power consumption prevents this extreme extrapolation, the energy savings from the 1/Nt1/N_t power scaling are the primary physical-layer enabler for massive machine-type communication (mMTC) in 5G NR.

Key Takeaway

Massive MIMO sum rates grow as Θ(Nt)\Theta(N_t) with ZF/MMSE and Θ(Nt/lnNt)\Theta(N_t/\ln N_t) with MRC when users are optimally loaded. Transmit power can be reduced as 1/Nt1/N_t (exponent α=1\alpha = 1) with no rate loss — the array gain perfectly compensates. Pilot contamination reduces the maximum exponent to α=1/2\alpha = 1/2, motivating the decontamination techniques of Chapter 3.