Visibility Regions in XL-MIMO
The End of Spatial Stationarity
Throughout Chapters 1β16 we treated the base-station array as a compact, homogeneous sensor: every antenna element saw every user through the same fading process, perhaps with spatial correlation. That picture rests on a hidden assumption: the physical aperture of the array is small compared to both the propagation range and the scatterer spread. When grows to metres β the so-called extra-large or XL-MIMO regime β the assumption fails. A user on one side of the array may be hidden from the other side by blockage, by range-dependent path loss across the aperture, or simply because its scatterers subtend a limited angular window that only a fraction of the array can resolve. The energy a user radiates lives on a spatial subset of the array which we call its visibility region (VR).
Every piece of machinery we built for stationary massive MIMO β orthogonal pilots, MMSE estimation with a single , channel hardening, favorable propagation β degrades gracefully once VRs shrink below the full aperture. Worse, failing to estimate the VR and treating out-of-VR antennas as useful only pumps noise into the combiner. This chapter develops a principled pipeline: detect the VR, estimate the channel on the VR, and decode. The CommIT contribution of Xu and Caire (a 2D Markov prior on the VR mask) gives this pipeline its statistical backbone.
Definition: The XL-MIMO Regime
The XL-MIMO Regime
An array is called extra-large MIMO (XL-MIMO) when its physical aperture is large enough that at least one of the following fails over the typical user range :
- Far-field: , so the wavefront is planar across the array.
- Power stationarity: the per-antenna received power is approximately constant across antennas .
- Angular stationarity: the angular power spectrum seen by antenna is (approximately) the same for every .
Representative numbers at GHz ( cm) with m give m, so any user inside the cell can sit in the near field and simultaneously see only part of the array. Similar numbers arise at mmWave with cm and GHz.
The name is pragmatic. "XL-MIMO" emphasizes that the aperture β not the antenna count β is what breaks the stationary model. A dense 256-element sub-wavelength array with is still comfortably stationary; a sparse 64-element array spread over m is not.
Definition: Visibility Region and Binary Mask
Visibility Region and Binary Mask
Let denote the full-aperture uplink channel of user . The visibility region of user is the index set of antennas whose received signal is above the noise floor with non-negligible probability:
for a small threshold (typically ). Its binary mask is
On a uniform planar array with elements, we reshape into a 2D mask . The observed channel is the element-wise product
where is the idealized stationary channel that would be observed if the whole aperture were illuminated.
The threshold is a modelling choice, not a physical constant. Soft masks are natural generalizations but do not change the structure of the problem. In the joint estimation of Section 18.5, we replace the hard mask by the variational marginal .
Visibility region (VR)
The subset of antennas of an XL-MIMO array on which a given user's received power is above the noise floor. Equivalent to the support of the binary mask . For stationary massive MIMO, and the concept collapses to the classical model.
Related: Subarray Partition of an XL-MIMO Array, Spatial non-stationarity, Subarray Partition of an XL-MIMO Array
Spatial non-stationarity
The property that the second-order statistics of the channel (, angular spread, delay spread) vary across the antennas of a single array. In XL-MIMO this arises from near-field range variation across the aperture, blockage of subsets of the array, and user-dependent multipath clustering.
Related: Visibility region (VR), Subarray Partition of an XL-MIMO Array
Three Physical Causes of VRs
VRs arise from three distinct physical mechanisms, often acting together:
-
Aperture-range geometry. For a user at range with aperture and , the spherical wavefront curvature causes the per-antenna amplitude to roll off as where is the distance from the user to antenna . Far edges of the array see weaker signals.
-
Blockage. A human body, a vehicle, a pillar, or a wall can block the line of sight between the user and a contiguous block of antennas. Blockages are essentially binary and strongly correlated spatially.
-
Multipath clustering. Finite scatterers around the user illuminate only a limited angular window. The corresponding subset of the array that sees the cluster defines the VR. The VR boundary is soft but still localized.
All three mechanisms produce spatially contiguous VRs. This is the statistical regularity the 2D Markov prior of Section 18.2 exploits.
Visibility Region on a 2D Array
Generate a synthetic VR on an UPA. The heatmap shows the per-antenna received power after masking by the VR. Try changing the VR size, shape, and SNR. Observe how the VR is a contiguous spatial blob, not a random subset of antennas.
Parameters
Theorem: LS Estimation Penalty from VR Mismatch
Consider the LS channel estimate under orthonormal pilots , with the receiver applying a hard mask to the estimate. Assume the true channel on the true VR is and zero off , and let . Then the normalized mean-square error is
where and is the support of .
The first term is noise leakage: every antenna the receiver declares active but that carries no signal contributes pure noise β bigger means more noise. The second term is missed energy: every antenna inside the true VR that the receiver misses loses its signal entirely. Shrinking trades one penalty for the other, and the optimum is the true VR.
Split the error into three antenna sets
Partition the index set into (hit), (false alarm), (miss), = rest. On , both and are zero by the mask, so it contributes nothing.
Compute errors on A, B, C
On : with , so . On : but , so . On : but is a fresh , contributing .
Normalize
Total error is . Normalizing by and noting yields the stated formula.
Key Takeaway
VR mismatch is a two-sided penalty. Declaring too many antennas active floods the combiner with noise (); declaring too few throws away signal energy (). Every VR detection algorithm in this chapter is, at heart, an effort to balance these two costs.
Example: How Large is the VR Mismatch Penalty?
An XL-MIMO array has antennas. A user's true VR has antennas. The operating SNR is dB (). Compare the NMSE of three detectors: (a) "All on" (), (b) "True VR" (), (c) "Too small" with , containing true-VR antennas and out-of-VR antennas.
All on
, miss = 0. .
True VR
, miss = 0. .
Too small
, miss , so miss fraction . .
Interpretation
The correct VR is 4x more accurate than "all on" and 5x more accurate than the undersized estimate. Crucially, over-detection degrades linearly with , while under-detection degrades linearly with the miss count but with a much larger coefficient ( for useful SNRs). The break-even point favors being slightly generous β but not as generous as the full aperture.
Common Mistake: Averaging Across VRs Destroys Information
Mistake:
A tempting shortcut is to compute a single spatial covariance and use it as a stationary surrogate for every user. This mimics the massive-MIMO pipeline and seems to retain the "many antennas β good estimates" benefit.
Correction:
Averaging mixes disjoint VRs and hands the estimator a covariance with support covering the union of all VRs. The resulting MMSE estimator pumps noise through every antenna, cancelling the VR advantage. The right object is the per-user spatial covariance estimated from that user's pilot correlator alone, regularized by the 2D Markov prior of Section 18.2. The computational cost is controlled by the subarray decomposition of Section 18.3.
Historical Note: From Holographic MIMO to XL-MIMO
2014βpresentThe idea that spatial non-stationarity would eventually dominate massive MIMO has a long prehistory. Marzetta's original 2010 proposal assumed a compact antenna panel where the stationary Rayleigh model was a good approximation. By 2014 Payami and Tufvesson had measured non-stationarity on a 7.4 m linear array at 2.6 GHz and reported that users illuminated only 20β30% of the aperture β the first empirical demonstration of VRs. Theoretical follow-ups by Amiri, Angjelichinoski, de Carvalho, and Popovski (2018β2020) introduced the term "visibility region" and formulated sparsity-based estimation. BjΓΆrnson and Sanguinetti (2019) argued that holographic and XL-MIMO arrays will be the dominant regime for 6G. By 2022 the CommIT group (Xu and Caire) had developed the 2D Markov prior framework that is the backbone of this chapter.
Why This Matters: XL-MIMO in 6G Radio Architectures
Two 6G deployment candidates force XL-MIMO into the spotlight: distributed massive MIMO panels covering entire facades or ceilings (sometimes called holographic surfaces), and sub-THz arrays where the wavelength is so short that even a 20 cm panel holds thousands of elements and sits in the near field for any indoor user. In both scenarios the stationary model from Part I is hopelessly optimistic, and the techniques of this chapter become mandatory. The connection to RIS (Chapter 21) is also direct: an RIS is physically an XL aperture with its own visibility structure determined by the active user location.
Quick Check
An XL-MIMO array has antennas. A user's VR covers 512 antennas. The operating SNR is 0 dB. Which of the four NMSE values below corresponds to a detector that correctly identifies the VR (no false alarms, no misses)?
0.25
1.00
4.00
0.00
For the true VR, and the miss term is zero. at SNR = 0 dB.