Stationary Distribution

Long-Run Behavior

The central question of Markov chain theory: if we run the chain for a very long time, does the distribution of XnX_n settle down to a limit, independent of where the chain started? The answer is yes, under the right conditions, and the limiting distribution is the stationary distribution Ο€\boldsymbol{\pi}. This distribution governs the long-run fraction of time spent in each state and is the key to computing steady-state performance metrics in communications systems.

Definition:

Stationary (Invariant) Distribution

A probability distribution Ο€=(Ο€1,Ο€2,…)\boldsymbol{\pi} = (\pi_1, \pi_2, \ldots) on the state space S\mathcal{S} is a stationary distribution (or invariant distribution) for the DTMC with transition matrix P\mathbf{P} if

Ο€=Ο€P,\boldsymbol{\pi} = \boldsymbol{\pi} \mathbf{P},

i.e., Ο€j=βˆ‘i∈SΟ€i pij\pi_j = \sum_{i \in \mathcal{S}} \pi_i \, p_{ij} for all j∈Sj \in \mathcal{S}, with Ο€jβ‰₯0\pi_j \geq 0 and βˆ‘jΟ€j=1\sum_j \pi_j = 1.

If X0βˆΌΟ€X_0 \sim \boldsymbol{\pi}, then XnβˆΌΟ€X_n \sim \boldsymbol{\pi} for all nn: the distribution does not change over time.

The equation Ο€=Ο€P\boldsymbol{\pi} = \boldsymbol{\pi} \mathbf{P} says that Ο€\boldsymbol{\pi} is a left eigenvector of P\mathbf{P} for eigenvalue 1. Since P\mathbf{P} is stochastic, eigenvalue 1 always exists (the all-ones vector is a right eigenvector). The question is whether a non-negative left eigenvector with entries summing to 1 exists and is unique.

,

Stationary distribution

A probability vector Ο€\boldsymbol{\pi} satisfying Ο€=Ο€P\boldsymbol{\pi} = \boldsymbol{\pi}\mathbf{P}. If the chain starts in this distribution, it stays in it forever.

Related: Stationary (Invariant) Distribution

Theorem: Existence and Uniqueness of the Stationary Distribution

Let {Xn}\{X_n\} be an irreducible DTMC on a countable state space S\mathcal{S}.

  1. If the chain is positive recurrent (i.e., the expected return time Ei[Ti]<∞\mathbb{E}_i[T_i] < \infty for some, hence all, states ii), then there exists a unique stationary distribution given by

    Ο€i=1Ei[Ti],i∈S,\pi_i = \frac{1}{\mathbb{E}_i[T_i]}, \quad i \in \mathcal{S},

    where Ti=min⁑{nβ‰₯1:Xn=i}T_i = \min\{n \geq 1 : X_n = i\} is the first return time.

  2. If the chain is transient or null recurrent (Ei[Ti]=∞\mathbb{E}_i[T_i] = \infty), then no stationary distribution exists.

For a finite irreducible chain, positive recurrence is automatic, so a unique stationary distribution always exists.

The stationary probability of state ii is inversely proportional to the mean return time: states that take longer to revisit are less likely to be occupied in steady state. For finite chains, every irreducible chain is positive recurrent because there are only finitely many states to wander among.

,

Example: Stationary Distribution of the Two-State Chain

Find the stationary distribution of the two-state chain from Example ETwo-State Markov Chain (Sunny/Rainy Weather) with P=(1βˆ’Ξ±Ξ±Ξ²1βˆ’Ξ²)\mathbf{P} = \begin{pmatrix} 1 - \alpha & \alpha \\ \beta & 1 - \beta \end{pmatrix}, where Ξ±=0.3\alpha = 0.3, Ξ²=0.5\beta = 0.5.

Example: Doubly Stochastic Chains

A DTMC on S={1,2,…,M}\mathcal{S} = \{1, 2, \ldots, M\} has a doubly stochastic transition matrix (both rows and columns sum to 1). Find the stationary distribution.

Theorem: Convergence Theorem (Ergodic Theorem for Markov Chains)

Let {Xn}\{X_n\} be an irreducible, aperiodic, positive recurrent DTMC with stationary distribution Ο€\boldsymbol{\pi}. Then for any initial distribution:

lim⁑nβ†’βˆžpij(n)=Ο€j,forΒ allΒ i,j∈S.\lim_{n \to \infty} p_{ij}^{(n)} = \pi_j, \quad \text{for all } i, j \in \mathcal{S}.

In matrix form: lim⁑nβ†’βˆžPn=1Ο€\lim_{n \to \infty} \mathbf{P}^{n} = \mathbf{1} \boldsymbol{\pi}, where 1Ο€\mathbf{1} \boldsymbol{\pi} is the matrix with every row equal to Ο€\boldsymbol{\pi}.

A chain satisfying all three conditions (irreducible, aperiodic, positive recurrent) is called ergodic.

After sufficiently many steps, the chain "forgets" its initial state and settles into the stationary distribution. Every row of Pn\mathbf{P}^{n} becomes approximately Ο€\boldsymbol{\pi}, regardless of the starting state.

,

What Happens When the Chain is Periodic?

If the chain is irreducible and positive recurrent but has period d>1d > 1, then pii(n)p_{ii}^{(n)} does not converge to Ο€i\pi_i. Instead, pii(n)p_{ii}^{(n)} is 0 unless d∣nd \mid n, and lim⁑nβ†’βˆž,d∣npii(n)=dβ‹…Ο€i\lim_{n \to \infty, d \mid n} p_{ii}^{(n)} = d \cdot \pi_i. The time-averaged version still holds:

1nβˆ‘k=0nβˆ’1pij(k)β†’Ο€jasΒ nβ†’βˆž.\frac{1}{n} \sum_{k=0}^{n-1} p_{ij}^{(k)} \to \pi_j \quad \text{as } n \to \infty.

So the stationary distribution still governs long-run averages, even without pointwise convergence.

Computing and Visualizing the Stationary Distribution

Enter a transition matrix and see its stationary distribution computed as the left eigenvector for eigenvalue 1. Also displays the row distribution of Pn\mathbf{P}^{n} converging to Ο€\boldsymbol{\pi} (for ergodic chains).

Parameters
0

Track the distribution starting from this state

Historical Note: Perron-Frobenius and the Spectral Theory of Non-negative Matrices

Early 20th century

The existence and uniqueness of the stationary distribution is intimately connected to the Perron-Frobenius theorem (Oskar Perron, 1907; Georg Frobenius, 1912). For a positive (or irreducible non-negative) matrix, this theorem guarantees a unique largest eigenvalue that is real and positive, with a corresponding eigenvector that has all positive entries. For stochastic matrices, this largest eigenvalue is exactly 1, and the corresponding left eigenvector (normalized) is the stationary distribution. Google's PageRank algorithm is arguably the most famous modern application of Perron-Frobenius theory applied to Markov chains.

Quick Check

For a doubly stochastic irreducible chain on 5 states, what is Ο€3\pi_3?

1/31/3

1/51/5

Cannot determine without knowing P\mathbf{P}

00

Common Mistake: Stationary Distribution vs Limiting Distribution

Mistake:

Assuming that if Ο€\boldsymbol{\pi} is a stationary distribution, then pij(n)β†’Ο€jp_{ij}^{(n)} \to \pi_j. This conflates existence of Ο€\boldsymbol{\pi} with convergence.

Correction:

Convergence pij(n)β†’Ο€jp_{ij}^{(n)} \to \pi_j requires aperiodicity in addition to irreducibility and positive recurrence. A periodic chain has a stationary distribution but pij(n)p_{ij}^{(n)} oscillates and does not converge pointwise.

Key Takeaway

An irreducible, positive recurrent DTMC has a unique stationary distribution π\boldsymbol{\pi} with πi=1/Ei[Ti]\pi_i = 1/\mathbb{E}_i[T_i]. Adding aperiodicity gives the convergence theorem: Pn→1π\mathbf{P}^{n} \to \mathbf{1}\boldsymbol{\pi}. For finite irreducible chains, positive recurrence is automatic — only aperiodicity needs checking.