Branching Processes and Compound Distributions

From Sums to Compositions

The PGF product rule handles sums of a fixed number of independent RVs. But what if the number of summands is itself random? This arises naturally: the total number of packets in a burst is a random sum of individual packet sizes; the total load on a server is the sum of a random number of jobs.

The compounding theorem says that the PGF of a random sum is a composition of PGFs: GS(s)=GN(GX(s))G_S(s) = G_N(G_X(s)). This simple formula has far-reaching consequences, including the complete analysis of Galton-Watson branching processes.

Theorem: The Compounding Theorem (Random Sums)

Let X1,X2,…X_1, X_2, \ldots be i.i.d. nonneg. integer-valued RVs with common PGF GX(s)G_X(s), and let NN be a nonneg. integer-valued RV independent of the XiX_i's with PGF GN(s)G_N(s). Then the random sum

S=βˆ‘i=1NXiS = \sum_{i=1}^{N} X_i

has PGF

GS(s)=GN(GX(s)).G_S(s) = G_N(G_X(s)).

Condition on N=nN = n: given N=nN = n, SS is a fixed sum of nn i.i.d. RVs, so E[sS∣N=n]=[GX(s)]n\mathbb{E}[s^S | N = n] = [G_X(s)]^n. Average over NN: GS(s)=βˆ‘n[GX(s)]n pN[n]=GN(GX(s))G_S(s) = \sum_n [G_X(s)]^n\,p_N[n] = G_N(G_X(s)).

Example: Compound Poisson: Eggs and Chicks

A hen lays N∼Poi(λ)N \sim \text{Poi}(\lambda) eggs. Each egg hatches independently with probability pp (Bernoulli). How many chicks KK does she produce?

Definition:

Galton-Watson Branching Process

A Galton-Watson branching process {Zn:n=0,1,2,…}\{Z_n : n = 0, 1, 2, \ldots\} evolves as follows:

  • Z0=1Z_0 = 1 (start with one individual).
  • Each individual in generation nn independently produces a random number of offspring with common PMF pp and PGF G(s)G(s).
  • Zn+1=βˆ‘i=1ZnYiZ_{n+1} = \sum_{i=1}^{Z_n} Y_i where YiY_i are i.i.d. with PGF GG.

The mean offspring number is ΞΌ=Gβ€²(1)=E[Z1]\mu = G'(1) = \mathbb{E}[Z_1] and the variance is Οƒ2=Gβ€²β€²(1)+Gβ€²(1)βˆ’[Gβ€²(1)]2\sigma^2 = G''(1) + G'(1) - [G'(1)]^2.

,

Theorem: PGF of the nn-th Generation

The PGF of the population size at generation nn is the nn-fold composition:

Gn(s)=GZn(s)=G(G(β‹―G(s)⋯ ))⏟nΒ times=G∘n(s).G_n(s) = G_{Z_n}(s) = \underbrace{G(G(\cdots G(s)\cdots))}_{n\text{ times}} = G^{\circ n}(s).

Moreover, Gn+m(s)=Gm(Gn(s))=Gn(Gm(s))G_{n+m}(s) = G_m(G_n(s)) = G_n(G_m(s)).

Theorem: Moments of the Branching Process

E[Zn]=ΞΌn,Var(Zn)={nΟƒ2ifΒ ΞΌ=1,Οƒ2(ΞΌnβˆ’1)ΞΌnβˆ’1ΞΌβˆ’1ifΒ ΞΌβ‰ 1.\mathbb{E}[Z_n] = \mu^n, \qquad \text{Var}(Z_n) = \begin{cases} n\sigma^2 & \text{if } \mu = 1, \\ \frac{\sigma^2(\mu^n - 1)\mu^{n-1}}{\mu - 1} & \text{if } \mu \neq 1. \end{cases}$

Theorem: Extinction Probability of a Branching Process

The probability of ultimate extinction

Ξ·=lim⁑nβ†’βˆžP(Zn=0)=lim⁑nβ†’βˆžGn(0)\eta = \lim_{n \to \infty}\mathbb{P}(Z_n = 0) = \lim_{n \to \infty} G_n(0)

is the smallest non-negative solution of s=G(s)s = G(s).

  • If μ≀1\mu \leq 1 (and Οƒ2>0\sigma^2 > 0): Ξ·=1\eta = 1 (certain extinction).
  • If ΞΌ>1\mu > 1: Ξ·<1\eta < 1 (survival with positive probability 1βˆ’Ξ·1-\eta).

Extinction occurs when the population hits zero β€” an absorbing state. The fixed-point equation Ξ·=G(Ξ·)\eta = G(\eta) arises because the process either dies in generation 11 with probability G(0)G(0), or reaches some generation-11 size and then each sub-tree independently faces the same extinction question.

,

Branching Process: PGF Fixed Points and Extinction

Explore the Galton-Watson branching process. The plot shows y=G(s)y = G(s) and y=sy = s. The extinction probability Ξ·\eta is the smallest intersection point. Adjust the offspring distribution to see how ΞΌ=Gβ€²(1)\mu = G'(1) determines whether extinction is certain (μ≀1\mu \leq 1) or not (ΞΌ>1\mu > 1).

Parameters
0.3
0.3
0.4

Example: Geometric Offspring Distribution

Suppose the offspring distribution is geometric: p[k]=(1βˆ’p)pkp[k] = (1-p)p^k for k=0,1,2,…k = 0, 1, 2, \ldots with PGF G(s)=(1βˆ’p)/(1βˆ’ps)G(s) = (1-p)/(1-ps). Find the extinction probability.

πŸŽ“CommIT Contribution(2001)

Branching Process Analysis of Random Access Protocols

G. Caire, D. Tuninetti β€” IEEE Transactions on Information Theory

The Galton-Watson branching process has direct applications to the analysis of tree-splitting random access protocols (ALOHA variants). When a collision occurs, each colliding user independently retransmits with probability pp, creating a branching structure. The stability condition β€” whether the backlog grows unboundedly β€” corresponds precisely to the subcritical condition ΞΌ<1\mu < 1. Caire and Tuninetti analyzed the throughput of coded random access using similar branching arguments, connecting the extinction probability to the packet loss rate.

random-accessbranching-processesthroughput-analysisView Paper β†’

Historical Note: Galton, Watson, and the Problem of Family Names

19th-20th century

In 1874, Francis Galton posed a question to the readers of the Educational Times: what is the probability that a family name eventually goes extinct? Galton was motivated by observing that many once-prominent English surnames had disappeared. Reverend Henry Watson responded with the generating function analysis β€” essentially the theory developed in this section.

The irony is that Watson's original solution contained an error: he concluded that extinction was certain regardless of the mean offspring number. The correct result β€” that survival is possible when ΞΌ>1\mu > 1 β€” was established by Steffensen (1930) and later clarified by Harris (1963).

Waring's Theorem and Inclusion-Exclusion

The PGF also enables elegant combinatorial results. Waring's theorem provides a formula for P(X=k)\mathbb{P}(X = k) where XX counts the number of events from A1,…,AnA_1, \ldots, A_n that occur:

P(X=k)=βˆ‘m=kn(βˆ’1)mβˆ’k(mk)Sm,\mathbb{P}(X = k) = \sum_{m=k}^{n} (-1)^{m-k}\binom{m}{k}S_m,

where Sm=βˆ‘i1<β‹―<imP(Ai1βˆ©β‹―βˆ©Aim)S_m = \sum_{i_1 < \cdots < i_m}\mathbb{P}(A_{i_1} \cap \cdots \cap A_{i_m}). The proof is a direct consequence of the relation GX(s)=GS(sβˆ’1)G_X(s) = G_S(s-1) between the PGF of XX and the generating function of the SmS_m sequence. This is the probabilistic generalization of inclusion-exclusion.

Quick Check

In a Galton-Watson branching process, the offspring PGF satisfies G(s)=0.2+0.5s+0.3s2G(s) = 0.2 + 0.5s + 0.3s^2. What is the mean offspring number ΞΌ\mu, and will the process go extinct with probability 11?

ΞΌ=1.1\mu = 1.1, extinction probability <1< 1

ΞΌ=1.1\mu = 1.1, extinction probability =1= 1

ΞΌ=0.8\mu = 0.8, extinction probability =1= 1

ΞΌ=1.0\mu = 1.0, extinction probability =1= 1

Galton-Watson Branching Process

A stochastic process {Zn}\{Z_n\} where each individual independently produces offspring with a common distribution. Z0=1Z_0 = 1, Zn+1=βˆ‘i=1ZnYiZ_{n+1} = \sum_{i=1}^{Z_n} Y_i. The PGF of ZnZ_n is G∘n(s)G^{\circ n}(s).

Related: Probability Generating Function (PGF), Rate Function

Compound Distribution

The distribution of S=βˆ‘i=1NXiS = \sum_{i=1}^N X_i where NN is random and independent of the i.i.d. XiX_i's. Its PGF is GS(s)=GN(GX(s))G_S(s) = G_N(G_X(s)).

Related: Probability Generating Function (PGF), Galton-Watson Branching Process