Eigenvalue Decomposition

Why Eigenvalues Matter

Eigenvalues and eigenvectors expose the intrinsic geometry of a linear transformation. When you decompose a matrix into its eigenstructure you discover the directions along which the map acts as simple scaling — the natural modes of the system.

In telecommunications this idea is pervasive:

  • Channel covariance. The eigenvalues of the spatial covariance matrix $\mathbf{R} = \mathbb{E}[\mathbf{h}\mathbf{h}^H]$ quantify how much energy the channel concentrates along each angular direction.
  • MIMO capacity. The capacity-achieving input covariance is diagonal in the eigenbasis of $\mathbf{H}^H \mathbf{H}$; the water-filling solution allocates power to eigenmodes whose eigenvalues (channel gains) exceed a threshold.
  • Vibration and stability. Eigenvalues of a system matrix determine whether feedback loops oscillate, converge, or diverge — a concept inherited directly from the theory of differential equations.
  • Principal Component Analysis (PCA). The dominant eigenvectors of a data covariance matrix are the directions of maximum variance, used for dimensionality reduction in signal processing and machine learning.

Understanding eigendecomposition is therefore not a detour; it is the very language in which channel capacity, beamforming optimality, and adaptive filtering are expressed.

Definition: Eigenvalue and Eigenvector

Let $\mathbf{A} \in \mathbb{C}^{n \times n}$. A scalar $\lambda \in \mathbb{C}$ is an eigenvalue of $\mathbf{A}$ if there exists a nonzero vector $\mathbf{v} \in \mathbb{C}^n$ such that $\mathbf{A}\mathbf{v} = \lambda \mathbf{v}$. The vector $\mathbf{v}$ is called an eigenvector associated with $\lambda$. The set of all eigenvalues is the spectrum of $\mathbf{A}$, denoted $\sigma(\mathbf{A})$.

Eigenvalues are the roots of the characteristic polynomial $p(\lambda) = \det(\mathbf{A} - \lambda \mathbf{I}) = 0$, which is a degree-$n$ polynomial in $\lambda$. By the Fundamental Theorem of Algebra, $p$ has exactly $n$ roots (counted with multiplicity) in $\mathbb{C}$.
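For a concrete instance, here is a tiny numerical check (a sketch assuming numpy; the matrix is made up for illustration, and np.poly applied to a square matrix returns its characteristic-polynomial coefficients):

```python
import numpy as np

# Illustrative 2x2 matrix: trace 5, determinant 4, so p(lambda) = lambda^2 - 5*lambda + 4.
A = np.array([[2.0, 2.0],
              [1.0, 3.0]])

# np.poly on a square matrix returns the (monic) characteristic polynomial
# coefficients, highest degree first: here [1, -5, 4].
coeffs = np.poly(A)
print(coeffs)                 # [ 1. -5.  4.]

# Its roots coincide with the eigenvalues. (Fine for a 2x2 demo; see the
# engineering note later in this section on why root-finding is unsafe at scale.)
print(np.roots(coeffs))       # [4. 1.]
print(np.linalg.eigvals(A))   # [1. 4.] (ordering may differ)
```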

Definition: Characteristic Polynomial

The characteristic polynomial of $\mathbf{A} \in \mathbb{C}^{n \times n}$ is $p(\lambda) = \det(\mathbf{A} - \lambda \mathbf{I})$. It is a polynomial of degree $n$ with leading coefficient $(-1)^n$ (equivalently, $\det(\lambda \mathbf{I} - \mathbf{A})$ is monic). Its roots are precisely the eigenvalues of $\mathbf{A}$.

The algebraic multiplicity of an eigenvalue $\lambda_0$ is its multiplicity as a root of $p(\lambda)$. The geometric multiplicity is $\dim \mathcal{N}(\mathbf{A} - \lambda_0 \mathbf{I})$, i.e., the dimension of the corresponding eigenspace. One always has $1 \leq \text{geometric multiplicity} \leq \text{algebraic multiplicity}$.

When the algebraic and geometric multiplicities coincide for every eigenvalue, the matrix is diagonalizable.

Definition: Eigendecomposition (Diagonalization)

A matrix $\mathbf{A} \in \mathbb{C}^{n \times n}$ is diagonalizable if there exists an invertible matrix $\mathbf{P} \in \mathbb{C}^{n \times n}$ and a diagonal matrix $\mathbf{\Lambda} = \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$ such that $\mathbf{A} = \mathbf{P} \mathbf{\Lambda} \mathbf{P}^{-1}$. The columns of $\mathbf{P}$ are eigenvectors of $\mathbf{A}$, and the diagonal entries of $\mathbf{\Lambda}$ are the corresponding eigenvalues.

A matrix is diagonalizable if and only if it possesses nn linearly independent eigenvectors.

Not every matrix is diagonalizable. For example, the matrix $\mathbf{A} = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$ has eigenvalue $0$ with algebraic multiplicity $2$ but geometric multiplicity $1$. However, every Hermitian (or more generally, normal) matrix is diagonalizable, and moreover can be diagonalized by a unitary matrix.
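A quick numerical illustration of the difference (a minimal numpy sketch; the Hermitian example is an arbitrary real symmetric matrix chosen for contrast):

```python
import numpy as np

# The defective matrix from the text: eigenvalue 0 with algebraic
# multiplicity 2 but only one independent eigenvector.
A_def = np.array([[0.0, 1.0],
                  [0.0, 0.0]])
w, V = np.linalg.eig(A_def)
print(w)                            # [0. 0.]
print(np.linalg.matrix_rank(V))     # 1 (up to tolerance): eigenvectors don't span C^2

# A real symmetric (hence Hermitian) matrix always diagonalizes: A = P Lambda P^{-1}.
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])
w, P = np.linalg.eig(A)
print(np.allclose(A, P @ np.diag(w) @ np.linalg.inv(P)))   # True
```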

Definition: Hermitian Matrix

A matrix $\mathbf{A} \in \mathbb{C}^{n \times n}$ is Hermitian (or self-adjoint) if $\mathbf{A}^H = \mathbf{A}$, where $\mathbf{A}^H = \overline{\mathbf{A}}^T$ denotes the conjugate transpose.

In particular, the diagonal entries of a Hermitian matrix are real, and the off-diagonal entries satisfy $a_{ij} = \overline{a_{ji}}$.

A Hermitian matrix with real entries is called symmetric.

In wireless communications, covariance matrices $\mathbf{R} = \mathbb{E}[\mathbf{x}\mathbf{x}^H]$ and Gram matrices $\mathbf{H}^H \mathbf{H}$ are always Hermitian (and positive semidefinite).
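This is easy to verify empirically. A minimal sketch (assuming numpy; the dimensions, sample count, and seed are arbitrary):

```python
import numpy as np

# Sample covariance R_hat = (1/N) * sum_k x_k x_k^H from N complex snapshots.
rng = np.random.default_rng(0)
n, N = 4, 1000
X = (rng.standard_normal((n, N)) + 1j * rng.standard_normal((n, N))) / np.sqrt(2)
R_hat = X @ X.conj().T / N

print(np.allclose(R_hat, R_hat.conj().T))             # Hermitian: R = R^H
print(np.all(np.linalg.eigvalsh(R_hat) >= -1e-12))    # PSD: eigenvalues >= 0
```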

Theorem: Eigenvalues of Hermitian Matrices Are Real

Let $\mathbf{A} \in \mathbb{C}^{n \times n}$ be Hermitian. Then every eigenvalue of $\mathbf{A}$ is real.

Theorem: Spectral Theorem for Hermitian Matrices

Every Hermitian matrix $\mathbf{A} \in \mathbb{C}^{n \times n}$ can be decomposed as $\mathbf{A} = \mathbf{Q} \mathbf{\Lambda} \mathbf{Q}^H$, where $\mathbf{Q}$ is unitary ($\mathbf{Q}^H \mathbf{Q} = \mathbf{I}$) and $\mathbf{\Lambda} = \operatorname{diag}(\lambda_1, \ldots, \lambda_n)$ with all $\lambda_i \in \mathbb{R}$. Equivalently, $\mathbf{A} = \sum_{i=1}^{n} \lambda_i \, \mathbf{q}_i \mathbf{q}_i^H$, where $\{\mathbf{q}_1, \ldots, \mathbf{q}_n\}$ is an orthonormal basis of eigenvectors.

A Hermitian matrix acts by stretching space along its eigenvectors (which are orthogonal) by real scaling factors (the eigenvalues). It has no "rotational" component. The spectral decomposition separates the geometry (the orthonormal eigenvectors) from the magnitude of the action (the real eigenvalues).
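The theorem is directly checkable in code. A minimal sketch (assuming numpy; the Hermitian test matrix is random):

```python
import numpy as np

# Build a random Hermitian matrix by symmetrizing a complex matrix.
rng = np.random.default_rng(0)
B = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
A = (B + B.conj().T) / 2

# eigh is specialized to Hermitian input: real eigenvalues, unitary Q.
lam, Q = np.linalg.eigh(A)
print(lam.dtype)                                # real spectrum (float, not complex)
print(np.allclose(Q.conj().T @ Q, np.eye(4)))   # True: Q^H Q = I

# Rank-one form of the spectral theorem: A = sum_i lam_i q_i q_i^H.
A_rebuilt = sum(l * np.outer(q, q.conj()) for l, q in zip(lam, Q.T))
print(np.allclose(A, A_rebuilt))                # True
```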

Definition: Rayleigh Quotient

For a Hermitian matrix $\mathbf{A} \in \mathbb{C}^{n \times n}$ and nonzero $\mathbf{x} \in \mathbb{C}^n$, the Rayleigh quotient is $$R(\mathbf{A}, \mathbf{x}) = \frac{\mathbf{x}^H \mathbf{A} \mathbf{x}}{\mathbf{x}^H \mathbf{x}}.$$ The Rayleigh quotient is scale-invariant: $R(\mathbf{A}, \alpha \mathbf{x}) = R(\mathbf{A}, \mathbf{x})$ for any $\alpha \neq 0$.

The Rayleigh quotient is bounded between the smallest and largest eigenvalues of $\mathbf{A}$ (see the theorem Extremal Properties of the Rayleigh Quotient below). In wireless communications, SNR expressions such as $\text{SNR} = \mathbf{w}^H \mathbf{R}_s \mathbf{w} / \mathbf{w}^H \mathbf{R}_n \mathbf{w}$ (signal-to-noise ratio after beamforming with weight vector $\mathbf{w}$) are generalized Rayleigh quotients. Maximizing the SNR thus reduces to an eigenvalue problem.
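To make that concrete, the sketch below solves the max-SNR beamforming problem as a generalized eigenvalue problem with scipy.linalg.eigh; the array size, steering vector, and covariance models are invented for illustration:

```python
import numpy as np
from scipy.linalg import eigh

n = 4                                                  # hypothetical array size
rng = np.random.default_rng(0)
a = np.exp(1j * np.pi * np.arange(n) * np.sin(0.3))    # hypothetical steering vector
Rs = np.outer(a, a.conj())                             # rank-one signal covariance
Rn = np.eye(n) + 0.5 * np.diag(rng.random(n))          # noise covariance (Hermitian PD)

# Maximizing w^H Rs w / w^H Rn w is the generalized eigenproblem Rs w = lambda Rn w.
# eigh returns eigenvalues in ascending order, so the last pair is the maximizer.
lam, W = eigh(Rs, Rn)
w_opt, snr_max = W[:, -1], lam[-1]

snr = (w_opt.conj() @ Rs @ w_opt) / (w_opt.conj() @ Rn @ w_opt)
print(np.isclose(snr.real, snr_max))                   # True: achieved SNR = top eigenvalue
```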

Theorem: Extremal Properties of the Rayleigh Quotient

Let $\mathbf{A} \in \mathbb{C}^{n \times n}$ be Hermitian with eigenvalues $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_n$ and corresponding orthonormal eigenvectors $\mathbf{q}_1, \ldots, \mathbf{q}_n$. Then for all nonzero $\mathbf{x} \in \mathbb{C}^n$: $$\lambda_n \leq R(\mathbf{A}, \mathbf{x}) \leq \lambda_1.$$ Moreover, $$\max_{\mathbf{x} \neq \mathbf{0}} R(\mathbf{A}, \mathbf{x}) = \lambda_1, \quad \text{achieved at } \mathbf{x} = \mathbf{q}_1, \qquad \min_{\mathbf{x} \neq \mathbf{0}} R(\mathbf{A}, \mathbf{x}) = \lambda_n, \quad \text{achieved at } \mathbf{x} = \mathbf{q}_n.$$

Since eigenvectors form an orthonormal basis, any vector x\mathbf{x} is a weighted combination of eigenvectors. The Rayleigh quotient computes a weighted average of the eigenvalues, where the weights are the squared magnitudes of the expansion coefficients. A weighted average always lies between the minimum and maximum values being averaged.

Theorem: Courant–Fischer Min-Max Theorem

Let $\mathbf{A} \in \mathbb{C}^{n \times n}$ be Hermitian with eigenvalues $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_n$. Then for each $k = 1, \ldots, n$: $$\lambda_k = \max_{\substack{\mathcal{V} \subseteq \mathbb{C}^n \\ \dim \mathcal{V} = k}} \; \min_{\substack{\mathbf{x} \in \mathcal{V} \\ \mathbf{x} \neq \mathbf{0}}} R(\mathbf{A}, \mathbf{x}) = \min_{\substack{\mathcal{W} \subseteq \mathbb{C}^n \\ \dim \mathcal{W} = n - k + 1}} \; \max_{\substack{\mathbf{x} \in \mathcal{W} \\ \mathbf{x} \neq \mathbf{0}}} R(\mathbf{A}, \mathbf{x}).$$

The kk-th eigenvalue is the "best worst case" of the Rayleigh quotient over all kk-dimensional subspaces. This characterization is extremely powerful because it does not reference eigenvectors — it describes eigenvalues purely through optimization.

Eigenvectors as Fixed Directions of a Linear Map

Visualize how a 2×2 Hermitian matrix transforms vectors. Eigenvectors (shown in red) maintain their direction, only getting scaled by their eigenvalues. The unit circle is mapped to an ellipse whose semi-axes align with the eigenvectors; the lengths of the semi-axes equal the absolute values of the eigenvalues.

Parameters: top-left entry (default 2), real part of off-diagonal (default 1), imaginary part of off-diagonal (default 0), bottom-right entry (default 1).

Power Iteration Converging to Dominant Eigenvector

Watch how the power iteration $\mathbf{x}_{k+1} = \mathbf{A}\mathbf{x}_k / \|\mathbf{A}\mathbf{x}_k\|$ converges to the dominant eigenvector. At each step, the iterate is shown as a blue arrow; the true dominant eigenvector is shown in red. A bar chart tracks the Rayleigh quotient $R(\mathbf{A}, \mathbf{x}_k)$ converging to $\lambda_1$.

Parameters: number of iterations (default 20); ratio of dominant to subdominant eigenvalue (default 2, controls convergence speed).

Example: Eigendecomposition of a 2×2 Hermitian Matrix

Find the eigendecomposition of $\mathbf{A} = \begin{pmatrix} 3 & 1 - j \\ 1 + j & 1 \end{pmatrix}$.

Example: Eigendecomposition of a 3×3 Real Symmetric Matrix

Find the eigenvalues of $\mathbf{A} = \begin{pmatrix} 2 & 1 & 0 \\ 1 & 3 & 1 \\ 0 & 1 & 2 \end{pmatrix}$.
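Since the solutions are not worked out here, a quick numerical check of both examples may help (a sketch using numpy.linalg.eigh, which applies because both matrices are Hermitian):

```python
import numpy as np

# 2x2 Hermitian example: characteristic polynomial lambda^2 - 4*lambda + 1,
# so the eigenvalues are 2 +/- sqrt(3).
A2 = np.array([[3, 1 - 1j],
               [1 + 1j, 1]])
lam2, Q2 = np.linalg.eigh(A2)
print(lam2)     # [0.2679... 3.7320...] = [2 - sqrt(3), 2 + sqrt(3)]

# 3x3 real symmetric example: eigenvalues 1, 2, 4.
A3 = np.array([[2.0, 1.0, 0.0],
               [1.0, 3.0, 1.0],
               [0.0, 1.0, 2.0]])
lam3, Q3 = np.linalg.eigh(A3)
print(lam3)                                         # [1. 2. 4.]
print(np.allclose(A3, Q3 @ np.diag(lam3) @ Q3.T))   # True: A = Q Lambda Q^T
```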

Historical Note: From Vibrating Strings to Abstract Algebra: The History of Eigenvalues

1740s–1910

The concept of eigenvalues emerged gradually over two centuries of mathematics:

Euler and d'Alembert (1740s–1750s). The study of coupled oscillations and the rotation of rigid bodies led Euler to consider equations of the form $\mathbf{A}\mathbf{x} = \lambda \mathbf{x}$. The term "secular equation" (from the Latin saecularis, relating to long-period astronomical perturbations) was used for the characteristic polynomial.

Cauchy (1829). Augustin-Louis Cauchy proved that the eigenvalues of a real symmetric matrix are real, establishing the first rigorous version of what would become the spectral theorem. He also showed that symmetric matrices have orthogonal eigenvectors.

Sylvester (1852). James Joseph Sylvester introduced the term "matrix" (from the Latin for "womb") and developed much of the algebraic machinery for determinants and characteristic polynomials.

Hilbert (1904–1910). David Hilbert extended the spectral theorem to infinite-dimensional spaces (integral operators), founding the field of functional analysis. His spectral theory is the mathematical backbone of quantum mechanics and modern signal processing.

The word "Eigenwert" (own value) was coined by Hilbert in 1904. The English hybrid "eigenvalue" preserves the German prefix, a testament to the concept's Germanic origins.

Common Mistake: Common Misconceptions About Eigendecomposition

Mistake:

Several common misconceptions about eigendecomposition can lead to errors in analysis and computation.

Correction:

1. Eigendecomposition requires a square matrix. Only square matrices ($n \times n$) have eigenvalues. For rectangular matrices, use the singular value decomposition (SVD) instead (see §Singular Value Decomposition).

2. Eigenvectors are not always orthogonal. For a general (non-Hermitian) matrix, eigenvectors corresponding to distinct eigenvalues need not be orthogonal. Orthogonality is guaranteed only for normal matrices ($\mathbf{A}^H \mathbf{A} = \mathbf{A}\mathbf{A}^H$), of which Hermitian matrices are a special case.

3. Not every matrix is diagonalizable. Defective matrices (where geometric multiplicity < algebraic multiplicity for some eigenvalue) cannot be diagonalized. The Jordan normal form handles such matrices but is numerically unstable and rarely used in practice.

4. Eigenvalues of a product $\neq$ products of eigenvalues (in general). $\sigma(\mathbf{A}\mathbf{B}) \neq \{\lambda_i(\mathbf{A})\lambda_i(\mathbf{B})\}$ unless $\mathbf{A}$ and $\mathbf{B}$ share a common eigenbasis (e.g., when they commute and are both diagonalizable).

5. Numerical eigenvalue computation never uses $\det(\mathbf{A} - \lambda \mathbf{I}) = 0$. Finding roots of the characteristic polynomial is numerically unstable. Practical algorithms (QR iteration, divide-and-conquer) work directly with the matrix and achieve backward-stable results.

🔧Engineering Note

Practical Eigenvalue Algorithms and Their Costs

Never compute eigenvalues via the characteristic polynomial — it is numerically catastrophic even for $4 \times 4$ matrices. Production algorithms:

  • QR iteration (general matrices): $O(n^3)$ per step, converges in $O(n)$ steps. Total: $O(n^4)$ worst case, $O(n^3)$ typical.
  • Divide-and-conquer (symmetric/Hermitian): $O(n^{2.3})$ average, fastest for moderate $n$. Used by numpy.linalg.eigh.
  • Lanczos/Arnoldi (large sparse): $O(k \cdot \text{nnz})$ for the top $k$ eigenvalues, where $\text{nnz}$ is the number of nonzero entries. Used by scipy.sparse.linalg.eigsh.

For MIMO covariance matrices ($n \leq 256$), numpy.linalg.eigh (LAPACK's divide-and-conquer) is optimal. For massive MIMO spatial correlation matrices ($n > 1000$), use Lanczos; a sketch of both calls follows this note.
Practical Constraints
  • LAPACK dsyevd (divide-and-conquer): $\sim 4n^3$ flops for all eigenvalues
  • For channel covariance estimation in 5G NR: $n_t = 32$ → eigendecomposition costs ~130K flops (negligible)
  • For XL-MIMO with $n_t = 1024$: eigendecomposition costs ~4G flops (requires optimization)
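A minimal sketch of both regimes (assuming numpy and scipy; sizes, density, and seeds are arbitrary):

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import eigsh

# Moderate n, dense Hermitian: LAPACK divide-and-conquer via eigh/eigvalsh.
rng = np.random.default_rng(0)
B = rng.standard_normal((256, 256))
A = (B + B.T) / 2
lam_all = np.linalg.eigvalsh(A)          # all eigenvalues, ascending
print(lam_all[-1])                        # largest eigenvalue

# Large sparse symmetric: Lanczos via eigsh for only the top k eigenpairs.
S = sp.random(5000, 5000, density=1e-3, random_state=0)
S = (S + S.T) / 2                         # symmetrize
lam_top, _ = eigsh(S, k=6, which='LA')    # 6 largest (algebraic) eigenvalues
print(lam_top)
```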

🔧Engineering Note

Power Iteration: When Simple Beats Sophisticated

The power iteration converges to the dominant eigenvector with error shrinking by a factor $|\lambda_2/\lambda_1|$ per iteration (i.e., as $|\lambda_2/\lambda_1|^k$ after $k$ steps). Each iteration costs $O(n^2)$ (one matrix–vector product). Despite its simplicity:

  • It is the method of choice when only the largest eigenvalue is needed (e.g., spectral radius for stability analysis, dominant singular value for condition estimation).
  • Google's PageRank is essentially power iteration on a $10^{10} \times 10^{10}$ sparse matrix.
  • In wireless: the dominant eigenvector of $\mathbf{H}^H\mathbf{H}$ gives the optimal beamforming direction. Power iteration computes it with $O(n_t^2)$ per step — much cheaper than a full eigendecomposition (a runnable sketch follows this note).

Convergence is slow when $|\lambda_2/\lambda_1| \approx 1$. In that case, use inverse iteration (which converges to the eigenvector of the smallest eigenvalue at rate $|\lambda_n/\lambda_{n-1}|^k$) or Rayleigh quotient iteration (cubic convergence).
Practical Constraints
  • Power iteration per step: one matrix–vector product ($O(n^2)$ dense, $O(\text{nnz})$ sparse)
  • Typical convergence: 10–50 iterations for $|\lambda_2/\lambda_1| < 0.9$
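The runnable sketch referenced above: a bare-bones power iteration for a Hermitian matrix (here a real PSD Gram matrix standing in for $\mathbf{H}^H\mathbf{H}$; sizes, seed, and iteration count are arbitrary):

```python
import numpy as np

def power_iteration(A, num_iters=50, seed=0):
    """Return (Rayleigh quotient, unit vector) approximating the dominant
    eigenpair of a Hermitian matrix A. A sketch, not production code."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(A.shape[0])
    x = x / np.linalg.norm(x)
    for _ in range(num_iters):
        x = A @ x                       # one matrix-vector product per step
        x = x / np.linalg.norm(x)       # renormalize to avoid overflow
    return (x.conj() @ A @ x).real, x   # Rayleigh quotient -> lambda_1

# Gram matrix H^T H (real case): PSD, so the dominant |eigenvalue| is lambda_max.
rng = np.random.default_rng(1)
H = rng.standard_normal((64, 50))
A = H.T @ H
lam1_pi, v = power_iteration(A)
print(lam1_pi, np.linalg.eigvalsh(A)[-1])   # the two values should agree closely
```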

Key Takeaway

A Hermitian matrix is nothing more than scaling along orthogonal directions: $\mathbf{A} = \mathbf{Q}\mathbf{\Lambda}\mathbf{Q}^H$ says that $\mathbf{A}$ stretches each eigenvector $\mathbf{q}_i$ by the real factor $\lambda_i$ and does nothing else.

Implications for computation: matrix powers become trivial ($\mathbf{A}^k = \mathbf{Q}\mathbf{\Lambda}^k \mathbf{Q}^H$), the inverse is $\mathbf{A}^{-1} = \mathbf{Q}\mathbf{\Lambda}^{-1}\mathbf{Q}^H$ (when it exists), and any analytic function $f(\mathbf{A})$ is simply $\mathbf{Q}\operatorname{diag}(f(\lambda_1), \ldots, f(\lambda_n))\mathbf{Q}^H$.
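A short check of these identities (a sketch assuming numpy and scipy; the test matrix is a random real symmetric one):

```python
import numpy as np
from scipy.linalg import expm

rng = np.random.default_rng(0)
B = rng.standard_normal((4, 4))
A = (B + B.T) / 2                  # Hermitian (real symmetric) test matrix
lam, Q = np.linalg.eigh(A)

# f(A) = Q diag(f(lambda_i)) Q^H, demonstrated here with f = exp.
expA = Q @ np.diag(np.exp(lam)) @ Q.T
print(np.allclose(expA, expm(A)))             # True

# The same pattern yields the inverse when no eigenvalue is zero.
A_inv = Q @ np.diag(1.0 / lam) @ Q.T
print(np.allclose(A_inv @ A, np.eye(4)))      # True
```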

Why This Matters: Eigenvalues and MIMO Channel Capacity

Consider a narrowband MIMO channel with $n_t$ transmit and $n_r$ receive antennas, modeled as $\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{n}$, where $\mathbf{H} \in \mathbb{C}^{n_r \times n_t}$ is the channel matrix and $\mathbf{n} \sim \mathcal{CN}(\mathbf{0}, \sigma^2 \mathbf{I})$.

When channel state information is available at the transmitter, the capacity is $$C = \max_{\substack{\mathbf{R}_x \succeq 0 \\ \operatorname{tr}(\mathbf{R}_x) \leq P}} \log_2 \det\!\left(\mathbf{I} + \frac{1}{\sigma^2}\mathbf{H}\mathbf{R}_x \mathbf{H}^H\right).$$ The Hermitian matrix $\mathbf{H}^H \mathbf{H}$ has eigendecomposition $\mathbf{H}^H \mathbf{H} = \mathbf{V}\mathbf{\Lambda}\mathbf{V}^H$, where $\lambda_1 \geq \cdots \geq \lambda_{n_t} \geq 0$. The optimal input covariance is $\mathbf{R}_x^{\star} = \mathbf{V} \operatorname{diag}(p_1, \ldots, p_{n_t}) \mathbf{V}^H$, and the capacity simplifies to $$C = \sum_{i=1}^{r} \log_2\!\left(1 + \frac{p_i \lambda_i}{\sigma^2}\right),$$ where $r = \operatorname{rank}(\mathbf{H})$ and the powers $\{p_i\}$ are chosen by water-filling: $p_i = \left(\mu - \frac{\sigma^2}{\lambda_i}\right)^+$ with $\mu$ set to satisfy the power constraint $\sum_i p_i = P$.

Key insight: The eigenvalues $\lambda_i$ of $\mathbf{H}^H\mathbf{H}$ determine the channel gains of the independent spatial modes. Large eigenvalues correspond to strong modes that receive more power; weak modes (small $\lambda_i$) may be shut off entirely. The eigendecomposition transforms the coupled MIMO channel into a set of parallel, independent scalar channels — the conceptual simplification that makes MIMO tractable.
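A minimal water-filling sketch implementing the formulas above; the function name, the bisection search for the water level $\mu$, and the example eigenvalues are illustrative assumptions rather than a standard API:

```python
import numpy as np

def water_filling(gains, P, sigma2=1.0):
    """Allocate total power P over eigenmode gains lambda_i via
    p_i = (mu - sigma2/lambda_i)^+ with sum_i p_i = P. A sketch
    using bisection on mu; not a production routine."""
    gains = np.asarray(gains, dtype=float)
    lo, hi = 0.0, P + sigma2 / gains.min()     # mu is bracketed in [lo, hi]
    for _ in range(100):
        mu = (lo + hi) / 2
        p = np.maximum(mu - sigma2 / gains, 0.0)
        if p.sum() > P:
            hi = mu
        else:
            lo = mu
    return p

# Hypothetical eigenmode gains of H^H H and a power budget of P = 10.
gains = np.array([4.0, 1.0, 0.25, 0.01])
p = water_filling(gains, P=10.0)
C = np.sum(np.log2(1 + p * gains))    # capacity with sigma2 = 1
print(p, p.sum(), C)                  # the weakest mode receives zero power
```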

Quick Check

Let $\mathbf{A} = \begin{pmatrix} 4 & 2j \\ -2j & 4 \end{pmatrix}$. What are the eigenvalues of $\mathbf{A}$?

$\lambda_1 = 6, \; \lambda_2 = 2$

$\lambda_1 = 4 + 2j, \; \lambda_2 = 4 - 2j$

$\lambda_1 = 4, \; \lambda_2 = 4$

$\lambda_1 = 8, \; \lambda_2 = 0$

Quick Check

A $3 \times 3$ Hermitian matrix has eigenvalues $5, 3, 1$. What is the maximum value of $R(\mathbf{A}, \mathbf{x})$ over all nonzero $\mathbf{x}$?

1

3

5

9

Quick Check

Which of the following matrices is guaranteed to be diagonalizable by a unitary matrix?

Any invertible matrix

Any upper triangular matrix

Any Hermitian matrix

Any matrix with distinct eigenvalues

Quick Check

If $\mathbf{A} \in \mathbb{C}^{4 \times 4}$ has eigenvalues $3, 1, -1, -2$, what is $\operatorname{tr}(\mathbf{A}^2)$?

3

5

15

19

Eigenvalue

A scalar $\lambda$ such that $\mathbf{A}\mathbf{v} = \lambda\mathbf{v}$ for some nonzero vector $\mathbf{v}$. Equivalently, a root of the characteristic polynomial $\det(\mathbf{A} - \lambda \mathbf{I}) = 0$. For Hermitian matrices, all eigenvalues are real.

Spectral Theorem

The result stating that every Hermitian matrix $\mathbf{A} \in \mathbb{C}^{n \times n}$ admits a decomposition $\mathbf{A} = \mathbf{Q}\mathbf{\Lambda}\mathbf{Q}^H$, where $\mathbf{Q}$ is unitary and $\mathbf{\Lambda}$ is real diagonal. This guarantees the existence of an orthonormal eigenbasis with real eigenvalues.

Rayleigh Quotient

For a Hermitian matrix $\mathbf{A}$ and nonzero vector $\mathbf{x}$, the ratio $R(\mathbf{A}, \mathbf{x}) = \mathbf{x}^H \mathbf{A} \mathbf{x} / \mathbf{x}^H \mathbf{x}$. It is bounded by the extreme eigenvalues of $\mathbf{A}$ and equals an eigenvalue when $\mathbf{x}$ is a corresponding eigenvector.