Ferkans — Interactive Telecom Tutor

Beyond Scalar Functions: Applying Functions to Matrices

A matrix function $f(\mathbf{A})$ is not simply applying $f$ element-wise. The matrix exponential $e^{\mathbf{A}}$ , for example, solves the ODE $\dot{\mathbf{x}} = \mathbf{A}\mathbf{x}$ and arises in control theory, quantum mechanics, and the analysis of Markov chains. Special structured matrices (Toeplitz, circulant, Hankel) appear in signal processing because convolution is a Toeplitz matrix-vector product and can be diagonalized by the DFT.

Definition:
Matrix Exponential

The matrix exponential of $\mathbf{A} \in \mathbb{C}^{n \times n}$ is:

$e^{\mathbf{A}} = \sum_{k=0}^{\infty} \frac{\mathbf{A}^k}{k!} = \mathbf{I} + \mathbf{A} + \frac{\mathbf{A}^2}{2!} + \frac{\mathbf{A}^3}{3!} + \cdots$

from scipy.linalg import expm
E = expm(A)

SciPy uses the Pade approximation with scaling and squaring (the Al-Mohy & Higham algorithm), which is numerically stable.

Key property: $e^{\mathbf{A}} e^{\mathbf{B}} = e^{\mathbf{A}+\mathbf{B}}$ only when $\mathbf{A}\mathbf{B} = \mathbf{B}\mathbf{A}$ (i.e., $\mathbf{A}$ and $\mathbf{B}$ commute).

Do not use np.exp(A) for the matrix exponential — that computes the element-wise exponential $[\exp(A_{ij})]$ , which is completely different.

Definition:
Matrix Logarithm and Square Root

The matrix logarithm $\log(\mathbf{A})$ satisfies $e^{\log(\mathbf{A})} = \mathbf{A}$ (when $\mathbf{A}$ has no nonpositive real eigenvalues).

The matrix square root $\mathbf{A}^{1/2}$ satisfies $\mathbf{A}^{1/2}\mathbf{A}^{1/2} = \mathbf{A}$ (for positive definite $\mathbf{A}$ ).

from scipy.linalg import logm, sqrtm

L = logm(A)           # matrix logarithm
S = sqrtm(A)          # principal matrix square root

The matrix square root of a covariance matrix $\mathbf{R}$ is used to generate correlated random vectors: $\mathbf{x} = \mathbf{R}^{1/2}\mathbf{z}$ where $\mathbf{z} \sim \mathcal{CN}(\mathbf{0}, \mathbf{I})$ .

Definition:
Toeplitz Matrix

A Toeplitz matrix has constant diagonals: $T_{ij} = t_{i-j}$

$\mathbf{T} = \begin{bmatrix} t_0 & t_{-1} & t_{-2} & \cdots \\ t_1 & t_0 & t_{-1} & \cdots \\ t_2 & t_1 & t_0 & \cdots \\ \vdots & \vdots & \vdots & \ddots \end{bmatrix}$

from scipy.linalg import toeplitz

c = [2, 1, 0, 0]     # first column
r = [2, -1, 0, 0]    # first row
T = toeplitz(c, r)

Toeplitz matrices represent linear time-invariant (LTI) filters: the output of a discrete convolution $y[n] = \sum_k h[k] x[n-k]$ can be written as $\mathbf{y} = \mathbf{T}_h \mathbf{x}$ where $\mathbf{T}_h$ is Toeplitz with the filter coefficients.

Definition:
Circulant Matrix and the DFT

A circulant matrix is a Toeplitz matrix where each row is a cyclic shift of the previous row:

$\mathbf{C} = \begin{bmatrix} c_0 & c_{n-1} & c_{n-2} & \cdots & c_1 \\ c_1 & c_0 & c_{n-1} & \cdots & c_2 \\ c_2 & c_1 & c_0 & \cdots & c_3 \\ \vdots & & & \ddots & \vdots \\ c_{n-1} & c_{n-2} & c_{n-3} & \cdots & c_0 \end{bmatrix}$

Every circulant matrix is diagonalized by the DFT matrix: $\mathbf{C} = \mathbf{F}^H \mathrm{diag}(\mathbf{F}\mathbf{c}) \mathbf{F}$

This is why circular convolution can be computed via FFT in $O(n \log n)$ instead of $O(n^2)$ .

from scipy.linalg import circulant
C = circulant([1, 2, 3, 4])

Definition:
DFT Matrix

The $n \times n$ DFT matrix $\mathbf{F}$ has entries:

$F_{kl} = \frac{1}{\sqrt{n}} e^{-j 2\pi kl/n}, \quad k, l = 0, \ldots, n-1$

The DFT matrix is unitary: $\mathbf{F}\mathbf{F}^H = \mathbf{I}$ .

from scipy.linalg import dft

F = dft(n, scale='sqrtn')   # unitary DFT matrix
# Or construct manually:
k = np.arange(n)
F_manual = np.exp(-1j * 2 * np.pi * np.outer(k, k) / n) / np.sqrt(n)

In practice, you never form $\mathbf{F}$ explicitly — the FFT computes $\mathbf{F}\mathbf{x}$ in $O(n \log n)$ without the matrix:

X = np.fft.fft(x) / np.sqrt(n)   # equivalent to F @ x

OFDM modulation is literally the inverse DFT: $\mathbf{x}_{\mathrm{time}} = \mathbf{F}^H \mathbf{x}_{\mathrm{freq}}$ , computed via IFFT.

Theorem: Circulant Matrices Are Diagonalized by the DFT

Any circulant matrix $\mathbf{C}$ with first column $\mathbf{c}$ can be written as: $\mathbf{C} = \mathbf{F}^H \mathrm{diag}(\sqrt{n}\,\mathbf{F}\mathbf{c})\,\mathbf{F}$

Equivalently, the eigenvalues of $\mathbf{C}$ are $\lambda_k = \sqrt{n}\,(\mathbf{F}\mathbf{c})_k$ and the eigenvectors are the columns of $\mathbf{F}^H$ .

Circular convolution in the time domain equals pointwise multiplication in the frequency domain. The DFT diagonalizes circular convolution. This is the mathematical foundation of OFDM: the cyclic prefix makes the channel convolution circular, so the channel can be inverted by per-subcarrier scalar division.

Theorem: Properties of the Matrix Exponential

For $\mathbf{A}, \mathbf{B} \in \mathbb{C}^{n \times n}$ :

$e^{\mathbf{0}} = \mathbf{I}$
$(e^{\mathbf{A}})^{-1} = e^{-\mathbf{A}}$
$e^{(\alpha + \beta)\mathbf{A}} = e^{\alpha\mathbf{A}}\,e^{\beta\mathbf{A}}$
If $\mathbf{A} = \mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^{-1}$ , then $e^{\mathbf{A}} = \mathbf{V}\,e^{\boldsymbol{\Lambda}}\,\mathbf{V}^{-1}$
$\det(e^{\mathbf{A}}) = e^{\mathrm{tr}(\mathbf{A})}$
$e^{\mathbf{A}} e^{\mathbf{B}} = e^{\mathbf{A}+\mathbf{B}}$ only if $\mathbf{AB} = \mathbf{BA}$

Example: State Transition via Matrix Exponential

A continuous-time linear system has state equation $\dot{\mathbf{x}} = \mathbf{A}\mathbf{x}$ with $\mathbf{A} = \begin{bmatrix} 0 & 1 \\ -2 & -3 \end{bmatrix}$ . Compute the state transition matrix $\Phi(t) = e^{\mathbf{A}t}$ at $t = 0, 0.5, 1.0, 2.0$ and verify that $\Phi(0) = \mathbf{I}$ .

Solution

Compute matrix exponential

import numpy as np
from scipy.linalg import expm

A = np.array([[0, 1], [-2, -3]])

for t in [0, 0.5, 1.0, 2.0]:
    Phi = expm(A * t)
    print(f"t = {t}:")
    print(Phi)
    print()

# Verify: Phi(0) = I
assert np.allclose(expm(A * 0), np.eye(2))
# Verify: Phi(s+t) = Phi(s) @ Phi(t) (semigroup)
assert np.allclose(expm(A * 1.5), expm(A * 0.5) @ expm(A * 1.0))

Example: Convolution as Toeplitz Matrix-Vector Product

Show that filtering a signal $\mathbf{x}$ with filter $\mathbf{h}$ is equivalent to multiplying by a Toeplitz matrix $\mathbf{T}_h$ .

Solution

Build Toeplitz and compare with np.convolve

import numpy as np
from scipy.linalg import toeplitz

h = np.array([1, 0.5, 0.25])     # filter
x = np.array([1, 2, 3, 4, 5])    # signal

# Build Toeplitz matrix for linear convolution
n = len(x) + len(h) - 1
col = np.zeros(n)
col[:len(h)] = h
row = np.zeros(len(x))
row[0] = h[0]
T = toeplitz(col, row)

y_toeplitz = T @ x
y_convolve = np.convolve(h, x)

print(f"Toeplitz:  {y_toeplitz}")
print(f"Convolve:  {y_convolve}")
print(f"Match: {np.allclose(y_toeplitz, y_convolve)}")

Example: Circular Convolution via DFT Diagonalization

Verify that multiplying by a circulant matrix is equivalent to circular convolution, which can be computed via FFT.

Solution

Circulant vs FFT

import numpy as np
from scipy.linalg import circulant

c = np.array([1, 2, 3, 4])
x = np.array([5, 6, 7, 8])

# Method 1: Direct circulant matrix multiply
C = circulant(c)
y_matrix = C @ x

# Method 2: FFT-based circular convolution
y_fft = np.real(np.fft.ifft(np.fft.fft(c) * np.fft.fft(x)))

print(f"Matrix: {y_matrix}")
print(f"FFT:    {y_fft}")
print(f"Match: {np.allclose(y_matrix, y_fft)}")
print(f"Cost: matrix O(n^2), FFT O(n log n)")

Toeplitz matrix

A matrix with constant diagonals: $T_{ij} = t_{i-j}$ . Represents linear time-invariant filtering (convolution).

Related: circulant matrix

circulant matrix

A Toeplitz matrix where each row is a cyclic shift of the previous. Diagonalized by the DFT matrix. Represents circular convolution.

Related: Toeplitz matrix

matrix exponential

The function $e^{\mathbf{A}} = \sum_{k=0}^\infty \mathbf{A}^k/k!$ . Solves the linear ODE $\dot{\mathbf{x}} = \mathbf{A}\mathbf{x}$ and is computed via Pade approximation in scipy.linalg.expm.

Common Mistake: np.exp vs scipy.linalg.expm

Mistake:

Using np.exp(A) thinking it computes the matrix exponential $e^{\mathbf{A}}$ . np.exp applies the exponential element-wise, which is mathematically different.

Correction:

Use scipy.linalg.expm(A) for the matrix exponential. The two are equal only for diagonal matrices.

Quick Check

What diagonalizes every circulant matrix?

The DFT matrix

The identity matrix

The Hadamard matrix

Its own eigenvector matrix (varies per matrix)

Correction:

The DFT matrix

Every circulant matrix C = F^H diag(Fc) F where F is the DFT matrix.

Key Takeaway

Exploit matrix structure. Toeplitz $\to$ FFT-based fast multiply. Circulant $\to$ DFT diagonalization. Knowing the structure of your matrix is often more valuable than knowing the latest algorithm. In OFDM, the cyclic prefix makes the channel Toeplitz structure effectively circulant, enabling per-subcarrier equalization.

Matrix Functions and Special Matrices

python

Matrix exponential, logarithm, square root, Toeplitz, circulant, Hankel, and DFT matrix construction with practical examples.

# Code from: ch06/python/matrix_functions.py
# Load from backend supplements endpoint

Matrix Functions and Special Matrices

Beyond Scalar Functions: Applying Functions to Matrices

Definition: Matrix Exponential

Definition: Matrix Logarithm and Square Root

Definition: Toeplitz Matrix

Definition: Circulant Matrix and the DFT

Definition: DFT Matrix

Theorem: Circulant Matrices Are Diagonalized by the DFT

Theorem: Properties of the Matrix Exponential

Example: State Transition via Matrix Exponential

Compute matrix exponential

Example: Convolution as Toeplitz Matrix-Vector Product

Build Toeplitz and compare with np.convolve

Example: Circular Convolution via DFT Diagonalization

Circulant vs FFT

Toeplitz matrix

circulant matrix

matrix exponential

Common Mistake: np.exp vs scipy.linalg.expm

Quick Check

Key Takeaway

Matrix Functions and Special Matrices

Definition:
Matrix Exponential

Definition:
Matrix Logarithm and Square Root

Definition:
Toeplitz Matrix

Definition:
Circulant Matrix and the DFT

Definition:
DFT Matrix