Trace, Determinant, and Matrix Inequalities

Why Trace and Determinant Are Everywhere in Capacity Expressions

Every capacity formula in multi-antenna communications rests on two matrix functionals: the trace and the determinant.

The MIMO capacity formula. For an n_r \times n_t channel \mathbf{H} with input covariance \mathbf{R}_x and noise variance \sigma^2, the mutual information is I = \log\det\!\left(\mathbf{I}_{n_r} + \frac{1}{\sigma^2}\mathbf{H}\mathbf{R}_x\mathbf{H}^H\right). The capacity-achieving \mathbf{R}_x is found by maximizing this \log\det subject to a trace constraint: \operatorname{tr}(\mathbf{R}_x) \leq P. The trace constrains total transmit power; the log-determinant measures information rate. Understanding the interplay between the two, through trace identities, Hadamard's inequality, Fischer's inequality, and the concavity of \log\det, is essential for deriving capacity bounds, water-filling solutions, and beamforming strategies.

Where these tools appear:

  • Trace identities simplify covariance manipulations: \operatorname{tr}(\mathbf{H}^H\mathbf{H}\mathbf{R}_x) = \operatorname{tr}(\mathbf{R}_x\mathbf{H}^H\mathbf{H}).
  • Hadamard's inequality bounds the determinant by the product of diagonal entries, yielding capacity upper bounds when the channel decomposes into independent sub-channels.
  • Fischer's inequality provides tighter bounds when the channel has block structure (e.g., multi-user MIMO).
  • Log-det concavity guarantees that the MIMO capacity optimization is a convex program (maximizing a concave function over a convex set), ensuring that water-filling is globally optimal.

Definition:

Trace of a Matrix

The trace of a square matrix \mathbf{A} \in \mathbb{C}^{n \times n} is the sum of its diagonal entries: \operatorname{tr}(\mathbf{A}) = \sum_{i=1}^{n} a_{ii}.

The trace satisfies the following fundamental properties:

  1. Linearity. For any \alpha, \beta \in \mathbb{C} and \mathbf{A}, \mathbf{B} \in \mathbb{C}^{n \times n}: \operatorname{tr}(\alpha \mathbf{A} + \beta \mathbf{B}) = \alpha \operatorname{tr}(\mathbf{A}) + \beta \operatorname{tr}(\mathbf{B}).

  2. Cyclic property. For \mathbf{A} \in \mathbb{C}^{m \times n} and \mathbf{B} \in \mathbb{C}^{n \times m}: \operatorname{tr}(\mathbf{A}\mathbf{B}) = \operatorname{tr}(\mathbf{B}\mathbf{A}). More generally, the trace is invariant under cyclic permutations of a matrix product (see the theorem Cyclic Property of the Trace below).

  3. Relation to eigenvalues. If \lambda_1, \ldots, \lambda_n are the eigenvalues of \mathbf{A} (counted with algebraic multiplicity), then \operatorname{tr}(\mathbf{A}) = \sum_{i=1}^{n} \lambda_i.

  4. Similarity invariance. For any invertible \mathbf{P}: \operatorname{tr}(\mathbf{P}^{-1}\mathbf{A}\mathbf{P}) = \operatorname{tr}(\mathbf{A}). This follows immediately from the cyclic property.

  5. Transpose and conjugate transpose. \operatorname{tr}(\mathbf{A}^T) = \operatorname{tr}(\mathbf{A}) and \operatorname{tr}(\mathbf{A}^H) = \overline{\operatorname{tr}(\mathbf{A})}.

In telecommunications, \operatorname{tr}(\mathbf{R}_x) represents total transmit power when \mathbf{R}_x = \mathbb{E}[\mathbf{x}\mathbf{x}^H] is the input covariance matrix. The power constraint is therefore a trace constraint.
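These properties are easy to confirm numerically. A minimal NumPy sketch (the random complex test matrices are arbitrary illustration data, not part of any standard API):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3)) + 1j * rng.standard_normal((3, 3))

# Linearity: tr(2A + 3B) = 2 tr(A) + 3 tr(B)
assert np.isclose(np.trace(2 * A + 3 * B), 2 * np.trace(A) + 3 * np.trace(B))

# Cyclic property: tr(AB) = tr(BA)
assert np.isclose(np.trace(A @ B), np.trace(B @ A))

# Trace equals the sum of the eigenvalues
assert np.isclose(np.trace(A), np.sum(np.linalg.eigvals(A)))

# Conjugate transpose: tr(A^H) = conj(tr(A))
assert np.isclose(np.trace(A.conj().T), np.conj(np.trace(A)))
print("all trace identities verified")
```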

Definition:

Determinant of a Matrix

The determinant of a square matrix \mathbf{A} \in \mathbb{C}^{n \times n}, denoted \det(\mathbf{A}) or |\mathbf{A}|, is the unique multilinear, alternating function of the columns of \mathbf{A} satisfying \det(\mathbf{I}) = 1. Equivalently, via the Leibniz formula: \det(\mathbf{A}) = \sum_{\sigma \in S_n} \operatorname{sgn}(\sigma) \prod_{i=1}^{n} a_{i,\sigma(i)}, where S_n is the symmetric group on \{1, \ldots, n\}.

Fundamental properties:

  1. Product of eigenvalues. If \lambda_1, \ldots, \lambda_n are the eigenvalues of \mathbf{A}, then \det(\mathbf{A}) = \prod_{i=1}^{n} \lambda_i.

  2. Multiplicativity. For \mathbf{A}, \mathbf{B} \in \mathbb{C}^{n \times n}: \det(\mathbf{A}\mathbf{B}) = \det(\mathbf{A})\det(\mathbf{B}).

  3. Transpose and conjugate transpose. \det(\mathbf{A}^T) = \det(\mathbf{A}) and \det(\mathbf{A}^H) = \overline{\det(\mathbf{A})}. In particular, \det(\mathbf{A}) \in \mathbb{R} when \mathbf{A} is Hermitian.

  4. Invertibility. \mathbf{A} is invertible if and only if \det(\mathbf{A}) \neq 0.

  5. Determinant of a block triangular matrix. If \mathbf{M} = \begin{pmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{0} & \mathbf{D} \end{pmatrix}, then \det(\mathbf{M}) = \det(\mathbf{A})\det(\mathbf{D}).

  6. Positive definiteness. If \mathbf{A} \succ 0, then \det(\mathbf{A}) > 0 (since all eigenvalues are strictly positive).

In capacity formulas, \log\det(\cdot) converts a product of eigenvalue contributions into a sum: \log\det(\mathbf{A}) = \sum_i \log \lambda_i. This is why the MIMO capacity decomposes into a sum of per-eigenmode rates.
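A short NumPy check of these determinant properties, including the log-det eigenvalue sum (random test matrices chosen arbitrarily for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
B = rng.standard_normal((4, 4))

# det equals the product of the eigenvalues
assert np.isclose(np.linalg.det(A), np.prod(np.linalg.eigvals(A)).real)

# Multiplicativity: det(AB) = det(A) det(B)
assert np.isclose(np.linalg.det(A @ B), np.linalg.det(A) * np.linalg.det(B))

# For positive definite P, log det(P) = sum of log eigenvalues
P = A @ A.T + 4 * np.eye(4)          # construct a positive definite matrix
logdet = np.log(np.linalg.det(P))
assert np.isclose(logdet, np.sum(np.log(np.linalg.eigvalsh(P))))
print("determinant properties verified")
```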

Theorem: Cyclic Property of the Trace

Let \mathbf{A}_1 \in \mathbb{C}^{n_1 \times n_2}, \mathbf{A}_2 \in \mathbb{C}^{n_2 \times n_3}, \ldots, \mathbf{A}_k \in \mathbb{C}^{n_k \times n_1} be matrices such that the product \mathbf{A}_1 \mathbf{A}_2 \cdots \mathbf{A}_k is square. Then the trace is invariant under cyclic permutations: \operatorname{tr}(\mathbf{A}_1 \mathbf{A}_2 \cdots \mathbf{A}_k) = \operatorname{tr}(\mathbf{A}_2 \mathbf{A}_3 \cdots \mathbf{A}_k \mathbf{A}_1) = \cdots = \operatorname{tr}(\mathbf{A}_k \mathbf{A}_1 \cdots \mathbf{A}_{k-1}). In particular, for three square matrices: \operatorname{tr}(\mathbf{A}\mathbf{B}\mathbf{C}) = \operatorname{tr}(\mathbf{B}\mathbf{C}\mathbf{A}) = \operatorname{tr}(\mathbf{C}\mathbf{A}\mathbf{B}).

Note: The trace is not invariant under arbitrary permutations. In general, \operatorname{tr}(\mathbf{A}\mathbf{B}\mathbf{C}) \neq \operatorname{tr}(\mathbf{A}\mathbf{C}\mathbf{B}).

A cyclic permutation moves the first matrix to the end. The trace "sees" a product of matrices as a closed loop of index contractions, and rotating the starting point of a loop does not change the loop itself.
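Both the cyclic invariance and the failure of arbitrary permutations can be seen numerically; a small NumPy sketch with arbitrary random test matrices:

```python
import numpy as np

rng = np.random.default_rng(2)
A, B, C = (rng.standard_normal((3, 3)) for _ in range(3))

t_abc = np.trace(A @ B @ C)
# Cyclic rotations leave the trace unchanged
assert np.isclose(t_abc, np.trace(B @ C @ A))
assert np.isclose(t_abc, np.trace(C @ A @ B))
# Swapping two factors (a non-cyclic permutation) changes it in general
print(t_abc, np.trace(A @ C @ B))  # two different values for generic matrices
```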

Theorem: Hadamard's Inequality

Let \mathbf{A} \in \mathbb{C}^{n \times n} be positive definite (\mathbf{A} \succ 0) with diagonal entries a_{11}, a_{22}, \ldots, a_{nn}. Then \det(\mathbf{A}) \leq \prod_{i=1}^{n} a_{ii}, with equality if and only if \mathbf{A} is diagonal.

Equivalently, if \mathbf{a}_1, \ldots, \mathbf{a}_n \in \mathbb{C}^n are the columns of an arbitrary matrix \mathbf{A} \in \mathbb{C}^{n \times n}, then |\det(\mathbf{A})| \leq \prod_{i=1}^{n} \|\mathbf{a}_i\|, with equality if and only if the columns are mutually orthogonal.

The determinant measures the volume of the parallelepiped spanned by the columns. If columns are not orthogonal, the parallelepiped "collapses" partially, reducing its volume below the product of the column lengths. For a positive definite matrix, the diagonal entries are the squared norms of the "contributions" in each coordinate direction, and any off-diagonal correlation reduces the determinant.
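A quick numerical sanity check of the column-norm form (random test matrix; the orthogonal-columns case is produced via a QR factorization, an illustrative choice):

```python
import numpy as np

rng = np.random.default_rng(7)
A = rng.standard_normal((4, 4))

# Column-norm form: |det(A)| <= product of the column norms
col_norms = np.linalg.norm(A, axis=0)
assert abs(np.linalg.det(A)) <= np.prod(col_norms) + 1e-12

# Equality for mutually orthogonal columns: an orthogonal Q from QR
# has |det(Q)| = 1, which equals the product of its unit column norms.
Q, _ = np.linalg.qr(A)
assert np.isclose(abs(np.linalg.det(Q)), np.prod(np.linalg.norm(Q, axis=0)))
print("Hadamard column-norm bound verified")
```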

Hadamard's Inequality: Volume of a Parallelepiped

[Interactive figure: the parallelepiped spanned by the columns morphs into a rectangular box at equality.] The determinant equals the volume of the parallelepiped spanned by the columns, and Hadamard's inequality says this volume is maximized when the columns are orthogonal. Geometric interpretation: correlated columns waste volume; orthogonal columns achieve the maximum \det(\mathbf{A}) = \prod_i \|\mathbf{a}_i\|.

Theorem: Concavity of \log\det on the Positive Definite Cone

The function f(\mathbf{X}) = \log\det(\mathbf{X}) is concave on the cone of positive definite matrices \mathbb{S}_{++}^n. That is, for any \mathbf{A}, \mathbf{B} \succ 0 and \theta \in [0, 1]: \log\det\bigl(\theta \mathbf{A} + (1 - \theta)\mathbf{B}\bigr) \geq \theta \log\det(\mathbf{A}) + (1 - \theta) \log\det(\mathbf{B}).

Since \log\det(\mathbf{X}) = \sum_i \log \lambda_i(\mathbf{X}), the log-det is a sum of concave (\log) functions of the eigenvalues. However, the eigenvalues of a convex combination are not simply convex combinations of eigenvalues, so the proof requires more care. The key idea is to reduce to a one-dimensional argument by examining g(t) = \log\det(\mathbf{A} + t\mathbf{V}) and showing it is concave in t.
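The concavity inequality can be spot-checked along a chord between two randomly generated positive definite matrices (a numerical sketch, not a proof):

```python
import numpy as np

def logdet(X):
    # Numerically stable log-determinant for a positive definite X
    sign, ld = np.linalg.slogdet(X)
    assert sign > 0
    return ld

rng = np.random.default_rng(3)
M1, M2 = rng.standard_normal((4, 4)), rng.standard_normal((4, 4))
A = M1 @ M1.T + np.eye(4)   # positive definite
B = M2 @ M2.T + np.eye(4)   # positive definite

for theta in np.linspace(0.0, 1.0, 11):
    lhs = logdet(theta * A + (1 - theta) * B)
    rhs = theta * logdet(A) + (1 - theta) * logdet(B)
    assert lhs >= rhs - 1e-12   # concavity: the function dominates the chord
print("concavity verified on the sampled chord")
```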

Theorem: Fischer's Inequality

Let \mathbf{M} \in \mathbb{C}^{n \times n} be positive definite and partitioned as \mathbf{M} = \begin{pmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{B}^H & \mathbf{D} \end{pmatrix}, where \mathbf{A} \in \mathbb{C}^{k \times k} and \mathbf{D} \in \mathbb{C}^{(n-k) \times (n-k)} are the diagonal blocks. Then \det(\mathbf{M}) \leq \det(\mathbf{A}) \det(\mathbf{D}), with equality if and only if \mathbf{B} = \mathbf{0} (i.e., the two blocks are uncorrelated).

Fischer's inequality says that correlations between two groups of variables (captured by the off-diagonal block \mathbf{B}) can only reduce the determinant, never increase it. Hadamard's inequality is the special case where each block is 1 \times 1.
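A minimal numerical check of Fischer's inequality for a randomly generated positive definite matrix (the 5 \times 5 size and the k = 2 partition are arbitrary illustration choices):

```python
import numpy as np

rng = np.random.default_rng(8)
M0 = rng.standard_normal((5, 5))
M = M0 @ M0.T + np.eye(5)         # positive definite 5x5 test matrix

k = 2                             # partition into k x k and (n-k) x (n-k) blocks
A, D = M[:k, :k], M[k:, k:]

detM = np.linalg.det(M)
bound = np.linalg.det(A) * np.linalg.det(D)
assert detM <= bound + 1e-10      # Fischer: det(M) <= det(A) det(D)
print("Fischer bound holds:", detM, "<=", bound)
```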

Example: Verifying Trace Identities and Hadamard's Inequality for a 3 \times 3 Matrix

Let \mathbf{A} = \begin{pmatrix} 4 & 1 & 0 \\ 1 & 3 & 1 \\ 0 & 1 & 2 \end{pmatrix}. (a) Verify the cyclic property of the trace for \mathbf{A}^2. (b) Compute \det(\mathbf{A}) and verify Hadamard's inequality. (c) Verify Fischer's inequality for the 2 \times 2 / 1 \times 1 partition.
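A NumPy sketch working through the example; for part (a), one natural check is the eigenvalue identity \operatorname{tr}(\mathbf{A}^2) = \sum_i \lambda_i^2, which follows from the trace-eigenvalue relation:

```python
import numpy as np

A = np.array([[4.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 2.0]])

# (a) tr(A^2) equals the sum of the squared eigenvalues
eigs = np.linalg.eigvalsh(A)                   # A is symmetric
print(np.trace(A @ A), np.sum(eigs**2))        # both equal 33

# (b) Hadamard: det(A) <= product of the diagonal entries
d = np.linalg.det(A)
print(d, np.prod(np.diag(A)))                  # 18 <= 24

# (c) Fischer for the 2x2 / 1x1 partition: det(A) <= det(A11) det(D)
A11, D = A[:2, :2], A[2:, 2:]
print(np.linalg.det(A11) * np.linalg.det(D))   # bound 22 >= 18
```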

\log\det of a Parameterized Positive Definite Matrix

Explore how \log\det(\mathbf{I} + \alpha \mathbf{A}) varies with \alpha, demonstrating the concavity of the log-det function. The plot shows \log\det(\mathbf{I} + \alpha \mathbf{A}) as a function of \alpha for several matrix types. Observe that the curve always bends downward (concavity), and that ill-conditioned matrices exhibit more pronounced concavity due to the spread of their eigenvalues.
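For readers without the interactive plot, a short NumPy sketch reproduces the experiment on a grid of \alpha values (the test matrix and grid are arbitrary choices), checking discrete concavity via nonpositive second differences:

```python
import numpy as np

def logdet(X):
    sign, ld = np.linalg.slogdet(X)
    return ld

rng = np.random.default_rng(4)
M = rng.standard_normal((4, 4))
A = M @ M.T + 0.1 * np.eye(4)     # positive definite test matrix

alphas = np.linspace(0.0, 4.0, 9)
vals = np.array([logdet(np.eye(4) + a * A) for a in alphas])

# A concave sampled curve has nonpositive second differences
second_diffs = np.diff(vals, n=2)
assert np.all(second_diffs <= 1e-10)
print("log det(I + alpha A) is concave in alpha on the sampled grid")
```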


Why This Matters: Log-Det in the MIMO Capacity Formula

The central object in MIMO information theory is the log-det capacity formula: C = \max_{\substack{\mathbf{R}_x \succeq 0 \\ \operatorname{tr}(\mathbf{R}_x) \leq P}} \log_2 \det\!\left( \mathbf{I} + \frac{1}{\sigma^2} \mathbf{H} \mathbf{R}_x \mathbf{H}^H \right).
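A minimal sketch of evaluating this objective for a fixed input covariance (the channel is drawn at random for illustration, and mimo_rate_bits is a name invented here, not a library function):

```python
import numpy as np

def mimo_rate_bits(H, Rx, sigma2):
    """Mutual information log2 det(I + H Rx H^H / sigma^2) in bits/channel use."""
    nr = H.shape[0]
    M = np.eye(nr) + (H @ Rx @ H.conj().T) / sigma2
    sign, ld = np.linalg.slogdet(M)   # natural-log determinant, stable
    return ld / np.log(2.0)

rng = np.random.default_rng(5)
nr, nt, P, sigma2 = 4, 4, 4.0, 1.0
H = (rng.standard_normal((nr, nt)) + 1j * rng.standard_normal((nr, nt))) / np.sqrt(2)

# Uniform power allocation: Rx = (P / nt) I satisfies tr(Rx) = P
Rx = (P / nt) * np.eye(nt)
print(f"rate with uniform allocation: {mimo_rate_bits(H, Rx, sigma2):.2f} bits/use")
```

Maximizing over all feasible \mathbf{R}_x (water-filling over the singular values of \mathbf{H}) can only improve on this uniform-allocation value.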

The results of this section connect directly to this formula:

  1. Log-det concavity (Concavity of \log\det on the Positive Definite Cone) guarantees that the capacity optimization is a concave maximization over the convex set \{\mathbf{R}_x \succeq 0 : \operatorname{tr}(\mathbf{R}_x) \leq P\}. Any local maximum is therefore the global maximum, and the water-filling solution is provably optimal.

  2. Hadamard's inequality yields the bound C \leq \sum_{i=1}^{n} \log_2(1 + \text{SNR}_{i}), showing that capacity is maximized when the effective sub-channels are uncorrelated, which is achieved by diagonalizing through the SVD of \mathbf{H}.

  3. Fischer's inequality provides capacity bounds for block-structured channels, such as multi-user MIMO systems where the channel matrix has a natural block partition corresponding to different users.

  4. Trace identities (the Cyclic Property of the Trace) are used repeatedly in manipulating covariance expressions: \operatorname{tr}(\mathbf{H}^H\mathbf{H}\mathbf{R}_x) = \operatorname{tr}(\mathbf{R}_x\mathbf{H}^H\mathbf{H}) = \operatorname{tr}(\mathbf{H}\mathbf{R}_x\mathbf{H}^H).

See full treatment in Physical MIMO Channel Modeling

Key Takeaway

The trace constrains power (\operatorname{tr}(\mathbf{R}_x) \leq P) while the log-determinant measures information rate (C = \log\det(\mathbf{I} + \text{SNR} \cdot \mathbf{H}\mathbf{H}^H)). The inequalities of this section (Hadamard, Fischer, and log-det concavity) are the analytical backbone of every capacity bound in MIMO communications. Hadamard bounds capacity from above by the sum of independent sub-channel rates; Fischer refines this for block-structured channels; and log-det concavity ensures that capacity optimization is a tractable convex program with a unique global optimum (water-filling).

Common Mistake: Trace of a Product Is Not the Product of Traces

Mistake:

A common error is to assume that \operatorname{tr}(\mathbf{A}\mathbf{B}) = \operatorname{tr}(\mathbf{A})\operatorname{tr}(\mathbf{B}). This is analogous to confusing "the sum of products" with "the product of sums." Students who have internalized the linearity of the trace sometimes extend it incorrectly to multiplicativity.

Correction:

The trace is linear but NOT multiplicative. In general, \operatorname{tr}(\mathbf{A}\mathbf{B}) \neq \operatorname{tr}(\mathbf{A})\operatorname{tr}(\mathbf{B}).

Counterexample. Let \mathbf{A} = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \mathbf{B} = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}. Then \mathbf{A}\mathbf{B} = \mathbf{0}, so \operatorname{tr}(\mathbf{A}\mathbf{B}) = 0, but \operatorname{tr}(\mathbf{A})\operatorname{tr}(\mathbf{B}) = 1 \cdot 1 = 1 \neq 0.

What is true is the cyclic property: \operatorname{tr}(\mathbf{A}\mathbf{B}) = \operatorname{tr}(\mathbf{B}\mathbf{A}). See the theorem Cyclic Property of the Trace.

Quick Check

Let \mathbf{A} \succ 0 be a 3 \times 3 positive definite matrix with a_{11} = 2, a_{22} = 3, a_{33} = 5. By Hadamard's inequality, what is the tightest upper bound on \det(\mathbf{A}) that the inequality provides?

10

30

15

\infty (Hadamard gives no finite bound)

Quick Check

Which of the following statements about f(\mathbf{X}) = \log\det(\mathbf{X}) on the positive definite cone is TRUE?

f is convex.

f is concave.

f is neither convex nor concave.

f is linear.

Trace

The sum of the diagonal entries of a square matrix: \operatorname{tr}(\mathbf{A}) = \sum_{i=1}^n a_{ii}. Equivalently, the sum of the eigenvalues. The trace is linear, similarity-invariant, and satisfies the cyclic property \operatorname{tr}(\mathbf{A}\mathbf{B}) = \operatorname{tr}(\mathbf{B}\mathbf{A}). In wireless communications, the trace of the input covariance matrix equals total transmit power.

Related: Determinant of a Matrix, Eigenvalue and Eigenvector, power constraint

Hadamard Inequality

For a positive definite matrix \mathbf{A}, the bound \det(\mathbf{A}) \leq \prod_{i=1}^n a_{ii}, with equality if and only if \mathbf{A} is diagonal. Geometrically, the volume of the parallelepiped spanned by the columns of \mathbf{A} is maximized when the columns are orthogonal. In MIMO systems, this bounds the capacity by the sum of independent sub-channel capacities.

Related: Determinant of a Matrix, Fischer's Inequality, Positive Definite Matrix

Log-Det Concavity

The property that f(\mathbf{X}) = \log\det(\mathbf{X}) is a concave function on the cone of positive definite matrices. This ensures that MIMO capacity optimization (maximizing \log\det(\mathbf{I} + \text{SNR} \cdot \mathbf{H}\mathbf{R}_x\mathbf{H}^H) subject to a convex power constraint) is a convex program, guaranteeing global optimality of the water-filling solution.

Related: concave function, convex optimization, MIMO Capacity (Deterministic Channel)

⚠️Engineering Note

Computing log-det in Practice: Cholesky, Not Eigendecomposition

Never compute \log\det(\mathbf{A}) by finding all eigenvalues first: that costs O(n^3) for the eigendecomposition plus the risk of numerical overflow/underflow in the product. Instead, use the Cholesky factorization \mathbf{A} = \mathbf{L}\mathbf{L}^H (which requires \mathbf{A} \succ 0): \log\det(\mathbf{A}) = 2\sum_{i=1}^n \log L_{ii}. Cholesky costs n^3/3 flops, a fraction of the cost of a full eigendecomposition, and the sum of logarithms avoids overflow. In MIMO capacity computation this is the standard approach; for general square matrices, numpy.linalg.slogdet achieves the same overflow-free behavior via an LU factorization.
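A sketch of the Cholesky route, checked against numpy.linalg.slogdet (the test matrix is arbitrary, and logdet_chol is a name invented here for illustration):

```python
import numpy as np

def logdet_chol(A):
    """log det(A) for a Hermitian positive definite A via Cholesky."""
    L = np.linalg.cholesky(A)                 # A = L L^H, raises if not PD
    return 2.0 * np.sum(np.log(np.diag(L).real))

rng = np.random.default_rng(6)
M = rng.standard_normal((64, 64))
A = M @ M.T + 64 * np.eye(64)                 # well-conditioned PD test matrix

sign, ld = np.linalg.slogdet(A)               # general-purpose reference
assert sign == 1.0 and np.isclose(logdet_chol(A), ld)
print("Cholesky log-det matches slogdet")
```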

Practical Constraints

  • Cholesky: n^3/3 flops vs eigendecomposition: \sim 4n^3 flops

  • For 64 \times 64 MIMO: Cholesky takes ~90K flops, eigendecomposition ~1M flops

  • numpy.linalg.slogdet returns (sign, log|det|), which is numerically stable for any matrix

Historical Note: Hadamard's Inequality and the Maximum Determinant Problem


Jacques Hadamard proved his determinant inequality in 1893, establishing that the determinant of a positive definite matrix is bounded by the product of its diagonal entries. The geometric interpretation (the volume of a parallelepiped is maximized when its edges are orthogonal) was already implicit in the work of Gram (1883), but Hadamard gave the sharp algebraic bound.

The related Hadamard maximum determinant problem, finding the n \times n matrix with entries in \{-1, +1\} that maximizes |\det(\mathbf{A})|, remains open for most n. Hadamard matrices (those achieving the bound with entries \pm 1) exist only when n = 1, 2, or n is a multiple of 4, and their existence for all such n is a famous unsolved conjecture.

Why This Matters: Trace and Determinant in Information-Theoretic Capacity Bounds

The trace and log-determinant are the two fundamental matrix functionals in information theory. The trace constrains power (\operatorname{tr}(\mathbf{R}_x) \leq P), while the log-determinant measures rate (C = \log\det(\mathbf{I} + \text{SNR} \cdot \mathbf{H}\mathbf{R}_x\mathbf{H}^H)). The inequalities of this section (Hadamard, Fischer, and log-det concavity) are the analytical backbone of capacity analysis in Book ITA (Chapters 13-16) and Book MIMO (Chapters 1-5).