Prerequisites & Notation

Before You Begin

This chapter assumes familiarity with multivariable calculus (gradients, Hessians), linear algebra (eigenvalues, positive definiteness), and basic NumPy/SciPy operations. If any of these feel unfamiliar, review the linked material first.

  • NumPy array creation, slicing, and broadcasting (Chapter 5)

    Self-check: Can you compute a matrix-vector product with @ and reshape arrays?

  • Linear algebra: solve, eigenvalues, SVD (Chapter 6)

    Self-check: Can you solve $\mathbf{A}\mathbf{x} = \mathbf{b}$ and compute eigenvalues in NumPy?

  • Multivariable calculus: gradients and Hessians

    Self-check: Can you compute $\nabla f$ and $\nabla^2 f$ for $f(x,y) = x^2 + xy + y^2$?

  • Convexity basics: convex sets and convex functions

    Self-check: Do you know the defining inequality of a convex function, $f(\lambda x + (1-\lambda)y) \le \lambda f(x) + (1-\lambda)f(y)$ for all $\lambda \in [0,1]$?
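The self-checks above can be answered in a few lines of NumPy. A minimal sketch (the function $f(x,y) = x^2 + xy + y^2$ and its derivatives come from the self-check items; the specific matrices and test points are illustrative choices):

```python
import numpy as np

# 1. Matrix-vector product with @ and reshaping.
A = np.array([[2.0, 1.0], [1.0, 2.0]])
v = np.array([1.0, -1.0])
print(A @ v)                  # matrix-vector product
print(v.reshape(2, 1).shape)  # column vector: (2, 1)

# 2. Solve Ax = b and compute eigenvalues.
b = np.array([3.0, 0.0])
x = np.linalg.solve(A, b)
eigvals = np.linalg.eigvalsh(A)  # eigvalsh: for symmetric matrices
print(x, eigvals)

# 3. Gradient and Hessian of f(x, y) = x^2 + xy + y^2, computed by hand:
#    grad f = [2x + y, x + 2y],  Hessian = [[2, 1], [1, 2]] (constant).
def f(p):
    x, y = p
    return x**2 + x * y + y**2

def grad_f(p):
    x, y = p
    return np.array([2 * x + y, x + 2 * y])

H = np.array([[2.0, 1.0], [1.0, 2.0]])

# 4. Convexity: check the defining inequality at a sample pair of points,
#    and confirm the Hessian is positive semidefinite (eigenvalues >= 0).
p, q, lam = np.array([1.0, 2.0]), np.array([-3.0, 0.5]), 0.3
assert f(lam * p + (1 - lam) * q) <= lam * f(p) + (1 - lam) * f(q)
assert np.all(np.linalg.eigvalsh(H) >= 0)
```

If any of these lines surprise you, revisit the corresponding chapter before continuing.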

Notation for This Chapter

The table below lists the symbols and conventions used throughout this chapter. We write optimization problems in the standard minimization form.

| Symbol | Meaning | Introduced |
| --- | --- | --- |
| $f(\mathbf{x})$ | Objective function to be minimized | s01 |
| $\nabla f$, $\mathbf{g}$ | Gradient (vector of partial derivatives) | s01 |
| $\nabla^2 f$, $\mathbf{H}$ | Hessian matrix (matrix of second partial derivatives) | s01 |
| $\mathbf{x}^\star$ | Optimal solution | s01 |
| $g_i(\mathbf{x}) \le 0$ | Inequality constraints | s02 |
| $h_j(\mathbf{x}) = 0$ | Equality constraints | s02 |
| $\mathrm{prox}_{\lambda f}(\mathbf{v})$ | Proximal operator of $f$ with step size $\lambda$ | s04 |
| $\mathcal{S}_\lambda(x)$ | Soft-thresholding operator with threshold $\lambda$ | s04 |
| $\lVert\mathbf{x}\rVert_1$ | $\ell_1$ norm: $\sum_i \lvert x_i \rvert$ (promotes sparsity) | s03 |
| $\succeq 0$ | Positive semidefinite (for matrices) | s03 |
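To make the last two proximal symbols concrete: the soft-thresholding operator $\mathcal{S}_\lambda$ is the proximal operator of $\lambda \lVert\cdot\rVert_1$, applied elementwise. A minimal sketch, assuming the standard closed form $\mathcal{S}_\lambda(v_i) = \operatorname{sign}(v_i)\max(\lvert v_i\rvert - \lambda, 0)$ (stated here without derivation; it is derived later in the chapter's sections on proximal methods):

```python
import numpy as np

def soft_threshold(v, lam):
    """Elementwise soft-thresholding S_lam(v) = prox of lam * ||.||_1.

    Uses the standard closed form sign(v) * max(|v| - lam, 0);
    entries with |v_i| <= lam are set exactly to zero (sparsity).
    """
    return np.sign(v) * np.maximum(np.abs(v) - lam, 0.0)

v = np.array([3.0, -0.5, 1.2, 0.0])
print(soft_threshold(v, 1.0))  # small entries shrink to exactly zero
```

Note that, unlike gradient steps on a smooth penalty, the proximal step produces exact zeros, which is why the $\ell_1$ norm is said to promote sparsity.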