Exercises
ex-sp-ch08-01
Easy
Minimize $f(x, y) = (x - 3)^2 + (y + 1)^2$ using scipy.optimize.minimize
with method 'BFGS'. Provide the analytical gradient. Verify that
the solution is $(x^*, y^*) = (3, -1)$.
The gradient is $\nabla f = [2(x - 3),\ 2(y + 1)]$.
Start from $(x_0, y_0) = (0, 0)$.
Implementation
import numpy as np
from scipy.optimize import minimize
def f(x):
    return (x[0] - 3)**2 + (x[1] + 1)**2

def grad_f(x):
    return np.array([2*(x[0] - 3), 2*(x[1] + 1)])
res = minimize(f, [0, 0], method='BFGS', jac=grad_f)
print(f"x* = {res.x}") # [3.0, -1.0]
print(f"f(x*) = {res.fun:.2e}") # ~0
print(f"Converged: {res.success}")
ex-sp-ch08-02
Easy
Use scipy.optimize.brentq to find the root of $f(x) = e^x - 3x$
on the interval $[0, 1]$. Verify by plugging the root back into $f$.
Check that $f(0) > 0$ and $f(1) < 0$ (or vice versa) for a valid bracket.
Implementation
import numpy as np
from scipy.optimize import brentq
f = lambda x: np.exp(x) - 3*x
x_star = brentq(f, 0, 1)   # f(0) = 1 > 0, f(1) = e - 3 < 0: valid bracket
print(f"Root: {x_star:.10f}")
print(f"f(x*) = {f(x_star):.2e}")
ex-sp-ch08-03
Easy
Implement the soft-thresholding operator
$S_\lambda(v)_i = \operatorname{sign}(v_i)\max(|v_i| - \lambda, 0)$
and apply it to the vector $v = [-3, -1, 0.5, 0, 2, -0.3, 4]$ with
$\lambda = 1$. How many components become exactly zero?
Implementation
import numpy as np
def soft_threshold(v, lam):
    return np.sign(v) * np.maximum(np.abs(v) - lam, 0)
v = np.array([-3, -1, 0.5, 0, 2, -0.3, 4])
result = soft_threshold(v, 1.0)
print(f"Result: {result}")
# [-2, 0, 0, 0, 1, 0, 3]
print(f"Zeros: {np.sum(result == 0)}") # 4
ex-sp-ch08-04
Easy
Solve the linear program: maximize $3x + 5y$ subject to
$x + y \le 10$, $2x + y \le 14$, $x, y \ge 0$, using linprog.
Negate the objective for minimization.
Implementation
from scipy.optimize import linprog
c = [-3, -5] # negate for maximization
A_ub = [[1, 1], [2, 1]]
b_ub = [10, 14]
bounds = [(0, None), (0, None)]
res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
print(f"x={res.x[0]:.1f}, y={res.x[1]:.1f}")
print(f"Max profit = {-res.fun:.1f}")
ex-sp-ch08-05
Easy
Use scipy.optimize.fixed_point to find the fixed point of
$g(x) = \cos x$ starting from $x_0 = 0.5$. Compare to the
analytical value (the Dottie number, $\approx 0.739085$).
Implementation
import numpy as np
from scipy.optimize import fixed_point
x_star = fixed_point(np.cos, 0.5)
print(f"Fixed point: {x_star:.10f}")
print(f"cos(x*) = {np.cos(x_star):.10f}")
print(f"|x* - cos(x*)| = {abs(x_star - np.cos(x_star)):.2e}")
ex-sp-ch08-06
Medium
Compare the iteration counts of Nelder-Mead, CG, BFGS, and Newton-CG on
the Rosenbrock function starting from $x_0 = (-1.5, 2.0)$. Provide the
analytical gradient and Hessian for the methods that use them.
The Hessian is
$\nabla^2 f = \begin{bmatrix} 2 - 400(y - 3x^2) & -400x \\ -400x & 200 \end{bmatrix}$.
Use result.nit for the iteration count and result.nfev for the number of
function evaluations.
Implementation
import numpy as np
from scipy.optimize import minimize
def rosen(x):
    return (1 - x[0])**2 + 100*(x[1] - x[0]**2)**2

def rosen_grad(x):
    return np.array([
        -2*(1 - x[0]) - 400*x[0]*(x[1] - x[0]**2),
        200*(x[1] - x[0]**2)
    ])

def rosen_hess(x):
    return np.array([
        [2 - 400*(x[1] - 3*x[0]**2), -400*x[0]],
        [-400*x[0], 200]
    ])

x0 = np.array([-1.5, 2.0])
for method in ['Nelder-Mead', 'CG', 'BFGS', 'Newton-CG']:
    kw = {}
    if method in ('CG', 'BFGS', 'Newton-CG'):
        kw['jac'] = rosen_grad
    if method == 'Newton-CG':
        kw['hess'] = rosen_hess
    res = minimize(rosen, x0, method=method, **kw)
    print(f"{method:12s}: nit={res.nit:4d}, nfev={res.nfev:4d}, "
          f"f*={res.fun:.2e}")
ex-sp-ch08-07
Medium
Implement water-filling power allocation for $N = 12$ channels with total
power $P = 20$ and noise power $\sigma^2 = 1$. Channel gains are
$g_i = 10^{(20 - 3i)/10}$ for $i = 0, \dots, 11$. How many channels
receive zero power? Verify with CVXPY.
Sort channels by gain; use the iterative water-level formula.
In CVXPY: cp.Maximize(cp.sum(cp.log(1 + cp.multiply(gains, p)))).
Water-filling
import numpy as np
N = 12
gains = 10**((20 - 3*np.arange(N))/10)
P, sigma2 = 20.0, 1.0
noise_floor = sigma2 / gains
idx = np.argsort(noise_floor)
nf_sorted = noise_floor[idx]
for K in range(N, 0, -1):
    mu = (P + np.sum(nf_sorted[:K])) / K
    p = np.maximum(mu - nf_sorted, 0)
    if p[K-1] > 0:
        break
powers = np.zeros(N)
powers[idx] = p
print(f"Zero-power channels: {np.sum(powers == 0)}")
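The CVXPY check the exercise asks for can be appended to the script above. A minimal sketch (assumes the cvxpy package and reuses N, gains, P, and powers from the water-filling code):

import cvxpy as cp

p_var = cp.Variable(N, nonneg=True)
obj = cp.Maximize(cp.sum(cp.log(1 + cp.multiply(gains, p_var))))
cp.Problem(obj, [cp.sum(p_var) == P]).solve()
print(f"Max |p_wf - p_cvx| = {np.max(np.abs(powers - p_var.value)):.2e}")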
ex-sp-ch08-08
Medium
Solve the LASSO problem $\min_x \tfrac{1}{2}\|Ax - b\|_2^2 + \lambda\|x\|_1$
for a random $50 \times 200$ system with 10 true nonzeros using both
CVXPY and ISTA. Compare the solutions and convergence.
For ISTA, use step size $t = 1/L$ where $L = \|A^\top A\|_2$.
Run ISTA for 1000 iterations and plot the objective vs iteration.
Implementation
import numpy as np
import cvxpy as cp
np.random.seed(42)
m, n, k = 50, 200, 10
A = np.random.randn(m, n)
x_true = np.zeros(n); x_true[:k] = np.random.randn(k)
b = A @ x_true + 0.1*np.random.randn(m)
lam = 0.5
# CVXPY
x_cv = cp.Variable(n)
prob = cp.Problem(cp.Minimize(0.5*cp.sum_squares(A@x_cv-b) + lam*cp.norm(x_cv,1)))
prob.solve()
# ISTA
L = np.linalg.norm(A.T @ A, 2)
x_is = np.zeros(n)
for _ in range(1000):
    z = x_is - A.T @ (A @ x_is - b) / L                   # gradient step
    x_is = np.sign(z) * np.maximum(np.abs(z) - lam/L, 0)  # soft-threshold
print(f"||x_cvxpy - x_ista|| = {np.linalg.norm(x_cv.value - x_is):.4f}")
ex-sp-ch08-09
Medium
Solve the system of nonlinear equations $x^2 + y^2 = 25$,
$x^2 - y^2 = 3$ using fsolve from multiple starting points.
Find all four solutions.
Try starting points in each quadrant: $(\pm 3, \pm 3)$.
Provide the Jacobian for faster convergence.
Implementation
from scipy.optimize import fsolve
import numpy as np
def sys(v):
    x, y = v
    return [x**2 + y**2 - 25, x**2 - y**2 - 3]

def jac(v):
    x, y = v
    return [[2*x, 2*y], [2*x, -2*y]]

starts = [(3, 3), (-3, 3), (3, -3), (-3, -3)]
solutions = set()
for s in starts:
    sol, info, ier, msg = fsolve(sys, s, fprime=jac, full_output=True)
    if ier == 1:
        solutions.add(tuple(np.round(sol, 8)))
for s in sorted(solutions):
    print(f"({s[0]:.4f}, {s[1]:.4f})")
ex-sp-ch08-10
Medium
Implement the projection onto the probability simplex and test it on the
vector $v = [1.5, -0.5, 0.3, 0.8, -0.1]$. Verify that the result satisfies
the simplex constraints ($w_i \ge 0$, $\sum_i w_i = 1$).
Sort in decreasing order and find the threshold.
Implementation
import numpy as np
def proj_simplex(v):
    n = len(v)
    u = np.sort(v)[::-1]                    # sort in decreasing order
    cssv = np.cumsum(u) - 1
    rho = np.nonzero(u > cssv / np.arange(1, n + 1))[0][-1]
    theta = cssv[rho] / (rho + 1)
    return np.maximum(v - theta, 0)
v = np.array([1.5, -0.5, 0.3, 0.8, -0.1])
w = proj_simplex(v)
print(f"Projection: {w}")
print(f"Sum: {np.sum(w):.10f}")
print(f"All >= 0: {np.all(w >= -1e-10)}")
ex-sp-ch08-11
Medium
Use trust-constr to minimize $f(x) = x_1^2 + x_2^2$ subject to
$x_1 + x_2 = 1$ and $x_1 \ge 0.1$, $x_2 \ge 0.1$. Compare the result
with the analytical solution.
The analytical solution is $x^* = (0.5, 0.5)$ with $f^* = 0.5$.
Use LinearConstraint for the equality constraint.
Implementation
import numpy as np
from scipy.optimize import minimize, LinearConstraint, Bounds
f = lambda x: x[0]**2 + x[1]**2
grad_f = lambda x: 2*x
hess_f = lambda x: 2*np.eye(2)
lc = LinearConstraint([[1, 1]], [1], [1])
bounds = Bounds([0.1, 0.1], [np.inf, np.inf])
res = minimize(f, [0.8, 0.2], method='trust-constr',  # start off the optimum
               jac=grad_f, hess=hess_f,
               constraints=lc, bounds=bounds)
print(f"x* = {res.x}")
print(f"f* = {res.fun:.6f}")
ex-sp-ch08-12
Hard
Implement FISTA (Fast ISTA) for the LASSO problem and compare its
convergence rate with ISTA on a random system. Plot $f(x_k) - f^*$ vs.
iteration for both methods on a log-log scale.
FISTA adds momentum:
$y_{k+1} = x_{k+1} + \frac{t_k - 1}{t_{k+1}}(x_{k+1} - x_k)$,
where $t_{k+1} = \frac{1 + \sqrt{1 + 4t_k^2}}{2}$.
Use CVXPY to find $f^*$ as the reference.
Key idea
ISTA converges as $O(1/k)$ while FISTA converges as $O(1/k^2)$. On a
log-log plot of $f(x_k) - f^*$, ISTA should show slope $-1$ and FISTA
slope $-2$ (asymptotically).
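No reference implementation is given for this exercise; below is a minimal sketch of both methods on a random LASSO instance. The problem sizes, seed, and $\lambda$ are illustrative assumptions, not values from the exercise; plotting the objective gap is left to the reader.

import numpy as np

np.random.seed(0)
m, n = 50, 200                        # illustrative sizes
A = np.random.randn(m, n)
b = np.random.randn(m)
lam = 0.5
L = np.linalg.norm(A.T @ A, 2)        # Lipschitz constant of the gradient

def soft(z, t):
    return np.sign(z) * np.maximum(np.abs(z) - t, 0)

def obj(x):
    return 0.5*np.sum((A @ x - b)**2) + lam*np.sum(np.abs(x))

# ISTA: proximal gradient step at the current iterate
x = np.zeros(n)
for _ in range(1000):
    x = soft(x - A.T @ (A @ x - b)/L, lam/L)
print(f"ISTA  final objective: {obj(x):.6f}")

# FISTA: the same step taken at an extrapolated point y
x = np.zeros(n)
y, t = x.copy(), 1.0
for _ in range(1000):
    x_new = soft(y - A.T @ (A @ y - b)/L, lam/L)
    t_new = (1 + np.sqrt(1 + 4*t**2))/2
    y = x_new + (t - 1)/t_new*(x_new - x)
    x, t = x_new, t_new
print(f"FISTA final objective: {obj(x):.6f}")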
ex-sp-ch08-13
Hard
Solve the SDP relaxation of MAX-CUT for a random graph with $n$ nodes and
edge probability $p$. Implement Goemans-Williamson randomized rounding and
compute the approximation ratio.
Use cp.Variable((n,n), symmetric=True) with X >> 0 and X[i,i] == 1.
Round using random hyperplanes: $x_i = \operatorname{sign}(v_i^\top r)$,
where $V^\top V = X$ and $r \sim \mathcal{N}(0, I)$.
Key idea
The Goemans-Williamson algorithm guarantees at least $\approx 0.878$ of
the optimal cut in expectation. With 1000 random hyperplanes, you should
find a good cut.
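No reference implementation is given; the following is a minimal sketch with assumed graph parameters ($n = 20$, $p = 0.5$) and 1000 rounding hyperplanes. It requires cvxpy with an SDP-capable solver (the bundled SCS works).

import numpy as np
import cvxpy as cp

np.random.seed(1)
n, p = 20, 0.5                                   # assumed graph size
W = (np.random.rand(n, n) < p).astype(float)
W = np.triu(W, 1)
W = W + W.T                                      # symmetric 0/1 adjacency

# SDP relaxation: maximize sum_{i<j} w_ij (1 - X_ij)/2
X = cp.Variable((n, n), symmetric=True)
prob = cp.Problem(cp.Maximize(cp.sum(cp.multiply(W, 1 - X)) / 4),
                  [X >> 0, cp.diag(X) == 1])
prob.solve()

# Goemans-Williamson rounding: V^T V = X, cut by random hyperplanes
V = np.linalg.cholesky(X.value + 1e-6*np.eye(n)).T

def cut_value(x):
    return np.sum(W * (1 - np.outer(x, x))) / 4

best = max(cut_value(np.sign(V.T @ np.random.randn(n)))
           for _ in range(1000))
print(f"SDP bound: {prob.value:.2f}, best cut: {best:.2f}, "
      f"ratio: {best/prob.value:.3f}")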
ex-sp-ch08-14
Hard
Implement Anderson acceleration for a fixed-point iteration $x = g(x)$
and compare convergence speed with plain iteration. Use history depth $m$.
Anderson acceleration finds weights over the last $m$ iterates that
minimize the (linearized) residual.
Use scipy.optimize.root(method="anderson").
Key idea
Anderson acceleration typically reduces iteration count by 3-10x compared to plain iteration. The residual norms should decrease much faster.
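No reference implementation is given; the sketch below uses $g(x) = \cos x$ as an assumed example map and history depth 5 (neither value is recoverable from this copy of the exercise). SciPy's Anderson mixing solves the residual equation $F(x) = g(x) - x = 0$.

import numpy as np
from scipy.optimize import root

g = np.cos                                   # assumed example map

# Plain fixed-point iteration
x, steps = 0.5, 0
while abs(g(x) - x) > 1e-12:
    x = g(x)
    steps += 1
print(f"Plain iteration: {steps} steps, x = {x:.10f}")

# Anderson mixing on the residual F(x) = g(x) - x
res = root(lambda x: g(x) - x, 0.5, method='anderson',
           options={'jac_options': {'M': 5}, 'fatol': 1e-12})
print(f"Anderson: {res.nit} iterations, x = {float(res.x):.10f}")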
ex-sp-ch08-15
Hard
Implement the 1D total variation proximal operator using Chambolle's dual
algorithm. Apply it to a noisy piecewise-constant signal (length $n$,
5 pieces, SNR = 10 dB) and compare the denoised output for different
$\lambda$ values.
Generate: constant segments of random heights + Gaussian noise.
Chambolle iteration:
$p^{k+1} = \dfrac{p^k + \tau\,\nabla(\operatorname{div} p^k - v/\lambda)}{1 + \tau\,|\nabla(\operatorname{div} p^k - v/\lambda)|}$ (elementwise),
with the denoised signal $x = v - \lambda \operatorname{div} p$.
Key idea
Small $\lambda$ preserves detail but retains noise; large $\lambda$
over-smooths. The optimal $\lambda$ trades off between the two. TV
denoising preserves sharp edges, unlike Gaussian smoothing.
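No reference implementation is given; below is a minimal sketch using forward differences ($\nabla$ = np.diff, with div as its negative adjoint), $\tau = 1/4$ (the 1D stability limit), and an assumed signal length of 500.

import numpy as np

def grad(x):                 # forward difference: R^n -> R^(n-1)
    return np.diff(x)

def div(p):                  # negative adjoint of grad: R^(n-1) -> R^n
    return np.concatenate(([p[0]], np.diff(p), [-p[-1]]))

def prox_tv1d(v, lam, tau=0.25, iters=500):
    p = np.zeros(len(v) - 1)             # dual variable, |p_i| <= 1
    for _ in range(iters):
        q = grad(div(p) - v/lam)
        p = (p + tau*q) / (1 + tau*np.abs(q))
    return v - lam*div(p)

np.random.seed(3)
clean = np.repeat(np.random.randn(5), 100)           # 5 pieces, length 500
sigma = np.sqrt(np.mean(clean**2)) * 10**(-10/20)    # SNR = 10 dB
noisy = clean + sigma*np.random.randn(len(clean))
for lam in (0.1, 1.0, 10.0):
    rmse = np.sqrt(np.mean((prox_tv1d(noisy, lam) - clean)**2))
    print(f"lambda = {lam:5.1f}: RMSE = {rmse:.4f}")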
ex-sp-ch08-16
Hard
Use scipy.optimize.minimize with method='Newton-CG', providing
only Hessian-vector products (via hessp), to minimize a logistic
regression loss on a large ($m \times n$) dataset. Compare the timing
with the full Hessian via hess.
The Hessian-vector product is $\nabla^2 f(w)\,v = X^\top (d \odot (Xv))$,
where $d_i = \sigma_i(1 - \sigma_i)$ and $\sigma_i = \sigma(x_i^\top w)$.
This avoids forming the Hessian matrix.
Key idea
For large $n$, a Hessian-vector product costs $O(mn)$ while
forming the $n \times n$ Hessian costs $O(mn^2)$.
The hessp approach should be faster when $n$ is large.
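No reference implementation is given; a minimal sketch of the timing comparison follows. The dataset size (2000 x 400) is an illustrative assumption.

import time
import numpy as np
from scipy.optimize import minimize
from scipy.special import expit

np.random.seed(4)
m, n = 2000, 400                       # illustrative sizes
X = np.random.randn(m, n)
w_true = np.random.randn(n)
y = (np.random.rand(m) < expit(X @ w_true)).astype(float)

def loss(w):
    z = X @ w
    return np.sum(np.logaddexp(0, z) - y*z)     # stable logistic loss

def grad(w):
    return X.T @ (expit(X @ w) - y)

def hess(w):
    s = expit(X @ w)
    return X.T @ ((s*(1 - s))[:, None] * X)     # forms the full n x n Hessian

def hessp(w, v):
    s = expit(X @ w)
    return X.T @ (s*(1 - s) * (X @ v))          # Hessian-vector product only

w0 = np.zeros(n)
for kw in ({'hess': hess}, {'hessp': hessp}):
    t0 = time.perf_counter()
    res = minimize(loss, w0, jac=grad, method='Newton-CG', **kw)
    print(f"{list(kw)[0]:5s}: {time.perf_counter() - t0:.3f} s, "
          f"f* = {res.fun:.4f}")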
ex-sp-ch08-17
Challenge
Implement ADMM (Alternating Direction Method of Multipliers) for the
basis pursuit denoising problem
$\min_x \|x\|_1$ subject to $\|Ax - b\|_2 \le \epsilon$.
Compare with CVXPY's solution on a random system with 5 true nonzeros.
ADMM splits the problem: one block update is an $\ell_2$-ball projection,
another is a soft-threshold.
Use the augmented Lagrangian with penalty parameter $\rho$.
Key idea
ADMM decomposes the problem into easy subproblems: a least-squares solve,
a soft-threshold, and an $\ell_2$-ball projection. Convergence may require
100-500 iterations, but each iteration is cheap.
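No reference implementation is given. One splitting consistent with the hints introduces $z = x$ and $y = Ax$, i.e. $\min \|z\|_1 + I_{\{\|y - b\|_2 \le \epsilon\}}(y)$ subject to $x = z$, $Ax = y$: the $x$-update is a least-squares solve, the $z$-update a soft-threshold, and the $y$-update an $\ell_2$-ball projection. A minimal sketch with illustrative sizes, $\rho = 1$, and $\epsilon = 0.1$:

import numpy as np
import cvxpy as cp

np.random.seed(5)
m, n, k = 40, 120, 5                           # illustrative sizes
A = np.random.randn(m, n)
x_true = np.zeros(n)
x_true[:k] = np.random.randn(k)
b = A @ x_true + 0.01*np.random.randn(m)
eps = 0.1

# CVXPY reference
x_cv = cp.Variable(n)
cp.Problem(cp.Minimize(cp.norm(x_cv, 1)),
           [cp.norm(A @ x_cv - b, 2) <= eps]).solve()

# ADMM: min ||z||_1 + I{||y - b|| <= eps}  s.t.  x = z, Ax = y
rho = 1.0
x, z, y = np.zeros(n), np.zeros(n), b.copy()
u1, u2 = np.zeros(n), np.zeros(m)
M = np.linalg.cholesky(np.eye(n) + A.T @ A)    # factor once, reuse
for _ in range(500):
    rhs = (z - u1) + A.T @ (y - u2)
    x = np.linalg.solve(M.T, np.linalg.solve(M, rhs))  # least-squares solve
    z = np.sign(x + u1)*np.maximum(np.abs(x + u1) - 1/rho, 0)  # soft-threshold
    r = A @ x + u2 - b
    y = b + r*min(1.0, eps/(np.linalg.norm(r) + 1e-12))  # eps-ball projection
    u1 += x - z
    u2 += A @ x - y

print(f"||x_admm - x_cvxpy|| = {np.linalg.norm(z - x_cv.value):.4f}")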
ex-sp-ch08-18
Challenge
Formulate and solve a robust beamforming problem as an SOCP in CVXPY:
minimize the transmit power $\|w\|_2^2$ subject to
$|w^H a(\theta_0)| \ge 1$ and $|w^H a(\theta_k)| \le \delta$ for
interferer directions $\theta_k$, where $a(\theta)$ is the steering
vector for a ULA with $N$ antennas.
Steering vector: $a_n(\theta) = e^{j\pi n \sin\theta}$ for
$n = 0, \dots, N-1$ (half-wavelength spacing).
The constraint $|w^H a(\theta_0)| \ge 1$ can be written as
$\operatorname{Re}(w^H a(\theta_0)) \ge 1$ by absorbing the phase.
Key idea
After fixing the phase of the desired-signal constraint, this becomes an
SOCP. CVXPY handles complex variables natively. The resulting beamformer
should show a beam toward $\theta_0$ and nulls toward the interferers.
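No reference implementation is given; a minimal sketch with assumed parameters ($N = 10$ antennas, $\theta_0 = 20°$, interferers at $-40°$ and $50°$, $\delta = 0.05$):

import numpy as np
import cvxpy as cp

N = 10                                   # assumed number of antennas
theta0 = np.deg2rad(20)                  # desired direction
interf = np.deg2rad([-40.0, 50.0])       # assumed interferer directions
delta = 0.05

def steer(theta):
    # ULA steering vector, half-wavelength spacing
    return np.exp(1j*np.pi*np.arange(N)*np.sin(theta))

w = cp.Variable(N, complex=True)
cons = [cp.real(np.conj(steer(theta0)) @ w) >= 1]            # phase absorbed
cons += [cp.abs(np.conj(steer(t)) @ w) <= delta for t in interf]
prob = cp.Problem(cp.Minimize(cp.sum_squares(w)), cons)
prob.solve()

print(f"Transmit power: {prob.value:.4f}")
for t in np.concatenate(([theta0], interf)):
    resp = abs(np.conj(steer(t)) @ w.value)
    print(f"|w^H a({np.rad2deg(t):6.1f} deg)| = {resp:.4f}")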