References & Further Reading
References
- I. Csiszár and J. Körner, Information Theory: Coding Theorems for Discrete Memoryless Systems, Cambridge University Press, 2nd ed., 2011
The definitive reference for the method of types. Chapters 1–2 develop the combinatorial framework; Chapters 5–6 derive the error exponents. Our treatment follows their notation and proof structure closely.
- T. M. Cover and J. A. Thomas, Elements of Information Theory, Wiley, 2nd ed., 2006
Chapter 11 ("Information Theory and Statistics") provides an accessible introduction to the method of types and to hypothesis-testing error exponents (Sanov's theorem, Stein's lemma). Our presentation bridges the two approaches.
- R. G. Gallager, Information Theory and Reliable Communication, Wiley, 1968
The original source for the random coding and expurgated error exponents. Gallager's bounding technique (the "$\rho$-trick") remains the most elegant approach to achievability bounds.
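In a standard formulation (notation may differ slightly from Gallager's original: channel $W$, input distribution $Q$), the $\rho$-trick yields the random coding exponent

$$E_r(R) = \max_{Q}\, \max_{0 \le \rho \le 1} \bigl[ E_0(\rho, Q) - \rho R \bigr], \qquad E_0(\rho, Q) = -\log \sum_{y} \Bigl( \sum_{x} Q(x)\, W(y \mid x)^{1/(1+\rho)} \Bigr)^{1+\rho},$$

so that the average error probability of a random code of rate $R$ and blocklength $n$ satisfies $P_e \le e^{-n E_r(R)}$.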
- C. E. Shannon, R. G. Gallager, and E. R. Berlekamp, Lower Bounds to Error Probability for Coding on Discrete Memoryless Channels, Parts I–II, Information and Control, vol. 10, 1967
The two-part paper establishing the sphere-packing bound — the strongest known converse for error exponents of DMCs.
- Y. Polyanskiy, H. V. Poor, and S. Verdú, Channel Coding Rate in the Finite Blocklength Regime, IEEE Trans. Inform. Theory, vol. 56, no. 5, pp. 2307–2359, May 2010
Introduces the normal approximation and channel dispersion for finite-blocklength analysis, providing much tighter bounds than error exponents for short codes.
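The normal approximation can be summarized as follows (for well-behaved DMCs; regularity conditions omitted). With $M^*(n, \epsilon)$ the maximal code size at blocklength $n$ and error probability $\epsilon$, capacity $C$, and channel dispersion $V$,

$$\log M^*(n, \epsilon) = nC - \sqrt{nV}\, Q^{-1}(\epsilon) + O(\log n),$$

where $Q^{-1}$ is the inverse of the Gaussian tail function $Q(x) = \int_x^\infty \frac{1}{\sqrt{2\pi}} e^{-t^2/2}\, dt$.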
- A. Dembo and O. Zeitouni, Large Deviations Techniques and Applications, Springer, 2nd ed., 1998
The standard reference for large deviations theory beyond the i.i.d. setting. Chapters 2–3 generalize Sanov's theorem to abstract alphabets and Markov chains.
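For orientation, the i.i.d. finite-alphabet statement that these references generalize: if $X_1, \dots, X_n$ are i.i.d. $\sim P$ on a finite alphabet, $\hat{P}_n$ is the empirical type of $X^n$, and $\Pi$ is a suitably regular set of distributions (e.g., the closure of its interior), then Sanov's theorem gives

$$\lim_{n \to \infty} \frac{1}{n} \log \Pr\bigl( \hat{P}_n \in \Pi \bigr) = -\inf_{Q \in \Pi} D(Q \,\|\, P).$$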
Further Reading
For readers who want to go deeper into error exponents and large deviations.
Exact error exponents for random codes on the BSC
A. Barg and G. D. Forney, 'Random Codes: Minimum Distances and Error Exponents,' IEEE Trans. Inform. Theory, vol. 48, no. 9, pp. 2568–2573, Sep. 2002
Derives the exact error exponent of typical codes from the random ensemble on the BSC in closed form, showing that typical random codes attain the expurgated exponent at low rates. (The reliability function of the BSC itself remains unknown below the critical rate.)
Error exponents for LDPC and polar codes
S. B. Korada, 'Polar Codes for Channel and Source Coding,' Ph.D. thesis, EPFL, 2009, Ch. 3
Analyzes the error exponent of polar codes, showing that the block error probability decays roughly as $2^{-\sqrt{n}}$ (more precisely, as $O(2^{-n^\beta})$ for any $\beta < 1/2$): sub-exponential in $n$, but with favorable scaling at practical blocklengths.
Large deviations beyond i.i.d.
A. Dembo and O. Zeitouni, Large Deviations Techniques and Applications, Springer, 2nd ed., 1998, Ch. 3
Extends Sanov's theorem and Cramér's theorem to Markov chains and more general processes. Essential for understanding error exponents of channels with memory.
Refined asymptotics: moderate deviations
Y. Altuğ and A. B. Wagner, 'Moderate Deviations in Channel Coding,' IEEE Trans. Inform. Theory, vol. 60, no. 8, pp. 4417–4426, Aug. 2014
Fills the gap between the error exponent regime (fixed rate below capacity) and the normal approximation regime (rate approaching capacity). Shows how the two frameworks connect through the moderate deviations principle.
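Concretely, the moderate deviations principle can be summarized as follows (a standard statement; regularity conditions omitted). For a DMC with capacity $C$ and dispersion $V > 0$, consider rates $R_n = C - \rho_n$ with $\rho_n \to 0$ and $n \rho_n^2 \to \infty$. Then the minimal block error probability $\epsilon^*(n, R_n)$ satisfies

$$\log \epsilon^*(n, R_n) = -\frac{n \rho_n^2}{2V}\, \bigl(1 + o(1)\bigr),$$

interpolating between the error exponent regime ($\rho_n$ fixed) and the normal approximation regime ($\rho_n \propto 1/\sqrt{n}$).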