Prerequisites & Notation
Before You Begin
This chapter requires nn.Module and training loops (Chapter 26). Probability and statistics background (Chapter 9) is helpful.
- PyTorch nn.Module and training (Chapter 26)(Review ch26)
Self-check: Can you train a neural network with PyTorch?
- Probability and expectation (Chapter 9)(Review ch09)
Self-check: Do you understand expected value and conditional probability?
Notation for This Chapter
| Symbol | Meaning | Introduced |
|---|---|---|
| State space, action space | s01 | |
| Policy: probability of action in state | s01 | |
| Action-value function | s01 | |
| State-value function | s01 | |
| Discount factor | s01 | |
| Reward at time | s01 |