Value-Based Methods
Definition: DQN Loss Function
DQN Loss Function
\mathcal{D}Q_{\bar{\theta}}$ is the target network.
Definition: Double DQN
Double DQN
Standard DQN overestimates Q-values. Double DQN decouples action selection from evaluation: