Notes on Deep RL, including the different classes of algorithms, implementation details in code, and applications to robotics, games, etc.
Table of Contents
- The Underpinnings of RL
- Q-Learning
- Actor-Critic Methods
- Policy Gradient
- Proximal Policy Optimization (PPO)