My Implementations of standard Reinforcement Learning algorithms using PyTorch
- Vanilla Policy Gradient (VPG)
- Double Deep Q Network (DDQN)
- Deep Deterministic Policy Gradient (DDPG)
- Twin Delayed DDPG (TD3)
- Soft Actor Critic
- Proximal Policy Optimization
- Trust Region Policy Optimization