References:
- Docs: Stable Baselines: Reinforcement Learning Tips and Tricks
- Blog: The 32 Implementation Details of Proximal Policy Optimization (PPO) Algorithm
- Blog: 曾伊言 (Zeng Yiyan): Deep Reinforcement Learning Hyperparameter Tuning Tips, with D3QN, TD3, PPO, and SAC as Examples
- Paper: Deep Reinforcement Learning that Matters
- Paper: Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
- Paper: Revisiting Design Choices in Proximal Policy Optimization