Notes

[20211022] The Control Architecture for MIT Cheetah 3

[20211203] Markov Decision Process and Deep Q Learning

[20220315] Policy Gradient Methods

[20240530] Policy Gradient Methods in Deep Reinforcement Learning: Introduction and Implementation