MS Student, Southern University of Science and Technology (SUSTech)
[20211022] The Control Architecture for MIT Cheetah 3
[20211203] Markov Decision Process and Deep Q-Learning
[20220315] Policy Gradient Methods
[20240530] Policy Gradient Methods in Deep Reinforcement Learning: Introduction and Implementation