MPC-RL | Haoxiang You

In this project, we developed an approach that integrates Proximal Policy Optimization (PPO) with a parameterized Model Predictive Control (MPC) policy. The MPC serves as the actor in the reinforcement learning loop, so PPO updates the controller's parameters, and this improves sample efficiency.
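To make the idea of an MPC actor concrete, here is a minimal sketch in Python. It is not the project's implementation, and everything specific in it is an assumption made for illustration: the double-integrator dynamics, the linear-quadratic form of the MPC problem, the choice of log cost weights `theta` as the learnable parameters, the Gaussian exploration noise, and all function names. It shows the three pieces PPO needs from such an actor: a mean action computed by solving the MPC problem, a log-probability for sampled actions, and the clipped surrogate objective.

```python
import numpy as np

# Hypothetical double-integrator dynamics x' = A x + B u (assumed for illustration).
DT = 0.1
A = np.array([[1.0, DT], [0.0, 1.0]])
B = np.array([[0.0], [DT]])


def mpc_action(theta, x, horizon=10):
    """Return the first control of a finite-horizon LQ problem.

    theta = [log q_pos, log q_vel, log r] are the (assumed) learnable
    cost weights; the exponential keeps them positive.
    """
    Q = np.diag(np.exp(theta[:2]))
    R = np.array([[np.exp(theta[2])]])
    P = Q.copy()  # terminal cost
    K = np.zeros((1, 2))
    for _ in range(horizon):
        # Backward Riccati recursion; the last K computed is the
        # feedback gain for the first step of the horizon.
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return -(K @ x)  # mean action, shape (1,)


def gaussian_log_prob(action, mean, log_std):
    """Log-density of a diagonal Gaussian centered on the MPC action."""
    var = np.exp(2.0 * log_std)
    terms = -0.5 * (action - mean) ** 2 / var - log_std - 0.5 * np.log(2.0 * np.pi)
    return float(np.sum(terms))


def ppo_clip_objective(logp_new, logp_old, advantage, clip=0.2):
    """PPO clipped surrogate objective for one sample."""
    ratio = np.exp(logp_new - logp_old)
    clipped = np.clip(ratio, 1.0 - clip, 1.0 + clip)
    return float(np.minimum(ratio * advantage, clipped * advantage))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    theta = np.zeros(3)       # actor parameters that PPO would update
    log_std = np.log(0.1)     # exploration noise scale (assumed fixed here)
    x = np.array([1.0, 0.0])  # position 1, velocity 0

    mean = mpc_action(theta, x)
    action = mean + np.exp(log_std) * rng.standard_normal(mean.shape)
    logp = gaussian_log_prob(action, mean, log_std)
    print("mean action:", mean, "sampled:", action, "log-prob:", logp)
```

In a full training loop, the gradient of the surrogate objective with respect to `theta` would have to pass through the MPC solve, for example with an automatic differentiation framework; this sketch covers only the forward computation.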

Learning Curves

Animation