MPC-RL
Developing a novel method that combines RL with MPC for improved sample efficiency
In this project, we developed a novel approach that integrates Proximal Policy Optimization (PPO) with a parameterized Model Predictive Control (MPC) policy. By using MPC as the actor in the reinforcement learning loop, our method improves sample efficiency.
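To make the idea of "MPC as the actor" concrete, here is a minimal sketch and not the project's actual implementation. It assumes a hypothetical scalar linear system, an MPC whose learnable parameters `theta` are log cost weights, and a Gaussian policy centred on the MPC action. The names `mpc_action`, `gaussian_log_prob`, and `ppo_clip_objective` are illustrative only. The MPC problem is solved with a finite-horizon Riccati recursion, which is exact only for this unconstrained linear-quadratic case.

```python
import math
import random

# Hypothetical scalar dynamics x' = a*x + b*u (an assumption, not the project's model).
A_DYN, B_DYN = 1.0, 0.1


def mpc_action(x, theta, horizon=10):
    """Return the first action of a finite-horizon LQ problem.

    theta = [log q, log r] holds the learnable cost weights; exp keeps them positive.
    """
    q, r = math.exp(theta[0]), math.exp(theta[1])
    p = q  # terminal cost weight
    k = 0.0
    for _ in range(horizon):  # backward Riccati recursion
        k = B_DYN * p * A_DYN / (r + B_DYN * p * B_DYN)
        p = q + A_DYN * p * (A_DYN - B_DYN * k)
    return -k * x  # the last gain computed applies to the current time step


def gaussian_log_prob(action, mean, std):
    """Log-density of a Gaussian exploration policy centred on the MPC action."""
    z = (action - mean) / std
    return -0.5 * z * z - math.log(std) - 0.5 * math.log(2.0 * math.pi)


def ppo_clip_objective(logp_new, logp_old, advantages, eps=0.2):
    """PPO clipped surrogate objective (to be maximised with respect to theta)."""
    total = 0.0
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(lp_new - lp_old)
        clipped = max(1.0 - eps, min(ratio, 1.0 + eps))
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# Usage: sample one action with the old parameters, then score a candidate update.
rng = random.Random(0)
theta_old, theta_new = [0.0, 0.0], [0.1, 0.0]
x, std = 1.0, 0.1
action = mpc_action(x, theta_old) + std * rng.gauss(0.0, 1.0)
logp_old = gaussian_log_prob(action, mpc_action(x, theta_old), std)
logp_new = gaussian_log_prob(action, mpc_action(x, theta_new), std)
objective = ppo_clip_objective([logp_new], [logp_old], [1.0])
```

In this sketch the policy has only two parameters, compared with the thousands of weights in a typical neural-network actor, which is one intuition for why a structured actor of this kind could need fewer samples. A practical version would differentiate the objective with respect to `theta` through the MPC solve and would handle constraints and nonlinear dynamics.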
Learning Curves

Animation
