MPC-RL

Developing a novel method taht combining RL with MPC for improved sample efficiency

In this project, we developed a novel approach that integrates Proximal Policy Optimization (PPO) with a parameterized Model Predictive Control (MPC) policy. By using MPC as the actor in the reinforcement learning loop, our method improves sample efficiency.

Learning Curves

Animation