(363h) Comparing Reinforcement Learning and Bayesian Optimization for Tuning MPC Policies
AIChE Annual Meeting
2022
2022 Annual Meeting
Computing and Systems Technology Division
Interactive Session: Systems and Process Control
Tuesday, November 15, 2022 - 3:30pm to 5:00pm
Despite these results, the authors feel that there has not been enough cross-communication between these two lines of work. To bridge the gap, this work compares deterministic policy gradient methods from reinforcement learning against Bayesian optimization methods using a series of benchmark energy storage problems. We compare the two classes of algorithms in terms of data efficiency and discuss their relative merits and disadvantages, as well as opportunities for integration.
References:
Gros, S., & Zanon, M. (2019). Data-driven economic nmpc using reinforcement learning. IEEE Transactions on Automatic Control, 65(2), 636-648.
Lu, Q., González, L. D., Kumar, R., & Zavala, V. M. (2021). Bayesian optimization with reference models: A case study in MPC for HVAC central plants. Computers & Chemical Engineering, 154, 107491.