(250h) Meta-Adaptive Koopman Operators for Learning-Based Model Predictive Control of Parametrically Uncertain Nonlinear Systems
AIChE Annual Meeting
2024
2024 AIChE Annual Meeting
Computing and Systems Technology Division
10B: Advances in Process Control II
Tuesday, October 29, 2024 - 9:52am to 10:08am
Model predictive control (MPC) is a popular advanced control technique [2]. It optimizes the future predicted behavior of the system by utilizing a dynamic model along with current measurement data. Adaptive model predictive control (AMPC) has been proposed to address uncertainties [3]; however, the results on nonlinear systems have been limited. More results on nonlinear AMPC can be found in [4,5]. Although these results represent significant advancements, first-principle models are typically required as the foundation of control system designs. The theoretical assumption of linear dependency on uncertain parameters further limits its applicability to general nonlinear processes.
Recently, the Koopman operator theory has gained substantial research attention, owing to its capability to represent the dynamics of complex nonlinear processes in a linear manner [6]. Several algorithms have been developed to construct linear Koopman models from process data. These include dynamic mode decomposition (DMD) [7] and extended dynamic mode decomposition (EDMD) [8]. DMD and EDMD represent the observables using a predetermined set of basis functions, and then solve least-squares problems to approximate the linear Koopman operator. To streamline the design of the observable functions, researchers have leveraged deep learning and proposed Deep-DMD Koopman modeling methods [9â11]. However, these existing approaches have primarily targeted addressing a specific control task with fixed model parameters.
The learning and control of parametrically uncertain systems can be conceptualized as a multi-task problem, which may be addressed using the meta-learning concept [12]. For instance, meta-reinforcement learning (meta-RL), a fusion of meta-learning and RL, has been developed for learning-based control in multi-task scenarios. Meta RL controllers utilize previously acquired knowledge and real-time data to adapt to new tasks, wherein the system dynamics, objectives, or distribution of noises and disturbances can vary. [13] introduced a meta-learning-based MPC framework capable of finetuning the meta-trained neural network (NN) model using online data. This method was applied to control a legged robot in the presence of changed payloads, terrains, and even a disabled leg. In [14], a novel offline meta-RL strategy was proposed for tuning proportional-integral (PI) controllers in process control systems. However, the online adaptation of deep NNs is inefficient and computationally demanding. Furthermore, it is generally challenging for these meta-RL approaches to offer stability and ensure closed-loop performance.
Based on these observations, we aim to integrate meta-learning with Koopman operator theory to create a learning-based adaptive control framework for parametrically uncertain nonlinear systems. Within the Koopman operator framework, we proposed a meta-adaptive Koopman operator (MAKO) modeling approach. This approach learns from a multi-modal dataset to construct a meta-model for online adaptation. An adaptation scheme is developed to update the meta-model using online data while ensuring convergence. Based on the adaptive meta-model, a predictive control scheme is proposed for the underlying uncertain nonlinear systems. We prove the stability of the closed-loop system in the presence of previously unseen parameter settings. The contributions of this work include: 1) Meta-learning and Koopman operator theory are integrated for the first time to establish a learning-based adaptive MPC framework applicable to a general class of parametrically uncertain nonlinear systems; 2) We rigorously analyze and prove the stability of both the model online adaptation and the closed-loop system; 3) Based on three benchmark systems from various fields, MAKO demonstrates good modeling accuracy and robust tracking control performance in the presence of parameter uncertainties, and it outperforms competitive baselines.
References
[1] Tao, âMultivariable adaptive control: A survey,â Automatica, vol. 50, no. 11, pp. 2737â2764, 2014.
[2] Q. Mayne, J. B. Rawlings, C. V. Rao, and P. O. Scokaert, âConstrained model predictive control: Stability and optimality,â Automatica, vol. 36, no. 6, pp. 789â814, 2000.
[3] Fukushima, T.-H. Kim, and T. Sugie, âAdaptive model predictive control for a class of constrained linear systems based on the comparison model,â Automatica, vol. 43, no. 2, pp. 301â308, 2007.
[4] DeHaan and M. Guay, âAdaptive robust MPC: A minimally-conservative approach,â in 2007 American Control Conference, 2007, pp. 3937-3942.
[5] Xu, A. Suleman, and Y. Shi, âmulti-rate hierarchical fault-tolerant adaptive model predictive control framework: Theory and design for quadrotors,â Automatica, vol. 153, p. 111015, 2023.
[6] O. Koopman, âHamiltonian systems and transformation in Hilbert space,â Proceedings of the National Academy of Sciences, vol. 17, no. 5, pp. 315â318, 1931.
[7] J. Schmid, âDynamic mode decomposition of numerical and experimental data,â Journal of Fluid Mechanics, vol.656, pp. 5â28, 2010.
[8] Li, F. Dietrich, E. M. Bollt, and I. G. Kevrekidis, âExtended dynamic mode decomposition with dictionary learning: A data-driven adaptive spectral decomposition of the Koopman operator,â Chaos: An Interdisciplinary Journal of Nonlinear Science, vol. 27, no. 10, p. 103111, 2017.
[9] Morton, A. Jameson, M. J. Kochenderfer, and F. Witherden, âDeep dynamical modeling and control of unsteady fluid flows,â Advances in Neural Information Processing Systems, vol. 31, 2018.
[10] Han, Z. Li, X. Yin, and X. Yin, âRobust learning and control of time-delay nonlinear systems with deep recurrent Koopman operators,â IEEE Transactions on Industrial Informatics, vol. 20, no. 3, pp. 4675â4684, 2024.
[11] Shi and M. Q.-H. Meng, âDeep Koopman operator with control for nonlinear systems,â IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7700â7707, 2022.
[12] Hospedales, A. Antoniou, P. Micaelli, and A. Storkey, âMeta-learning in neural networks: A survey,â IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 9, pp. 5149â5169, 2021.
[13] Nagabandi, I. Clavera, S. Liu, R. S. Fearing, P. Abbeel, S. Levine, and C. Finn, âLearning to adapt in dynamic, realworld environments through meta-reinforcement learning,â in International Conference on Learning Representations, 2018.
[14] G. McClement, N. P. Lawrence, J. U. Backstr¨om, P. D. Loewen, M. G. Forbes, and R. B. Gopaluni, âMetareinforcement learning for the tuning of PI controllers: An offline approach,â Journal of Process Control, vol. 118, pp. 139â152, 2022.