(14f) Transfer Reinforcement Learning-Based Optimal Control of Nonlinear Systems
AIChE Annual Meeting
2024
2024 AIChE Annual Meeting
Computing and Systems Technology Division
10B: Data-driven Modeling, Estimation and Optimization for Control I
Sunday, October 27, 2024 - 4:50pm to 5:06pm
Motivated by the above challenges, we propose a safe transfer reinforcement learning (TRL) framework. The algorithm leverages knowledge obtained from pre-trained source tasks to expedite learning in a new yet related target task, thereby significantly reducing both learning time and computational overhead for optimizing a control policy. Additionally, since there is a discrepancy between target and source tasks, the knowledge transferred to the target process may cause performance degradation. To account for this discrepancy, we develop a theoretical analysis via statistical learning theory to characterize the performance of TRL by accounting for the differences between the source and target tasks [4,5]. Furthermore, the proposed TRL method is designed to collect data and optimize the control policy within a control invariant set (CIS) to ensure the safety of the system throughout the learning process. Finally, we apply the proposed TRL method to an example of optimal control of a chemical reactor, showcasing its effectiveness in solving the optimal control problem with improved computational efficiency and safety guarantees compared to traditional RL without using transfer learning.
References:
[1] B. Yan, P. Shi, C. P. Lim, Y. Sun, and R. K. Agarwal, âSecurity and safety-critical learning-based collaborative control for multiagent systems,â IEEE Transactions on Neural Networks and Learning Systems, pp. 1â12, 2024.
[2] Dogru O, Xie J, Prakash O, et al. Reinforcement learning in process industries: review and perspective[J]. IEEE/CAA Journal of Automatica Sinica, 2024, 11(2): 283-300.
[3] A. B. Jeddi, N. L. Dehghani, and A. Shafieezadeh, âLyapunov-based uncertainty-aware safe reinforcement learning,â arXiv preprint arXiv:2107.13944, 2021.
[4] M. Xiao, C. Hu, and Z. Wu, âModeling and predictive control of nonlinear processes using transfer learning method,â AIChE Journal, p. e18076, 2023.
[5] Y. Wang and Z. Wu, âControl lyapunov-barrier function-based safe reinforcement learning for nonlinear optimal control,â AIChE Journal, p. e18306, 2023.