(241b) Improving Robustness of Machine Learning Modeling of Nonlinear Processes Using Lipschitz-Constrained Neural Networks
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Computing and Systems Technology Division
Advances in Computational Methods and Numerical Analysis - II
Wednesday, November 8, 2023 - 3:55pm to 4:13pm
To resolve these issues, we propose the use of Lipschitz-Constrained NNs (LCNNs) to model nonlinear processes. LCNNs have gained recent attention since they can reduce input sensitivity by maintaining a low Lipschitz constant and prevent overfitting by maintaining a low hypothesis complexity. In this work, we first prove a universal approximation theorem for LCNNs using SpectralDense layers (Serrurier et al. (2021)) to show that despite their lowered hypothesis complexity, they can approximate any 1-Lipschitz continuous function. Next, we develop a probabilistic bound on their generalization error by computing a size-dependent upper bound for their empirical Rademacher Complexity (ERC). Subsequently, we incorporate the LCNNs into the model predictive control (MPC) scheme, and a chemical process example is utilized to demonstrate that the LCNN-based MPC outperforms the MPC using conventional feedforward NNs in the presence of training data noise. Furthermore, due to the improved robustness of LCNNs, we also investigate the integration of LCNNs with autoencoders to improve the performance of model order reduction by effectively learning a low-dimensional representation of data embedded in a high-dimensional space even in the presence of data noise.
References
Baldi, P., Sadowski, P., 2014. The dropout learning algorithm. Artificial intelligence 210, 78â122
Baldi, P., Sadowski, P.J., 2013. Understanding dropout. Advances in neural information processing systems 26.
Moore, R.C., DeNero, J., 2011. L1 and L2 regularization for multiclass hinge loss models, in: Symposium on Machine Learning in Speech and Natural Language Processing
Serrurier, M., Mamalet, F., González-Sanz, A., Boissin, T., Loubes, J.M., Del Barrio, E., 2021. Achieving robustness in classification using optimal transport with hinge regularization, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 505â514
Szegedy, C., Zaremba, W., Sutskever, I., Bruna, J., Erhan, D., Goodfellow, I., Fergus, R., 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199
Ying, X., 2019. An overview of overfitting and its solutions. Journal of Physics: Conference Series 1168, 022022