(362f) Development of Bayesian Machine Learning Algorithms for Optimal Nonlinear Model Selection and Parameter Estimation from Noisy Data of Unknown Characteristics

Conference

AIChE Annual Meeting

Year

2022

Proceeding

2022 Annual Meeting

Group

Computing and Systems Technology Division

Session

Interactive Session: Data and Information Systems

Time

Tuesday, November 15, 2022 - 3:30pm to 5:00pm

Authors

Adeyemo, S. - Presenter

Bhattacharyya, D., West Virginia University

Process models are very essential in the study of dynamics, prediction of outputs and control of process plants. While data driven models serve as a powerful alternative to high fidelity first-principles models which are often complicated and require considerable resources to develop, the selection of an appropriate model form for highly nonlinear systems is a significant challenge. The model should be sparse and provide succinct representation of a system and should possess favorable characteristics for being used in optimization. A classical model structure that has proven very useful for many systems is the state space model. However, these models can yield large errors for highly nonlinear systems. One simple extension of the state space model for nonlinear systems is to include bilinear and extended bilinear terms. In our earlier work, such models were presented and an algorithm was developed for parameter estimation under uncertainty for systems with noisy data with unknown noise characteristics. A two-stage Bayesian machine learning (ML) algorithm was developed that was found to be very effective not just in the prediction of model behaviors but also for exploring the connectivity structures of process plants [1]. However, the model form may be restrictive for highly nonlinear systems. In addition, sparsity was not guaranteed.

Recently, several authors have employed sparse regression for succinct representation of dynamical systems through optimal selection of basis function from several candidate basis functions[2][3]. However, the algorithms are applicable to the data without noise and therefore the efficacy of the developed algorithms were tested using simulated data. Furthermore, the candidate basis functions were rather limited. However, most real-life process data are corrupted with noise with unknown characteristics. Besides, there may be colored noise present in the data.

In this work, we develop a Branch and Bound algorithm [4] for subset selection using an information criteria that rewards model fitness while penalizing model size and model complexity. Large number of basis functions are considered including polynomial, logarithmic, and exponential functions, and interaction terms between inputs and outputs including time delays this leading to a large-scale combinatorial optimization problem. A desired number of hierarchically ranked candidate models are obtained leaving the user with the choice of selecting any top rank model that yields succinct representation of the system with explainable basis functions. Parameters are estimated by employing Bayesian inferencing by maximizing the likelihood of the estimated model parameters. The two-step approach also estimates the unknown noise characteristics in an outer loop conditioned on available data. The method facilitates incorporating prior user belief of model parameters. Based on an expectation maximization algorithm, the algorithm yields simultaneous estimation of the noise covariance in addition to the model parameters.

The proposed algorithm is applied to the simulated data for a reactor-separator system and also to the operational data from an industrial boiler. The algorithm shows promise for flexible and succinct model development for highly nonlinear systems from data with correlated noise and unknown noise characteristics.

References

[1] T. Bankole and D. Bhattacharyya, â€œExploiting connectivity structures for decomposing process plants,â€ J. Process Control, vol. 71, pp. 116â€“129, 2018, doi: 10.1016/j.jprocont.2018.09.002.

[2] B. Bhadriraju, M. S. F. Bangi, A. Narasingam, and J. S. Il Kwon, â€œOperable adaptive sparse identification of systems: Application to chemical processes,â€ AIChE J., vol. 66, no. 11, pp. 1â€“15, 2020, doi: 10.1002/aic.16980.

[3] N. M. Mangan, J. N. Kutz, S. L. Brunton, and J. L. Proctor, â€œModel selection for dynamical systems via sparse regression and information criteria,â€ Proc. R. Soc. A Math. Phys. Eng. Sci., vol. 473, no. 2204, 2017, doi: 10.1098/rspa.2017.0009.

[4] A. A. H. Land and A. G. Doig, â€œAutomatic Method of Solving Discrete Programming Problemsâ€. JSTOR vol. 28, no. 3, pp. 497â€“520, 1960.

Topics

Computing and Systems Engineering

Other Sites & Tools

Technical Groups

Technical

Professional/Personal Growth

Societal Needs

Leadership

2024 Annual Safety in Ammonia Plants and Related Facilities Symposium

4th Optogenetic Technologies and Applications Conference

Upcoming Conferences & Events

CCPS India Regional Meeting

CCPS Process Safety Knowledge Webinar (Brazil)

2024 Indonesia Student Regional Conference

Procesa 2024: 6th AIChE Latin America Student Regional Conference

CCPS Workshop on Process Safety Metrics: API-RP-754 Implementation

University of Houston Student Process Safety Bootcamp

2024 Annual Safety in Ammonia Plants and Related Facilities Symposium

9th CCPS Canadian Regional Meeting

4th Optogenetic Technologies and Applications Conference

CEP: August 2024

CEP: July 2024

Explore Areas of Advancement:

Learning Center:

Want to be an Entrepreneur? Personal Stories From Three Successful Entrepreneurs Who Have Traveled This Path.

(362f) Development of Bayesian Machine Learning Algorithms for Optimal Nonlinear Model Selection and Parameter Estimation from Noisy Data of Unknown Characteristics

AIChE Annual Meeting

2022

2022 Annual Meeting

Computing and Systems Technology Division

Interactive Session: Data and Information Systems

Tuesday, November 15, 2022 - 3:30pm to 5:00pm

Authors

Topics

More Conference Links

Cancellation Policy

Code of Conduct

Beware of Hotel and Attendee-list Scams