(126b) General Feature Extraction for Process Data Using Convolutional Neural Network Based Transfer Learning | AIChE

(126b) General Feature Extraction for Process Data Using Convolutional Neural Network Based Transfer Learning

Authors 

Zhu, W. - Presenter, Chemical Engineering Department, Louisiana State U
Romagnoli, J. A. - Presenter, Louisiana State University
In the past decades, data-driven methods have been widely applied in process data analytics. Particularly in process monitoring and fault detection area, many approaches were developed from simple linear models to complicated non-linear models, including PCA-based methods [1], different kernel methods [2], and various novel machine learning techniques. Nevertheless, most of these data-driven approaches are extremely specific, where the developed models can only be applied in the given process with the selected variables. In other words, a model trained on an old chemical process may not be suitable to be applied on a new process with same implementation of operation units. The lack of generalization is a major drawback of current data-driven process monitoring techniques, which potentially limits their performance on real industrial applications.

On the other hand, if we review the data in any chemical processes, a commonality can be observed that most of the measurements are pressures, temperatures, flow rates, levels and concentration of different species. Typically, their intrinsic correlations can be summarized by the first principles, such as mass balances, gas laws, energy balances and etc. Although the first principle approach requires a lot of effort to investigate a physical system, while it provides a more general solution for different kinds of systems. Additionally, commonalities also widely exist in process monitoring and fault detection models. Regardless of the number of input variables used in the model, the objective is always to correctly identify the distribution, ~N(μ,σ2) of normal operating conditions and distinguish the abnormality of test data.

With the recent development of deep learning techniques, it offers the feasibility to build models with better generalization capability. Inspired by the recent breakthroughs in computer vision field, a transfer learning approach was proposed using convolutional neural networks (CNN), in order to extract general features from process data. The proposed framework is more generalized than conventional methods, where features across different processes with different variables can be learned. The proposed CNN classifier was first trained on the famous benchmark, the Tennessee Eastman Process (TEP) [3] to learn the patterns of different kinds of faults. The features learned in the intermediate layers of the CNN classifier can be pooled out for various tasks. In this work, a fault detection method is demonstrated based on the proposed feature extraction tool, which provides a superior generalization ability that can be applied outside the TEP datasets.


References:

[1] L. H. Chiang, E. L. Russell and R. D. Braatz, Fault detection and diagnosis in industrial systems, New York City: Springer Science & Business Media, 2000.

[2] S. W. Choi and e. al., "Fault detection and identification of nonlinear processes based on kernel PCA," Chemometrics and intelligent laboratory systems, vol. 75.1, pp. 55-67, 2005.

[3] J. J. Downs and E. F. Vogel., "A plant-wide industrial process control problem," Computers & chemical engineering, vol. 17.3, pp. 245-255, 1993.