(385e) Sparse Data-Driven Modeling of Crystallization Processes | AIChE

(385e) Sparse Data-Driven Modeling of Crystallization Processes

Authors 

W. Nyande, B. - Presenter, The Hong Kong University of Science & Technology (HKUST)
Nagy, Z., Purdue
Lakerveld, R., The Hong Kong University of Science and Technology
Solution crystallization is an important unit operation in the fine chemicals, food, agrochemical, and pharmaceutical industries for the production of high added-value products in crystalline form.1,2 Crystallization processes are commonly described by population balance models. The population balance model is an integro-partial differential equation describing the dynamic development of crystal quality attributes such as the size distribution by accounting for crystallization kinetics, including nucleation, growth, agglomeration, and breakage. The population balance model can enable improved process design, a systematic evaluation of process alternatives, and the development of model-based control strategies3 which are better capable of coping with process disturbances and input-output constraints compared to model-free strategies. However, the development of population balance models for industrial crystallizers is challenging due to the high sensitivity of crystallization kinetics on supersaturation and mixing. Additionally, the functional forms of the kinetic expressions for the various crystallization mechanisms being modelled are often unknown and are generally chosen based on trial-and-error.

High quality real time crystallization data are now readily available in industry due to recent advancements in process analytical technology (PAT) tools.4 Such high quality data can be used to develop mathematical models for crystallization processes. Data-driven methods such as symbolic regression based on evolutionary computations5,6, physics informed neural networks (PINN)7, and the sparse identification of nonlinear dynamics (SINDy)8 methodologies can lead to the discovery of first-principle models from data and thus overcome some of the challenges encountered in developing interpretable first-principle models. The SINDy approach identifies models by leveraging the inherent sparsity available in most physical systems of interest without performing a combinatorial brute-force search which can be computationally intractable.8 Despite the promising potential of the SINDy approach, its application to modelling crystallization processes has not been characterized yet.

The objective of this work is to characterize the performance of the sparse data-driven methodology, SINDy, for modelling crystallization processes. Three case studies with increasing complexities are presented. As a benchmark, the SINDy approach is first characterized for the identification of crystallization kinetics in a mixed-suspension mixed-product-removal crystallizer using in silico data followed by the identification of crystallization kinetics from experimental data obtained from seeded protein crystallization and the cooling crystallization of paracetamol in batch. Considerable agreements are obtained between the data-driven model and the experiments. The SINDy approach holds promise for developing model-based control strategies for crystallization processes due to the relatively low computational cost associated with identifying crystallization models.

References

(1) Wu, W. L.; Oliva, J. A.; Kshirsagar, S.; Burcham, C. L.; Nagy, Z. K. Continuous In Situ Seed Generation through the Integration of a Mixed Suspension Mixed Product Removal and an Oscillatory Baffled Crystallizer for the Control of Crystal Size Distribution and Polymorphic Form. Cryst. Growth Des. 2021, 21 (12), 6684–6696.

(2) Nyande, B. W.; Thomas, K. M.; Takarianto, A. A.; Lakerveld, R. Control of Crystal Size Distribution in Batch Protein Crystallization by Integrating a Gapped Kenics Static Mixer to Flexibly Produce Seed Crystals. Chem. Eng. Sci. 2022, 263, 118085.

(3) Szilagyi, B.; Eren, A.; Quon, J. L.; Papageorgiou, C. D.; Nagy, Z. K. Application of Model-Free and Model-Based Quality-by-Control (QbC) for the Efficient Design of Pharmaceutical Crystallization Processes. Cryst. Growth Des. 2020, 20 (6), 3979–3996.

(4) Simon, L. L.; Pataki, H.; et al. Assessment of Recent Process Analytical Technology (PAT) Trends: A Multiauthor Review. Org. Process Res. Dev. 2015, 19 (1), 3–62.

(5) Schmidt, M.; Lipson, H. Distilling Free-Form Natural Laws from Experimental Data. Science (80-. ). 2009, 324 (5923), 81–85.

(6) Bongard, J.; Lipson, H. Automated Reverse Engineering of Nonlinear Dynamical Systems. Proc. Natl. Acad. Sci. U. S. A. 2007, 104 (24), 9943–9948.

(7) Mao, Z.; Jagtap, A. D.; Karniadakis, G. E. Physics-Informed Neural Networks for High-Speed Flows. Comput. Methods Appl. Mech. Eng. 2020, 360, 112789

(8) Brunton, S. L.; Proctor, J. L.; Kutz, J. N.; Bialek, W. Discovering Governing Equations from Data by Sparse Identification of Nonlinear Dynamical Systems. Proc. Natl. Acad. Sci. U. S. A. 2016, 113 (15), 3932– 3937.

Topics