(712f) Embedding Data-Driven Classifier Surrogates in Solvent Mixture Design Optimisation Problems
AIChE Annual Meeting
2024
2024 AIChE Annual Meeting
Computing and Systems Technology Division
10C: Data-driven Optimization
Thursday, October 31, 2024 - 5:15pm to 5:36pm
In this work, we investigate the use of surrogate models to replace the nonlinear thermodynamic constraints with a focus on the reliability and tractability of the resulting optimization problem. We examine two types of machine learning models, support vector machines (SVMs) and artificial neural networks (ANNs), with the aim to train models that are valid over a range of solvent mixtures and thermodynamic conditions. Instead of training regression models, we train classifier models to predict the phase behaviour of the mixtures of interest, specifically the phase stability of the mixture and the solubility of the target product. The surrogate models are trained based on phase equilibrium data generated using the UNIFAC group contribution method.6 This provides an alternative framework to define the optimisation problem. Given certain candidate solvents and a target product, the van der Waals surface area, van der Waals volume, composition of the solvent mixture and temperature are selected as the features of the surrogate models. We consider different kernel function formulations of SVMs and activation functions for ANNs. For the phase stability, the Matthews correlation coefficient (MCC) is applied as the metric while for the solubility, the deviation between the true and predicted solubility by fixing composition of certain solvent is used to evaluate the classifier performance.
We apply the proposed approach to two case studies. Given as set of 9 candidate solvents, Case 1 aims to find the optimal binary solvent mixture that yields the highest solubility of ibuprofen at a fixed temperature. Case 2 considers a more challenging cooling / anti-solvent crystallisation process, with variable initial temperature. In Case 1, the results show that the problem can be solved more efficiently with some surrogate models than with the .7 Using ANNs, the problem can be solved within 10 seconds, which is much faster than with any other approach tested. Although BARON8 can solve the models in case 1 within shorter runtimes, numerical difficulties sometime appears which makes it unreliable. This also happens with the solver GUROBI. 9 In Case 2, better (local) solutions are obtained from the surrogate-based models than with the model based on thermodynamic equations with sBB solver.10 However, this case remains intractable for global solvers within a limited runtime. These results reveal the potential and benefits of implementing machine learning classifier models in the solvent design problem, as an alternative to the standard thermodynamics-based models.
Reference
(1) Gani, R. Chemical Product Design: Challenges and Opportunities. Computers & Chemical Engineering 2004, 28 (12), 2441â2457. https://doi.org/10.1016/j.compchemeng.2004.08.010.
(2) Watson, O. L.; Jonuzaj, S.; McGinty, J.; Sefcik, J.; Galindo, A.; Jackson, G.; Adjiman, C. S. Computer Aided Design of Solvent Blends for Hybrid Cooling and Antisolvent Crystallization of Active Pharmaceutical Ingredients. Org. Process Res. Dev. 2021, 25 (5), 1123â1142. https://doi.org/10.1021/acs.oprd.0c00516.
(3) Mitsos, A.; Bollas, G. M.; Barton, P. I. Bilevel Optimization Formulation for Parameter Estimation in LiquidâLiquid Phase Equilibrium Problems. Chemical Engineering Science 2009, 64 (3), 548â559. https://doi.org/10.1016/j.ces.2008.09.034.
(4) Jonuzaj, S.; Akula, P. T.; Kleniati, P.-M.; Adjiman, C. S. The Formulation of Optimal Mixtures with Generalized Disjunctive Programming: A Solvent Design Case Study. AIChE Journal 2016, 62 (5), 1616â1633. https://doi.org/10.1002/aic.15122.
(5) Jonuzaj, S.; Adjiman, C. S. Designing Optimal Mixtures Using Generalized Disjunctive Programming: Hull Relaxations. Chemical Engineering Science 2017, 159, 106â130. https://doi.org/10.1016/j.ces.2016.08.008.
(6) Fredenslund, A.; Jones, R. L.; Prausnitz, J. M. Group-Contribution Estimation of Activity Coefficients in Nonideal Liquid Mixtures. AIChE Journal 1975, 21 (6), 1086â1099. https://doi.org/10.1002/aic.690210607.
(7) Bolusani, S.; Besançon, M.; Bestuzheva, K.; Chmiela, A.; DionÃsio, J.; Donkiewicz, T.; van Doornmalen, J.; Eifler, L.; Ghannam, M.; Gleixner, A.; Graczyk, C.; Halbig, K.; Hedtke, I.; Hoen, A.; Hojny, C.; van der Hulst, R.; Kamp, D.; Koch, T.; Kofler, K.; Lentz, J.; Manns, J.; Mexi, G.; Mühmer, E.; Pfetsch, M. E.; Schlösser, F.; Serrano, F.; Shinano, Y.; Turner, M.; Vigerske, S.; Weninger, D.; Xu, L. The SCIP Optimization Suite 9.0. arXiv February 27, 2024. http://arxiv.org/abs/2402.17702 (accessed 2024-03-02).
(8) Tawarmalani, M.; Sahinidis, N. V. A Polyhedral Branch-and-Cut Approach to Global Optimization. Math Program 2005, 103 (2), 225â249. https://doi.org/10.1007/s10107-005-0581-8.
(9) Gurobi Optimization, L. Gurobi Optimizer Reference Manual. Gurobi Optimization. https://www.gurobi.com/documentation/ (accessed 2024-04-02).
(10) GAMS Development Corporation, 2024. GAMS - The Solver Manuals: SBB. https://www.gams.com/latest/docs/S_SBB.html (accessed 2024-04-01).