(106b) Hybrid Modeling of Bioprocesses Based on a Kinetic Canonical Formalism
AIChE Annual Meeting
2022
2022 Annual Meeting
Computing and Systems Technology Division
Data-Driven Dynamic Modeling, Estimation and Control II
Monday, November 14, 2022 - 12:49pm to 1:08pm
Modeling biotechnological processes is particularly challenging due to its inherent complexity, making it very difficult to precisely describe the underlying metabolic mechanisms dictating the microorganismsâ behavior [2]â[5]. However, modeling these pathways is key for producing therapeutic compounds so that the associated processes can be optimized effectively [1]. Ideally, modelers have addressed this problem by combining well-defined kinetic expressions with available data (semi-empirical or deterministic) to support the efficient and cost-effective development of new therapeutic drugs [6]. However, choosing a well-suitable kinetic expression and calibrating it with experimental points is particularly challenging. Such methods involve the solution of nonconvex dynamic optimization problems that can lead to high computational costs and several suboptimal solutions.
As an alternative to mechanistic models, data-driven strategies allow studying the systemâs behavior without relying on detailed expert knowledge [7], [8]. An advantage of such process models is that they only require data and can be set up without any deep understanding of the underlying system. However, they are hard to interpret and may return values that are not consistent with the physical laws governing the underlying system due to the few mechanistic constraints imposed during training. As a bridge between deterministic and data-driven methods, hybrid modeling has recently gained popularity to exploit their complementary advantages. In essence, hybrid models combine a mechanistic backbone with a surrogate component, with their specific structure depending on the problem at hand. While this modeling approach is well known, how to optimally define hybrid models for bioprocesses remains challenging, including the choice of mechanistic and black-box components and their optimal level of hybridization.
This work proposes a hybrid modeling approach based on an S-system canonical formalism that automatically builds kinetic models of bioprocesses from experimental observations. The proposed methodology determines the model structure and parameters during the model-building procedure. Notably, our hybrid approach combines a first-principles backbone based on mass balances with a canonical kinetic S-system formalism, whose structure and model parameters are automatically identified by solving a mixed-integer nonlinear program (MINLP). Our approach significantly tightens the search space by applying ârationalâ constraints to the structure of the model. The model training is performed following a two-stage procedure that simplifies the calculations by avoiding the iterative integration of differential equations.
Numerical examples show that our method performs similarly to hybrid models based on artificial neural networks, with the advantage of obtaining a compact, analytical canonical expression. Such a hybrid model based on a canonical formalism allows modelers to extract information about the processes and generate further insight into their behavior. In this context, the proposed MINLP approach helps screen and adjust the complexity of the hybrid model. Hence, our approach could help find a suitable process model while simultaneously allowing practitioners to analyze the underlying formulation more quickly and use it in subsequent optimization studies.
[1] H. Narayanan, M. Sokolov, M. Morbidelli, and A. Butté, âA new generation of predictive models: The added value of hybrid models for manufacturing processes of therapeutic proteins,â Biotechnol. Bioeng., vol. 116, no. 10, pp. 2540â2549, Oct. 2019, doi: 10.1002/bit.27097.
[2] P. Petsagkourakis, I. O. Sandoval, E. Bradford, D. Zhang, and E. A. del Rio-Chanona, âReinforcement learning for batch bioprocess optimization,â Comput. Chem. Eng., vol. 133, p. 106649, Feb. 2020, doi: 10.1016/j.compchemeng.2019.106649.
[3] S. M. Mercier, B. Diepenbroek, R. H. Wijffels, and M. Streefland, âMultivariate PAT solutions for biopharmaceutical cultivation: current progress and limitations,â Trends Biotechnol., vol. 32, no. 6, pp. 329â336, Jun. 2014, doi: 10.1016/j.tibtech.2014.03.008.
[4] D. Zhang, T. R. Savage, and B. A. Cho, âCombining model structure identification and hybrid modelling for photoâproduction process predictive simulation and optimisation,â Biotechnol. Bioeng., vol. 117, no. 11, pp. 3356â3367, Nov. 2020, doi: 10.1002/bit.27512.
[5] G. Guillén-Gosálbez, A. Miró, R. Alves, A. Sorribas, and L. Jiménez, âIdentification of regulatory structure and kinetic parameters of biochemical networks via mixed-integer dynamic optimization,â BMC Syst. Biol., vol. 7, no. 1, p. 113, 2013, doi: 10.1186/1752-0509-7-113.
[6] M. von Stosch, R. Oliveira, J. Peres, and S. Feyo de Azevedo, âHybrid semi-parametric modeling in process systems engineering: Past, present and future,â Comput. Chem. Eng., vol. 60, pp. 86â101, Jan. 2014, doi: 10.1016/j.compchemeng.2013.08.008.
[7] O. Kahrs and W. Marquardt, âThe validity domain of hybrid models and its application in process optimization,â Chem. Eng. Process. Process Intensif., vol. 46, no. 11, pp. 1054â1066, Nov. 2007, doi: 10.1016/j.cep.2007.02.031.
[8] C. J. Taylor et al., âRapid, automated determination of reaction models and kinetic parameters,â Chem. Eng. J., vol. 413, p. 127017, Jun. 2021, doi: 10.1016/j.cej.2020.127017.