(46b) A New Method for Building Surrogate Models of Stochastic Simulations
AIChE Annual Meeting
2020
2020 Virtual AIChE Annual Meeting
Topical Conference: Next-Gen Manufacturing
Artificial Intelligence and Advanced Computation I
Monday, November 16, 2020 - 8:15am to 8:30am
When high-fidelity simulation models have uncertain parameters and/or uncertain inputs, the traditional techniques for building surrogate models fail to accurately represent the original model outputs (Staum, 2009). Current approaches for building surrogate models of stochastic simulations, in general, fall into one of two groups. The first group considers a nominal value for the uncertain parameter(s) and builds surrogate models at these values of uncertain parameters (Hüllen et al., 2019) or defines an uncertainty set for the stochastic parameter, evaluates the high-fidelity simulation model using the values of the elements of these sets and carries out the surrogate modeling (Hüllen et al., 2019). The second group, which has been quite popular in recent years, applies stochastic kriging (Ankenman et al., 2008). By utilizing the methods under the first group, more uncertainty may be added to the surrogate model because part of the information is being ignored by fixing the value of uncertain parameters (Hüllen et al., 2019). Stochastic kriging has been shown to exhibit good performance in predicting the expected output value (Ankenman et al., 2008). However, by using this method, the surrogate model type is predetermined because it is built based on kriging meta-models, while other surrogate modeling techniques may be more appropriate for representing the data (Williams and Cremaschi, 2019). Developing approaches for building surrogate models to estimate the outputs of high-fidelity stochastic simulation models accurately can overcome these shortcomings.
In this study, we propose that accurate surrogate models can be built for estimating the outputs of high-fidelity stochastic simulations by considering each uncertain parameter of the stochastic simulation as an uncertain input to the simulation. Isolating the uncertain stochastic parameters of the high-fidelity simulation converts the simulation to a deterministic one, for which most of the popular surrogate modeling techniques can be used to train a meta-model. The uncertain parameter(s) are still part of the meta-model, which paves the way for easier analysis of the uncertainty. Assume we have a high-fidelity simulation model g(X;K), where X is the input vector with dimension d1, where some of the inputs may be uncertain, and K is the vector of system uncertain parameters with d2 components. The new simulation model is defined as g'(X*), where X* is a d dimensional vector of inputs with d = d1+ d2, which contains all inputs and the uncertain parameters of the stochastic simulation (g(X;K)). It is assumed that the distributions of the uncertain parameters (K) are known, and the parameters of the distributions are constant. With these definitions, the new simulation model (g'(X*)) becomes deterministic, and any of the surrogate modeling techniques can be utilized to train a model representing this simulation (Fig. 1).
With these definitions, the new simulation model (g'(X*)) becomes deterministic, and any of the surrogate modeling techniques can be utilized to train a model representing this simulation.
g'(X*) â F'(X* ), where F'(X* ) is the trained meta-model representing the deterministic formulation. With this new method, the high fidelity stochastic simulation is approximated using the existing surrogate modeling techniques for deterministic models.
There are various machine learning techniques practiced for surrogate modeling. Seven different techniques are implemented in this study to train the meta-models of deterministic formulation, g'(X*). The performance of the new method is evaluated computationally using these seven surrogate modeling methods for test functions with uncertain parameters. The test functions are from the Virtual Library of Simulation Experiments (Surjanovic and Bingham, 2013) with a different number of inputs and uncertain parameters. The quality of the predictions is evaluated for test functions with various features. The number of inputs of the function (dimension of X), number of uncertain parameters (cardinality of K), and nonlinearity of the uncertain parameters are considered as features of the test functions. Root mean square error and the mean absolute error are the metrics of the quality of the performance. One thousand data points generated for each of the test functions were used as the training data set. The test set was 105 new data points for comparison of the model performances. The results suggest that the prediction accuracy depends on the specifications of the test functions. The number of uncertain parameters, as well as the number of input parameters have a significant effect on the quality of the predictions.
References
Ankenman, B., Nelson, B.L., Staum, J., 2008. Stochastic kriging for simulation metamodeling. Proc. - Winter Simul. Conf. 362â370. https://doi.org/10.1109/WSC.2008.4736089
Eaton, A.N., Beal, L.D.R., Thorpe, S.D., Hubbell, C.B., Hedengren, J.D., Nybø, R., Aghito, M., 2017. Real time model identification using multi-fidelity models in managed pressure drilling. Comput. Chem. Eng. 97, 76â84. https://doi.org/10.1016/j.compchemeng.2016.11.008
Han, Z.-H., Zhang, K.-S., 2012. Surrogate-based optimization. Real-world Appl. Genet. algorithms 343â362.
Hüllen, G., Zhai, J., Kim, S.H., Sinha, A., Realff, M.J., Boukouvala, F., 2019. Managing Uncertainty in Data-Driven Simulation-Based Optimization. Comput. Chem. Eng. 106519. https://doi.org/10.1016/j.compchemeng.2019.106519
Liu, B., Koziel, S., Zhang, Q., 2016. A multi-fidelity surrogate-model-assisted evolutionary algorithm for computationally expensive optimization problems. J. Comput. Sci. 12, 28â37. https://doi.org/10.1016/j.jocs.2015.11.004
Ookawara, S., Kuroki, M., Street, D., Ogawa, K., 2007. High-fidelity DEM-CFD modeling of packed bed reactors for process intensification. Chem. Eng. 16â20.
Peherstorfer, B., Kramer, B., Willcox, K., 2017. Combining multiple surrogate models to accelerate failure probability estimation with expensive high-fidelity models. J. Comput. Phys. 341, 61â75. https://doi.org/10.1016/j.jcp.2017.04.012
Quirante, N., Javaloyes, J., Ruiz-Femenia, R., Caballero, J.A., 2015. Optimization of chemical processes using surrogate models based on a kriging interpolation, in: Computer Aided Chemical Engineering. Elsevier, pp. 179â184.
Staum, J., 2009. Better Simulation Metamodeling: The why, what, and how of Stochastic Kriging 119â133.
Surjanovic, S., Bingham, D., 2013. Virtual Library of Simulation Experiments: Test Functions and Datasets.
Szilágyi, B., Agachi, P.Å., Nagy, Z.K., 2018. Chord Length Distribution Based Modeling and Adaptive Model Predictive Control of Batch Crystallization Processes Using High Fidelity Full Population Balance Models. Ind. Eng. Chem. Res. 57, 3320â3332. https://doi.org/10.1021/acs.iecr.7b03964
Williams, B.A., Cremaschi, S., 2019. Surrogate Model Selection for Design Space Approximation And Surrogatebased Optimization, in: Computer Aided Chemical Engineering. Elsevier, pp. 353â358.