(400a) Efficient Flexibility Analysis of Computationally Expensive Black-Box Simulators Using Quantile-Based Bayesian Optimization | AIChE

(400a) Efficient Flexibility Analysis of Computationally Expensive Black-Box Simulators Using Quantile-Based Bayesian Optimization

Authors 

Paulson, J., The Ohio State University
Tang, W. T., The Ohio State University
Uncertainty, which is inevitably present in real-world problems, can come from a variety of sources such as noisy and incomplete datasets, unknown parameters, environmental disturbances (such as product demand and prices), and implementation errors. Flexibility analysis provides a quantitative framework for identifying feasibility of a process design over a range of uncertainty values while accounting for feedback/recourse from manipulatable control variables available in the system. The notion of flexibility was first introduced for process systems engineering problems in [1] and has been extended to account for probabilistic uncertainty descriptions [2] and dynamic problems [3]. The key assumption in the majority of these works is that the structure of the model is exactly known and can be exploited by existing optimization algorithms that assume derivatives and/or relaxations of the objective and constraint functions can be easily computed. In many emerging applications, however, this assumption is violated since the process model is defined in terms of an expensive simulator whose internal structure cannot be accessed by the modeler. Relevant examples include the simulation of: (i) multi-product plants that often requires proprietary (e.g., Aspen Plus) models of thermodynamics or kinetics, (ii) environmental models (e.g., CALPUFF, SEEMAC, iTree) that can be used to evaluate important sustainability metrics, and (iii) multiphysics phenomena (e.g., COMSOL) that is needed to represent important spatial variability and boundary effects. These cases are often referred to as “black-box” (in contrast to the “white-box” known equation case) and have been the focus of more recent work on flexibility analysis [4-7]. It is worth noting that most previous works on black-box flexibility analysis focus on the special case in which there are no recourse variables – this can be interpreted as a “feasibility test,” as one is trying to determine if the system maintains feasible operation despite perturbations from the uncertainty. Such a representation, however, can lead to a highly pessimistic view of flexibility since, in most engineering systems, there is often significant capacity to adapt to these perturbations as they occur.

In this work, we develop a novel black-box flexibility analysis method that addresses two of the key limitations of existing approaches. First, our method simultaneously accounts for the effect of uncertain parameters and resource variables, which is known to lead to a challenging tri-level “max-min-max” structure that is significantly harder to solve than traditional feasibility problems. Second, our method is specifically constructed to be data-efficient, meaning that it aims to minimize the number of calls to the black-box system model. The rationale for this second feature is that the simulation models mentioned above are computationally expensive (i.e., they require a long time and/or substantial resources to be evaluated), so that they are very likely to be the main bottleneck for the execution of the method. The proposed method builds upon the Bayesian optimization (BO) paradigm, which has become one of the most popular methods for data-efficient optimization of expensive black-box functions due to its practical success in several real-world applications [8-12]. The BO framework consists of two main components: (i) a predictive surrogate model of the black-box functions (equipped with uncertainty estimates) learned according to the Bayesian paradigm and (ii) an acquisition function that depends on the posterior distribution of the surrogate models, whose value at any point quantifies the benefit of evaluating the black-box functions at this point. However, standard BO methods only apply to single-level optimization problems. Therefore, in this work, we develop a novel extension of BO that is suitable for the types of tri-level optimization problems underpinning quantitative flexibility analysis. The basis for our proposed method is the construction of a quantile function that can be used to evaluate tight, high probability upper and lower confidence bounds for the flexibility metric. By defining an acquisition function in terms of this quantile function, we can efficiently identify the settings of the next expensive simulation that are most informative for classifying the system as flexible or not. The effectiveness of the proposed method is compared to several state-of-the-art alternatives (e.g., [13]) on a benchmark problem and a significantly more challenging heat exchanger network (HEN) problem. The results show that our new quantile-based BO method can provide more accurate upper and lower bounds on the true flexibility metric as well as identify better simulation settings to make sure those bounds converge at a faster rate than all tested alternative methods.

References:

[1] Halemane, Keshava Prasad, and Ignacio E. Grossmann. "Optimal process design under uncertainty." AIChE Journal 29.3 (1983): 425-433.

[2] Pistikopoulos, E. N., and T. A. Mazzuchi. "A novel flexibility analysis approach for processes with stochastic parameters." Computers & Chemical Engineering 14.9 (1990): 991-1000.

[3] Dimitriadis, Veniamin D., and Efstratios N. Pistikopoulos. "Flexibility analysis of dynamic systems." Industrial & Engineering Chemistry Research 34.12 (1995): 4451-4462.

[4] Boukouvala, Fani, and Marianthi G. Ierapetritou. "Feasibility analysis of black-box processes using an adaptive sampling Kriging-based method." Computers & Chemical Engineering 36 (2012): 358-368.

[5] Banerjee, Ipsita, Siladitya Pal, and Spandan Maiti. "Computationally efficient black-box modeling for feasibility analysis." Computers & chemical engineering 34.9 (2010): 1515-1521.

[6] Wang, Zilong, and Marianthi Ierapetritou. "A novel feasibility analysis method for black‐box processes using a radial basis function adaptive sampling approach." AIChE Journal 63.2 (2017): 532-550.

[7] Zhao, F., Grossmann, I. E., García‐Muñoz, S., & Stamatis, S. D. (2021). Flexibility index of black‐box models with parameter uncertainty through derivative‐free optimization. AIChE Journal, 67(5), e17189.

[8] Frazier, Peter I. "A tutorial on Bayesian optimization." arXiv preprint arXiv:1807.02811 (2018).

[9] Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., & De Freitas, N. (2015). Taking the human out of the loop: A review of Bayesian optimization. Proceedings of the IEEE, 104(1), 148-175.

[10] Paulson, Joel A., and Congwen Lu. "COBALT: COnstrained Bayesian optimizAtion of computationaLly expensive grey-box models exploiting derivaTive information." Computers & Chemical Engineering 160 (2022): 107700.

[11] Paulson, Joel A., Georgios Makrygiorgos, and Ali Mesbah. "Adversarially robust Bayesian optimization for efficient auto‐tuning of generic control structures under uncertainty." AIChE Journal 68.6 (2022): e17591.

[12] Sorourifar, F., Makrygirgos, G., Mesbah, A., & Paulson, J. A. (2021). A data-driven automatic tuning method for MPC under uncertainty using constrained Bayesian optimization. IFAC-PapersOnLine, 54(3), 243-250.

[13] Kudva, Akshay, and Joel A. Paulson. "A Bayesian optimization approach for data-efficient flexibility analysis of expensive black-box models." Foundations of Computer Aided Process Operations/Chemical Process Control (2023).