(12f) Bonsai: A Hyper-Sample-Efficient Framework for Robust Global Optimization of Expensive Function Network Systems Under Uncertainty | AIChE

(12f) Bonsai: A Hyper-Sample-Efficient Framework for Robust Global Optimization of Expensive Function Network Systems Under Uncertainty

Authors 

Tang, W. T., The Ohio State University
Donnelly, K., West Virginia University
Paulson, J., The Ohio State University
Uncertainty is inevitably present in real-world design problems due to several sources including noisy and incomplete data, unknown or fluctuating model parameters, implementation errors, and environmental perturbations/disturbances (e.g., product demand, prices, and material availability). Thus, “optimal” solutions identified by solving a nominal design problem neglecting the impact of these uncertainties are potentially suboptimal or even infeasible when implemented in practice. Robust optimization is a framework that looks to overcome such limitations by searching for the robust optimal design that achieves the best worst-case value of the objective while satisfying the constraints for all possible uncertainties [1, 2, 3]. There has been a significant amount of work on developing new algorithms for robust optimization in recent years; however, a significant gap remains between theory and practice on many types of problems encountered in the real world. One of the biggest challenges is that most available robust optimization methods (especially those capable of efficiently finding the global solution) are only applicable to a narrow class of problems that exploit a large amount of problem structure such as convexity [4, 5]. As such, these algorithms are not applicable to high-fidelity simulation software (also known as “digital twins” [6]), which have quickly grown in scope due to advances in computing and increased availability of modeling software. Even though digital twins have been developed in a variety of domains including cyber-physical systems [7], robotics [8], fluid dynamics [9], building energy systems [10], and spacecraft systems [11], there has been very limited work on their use for robust design purposes due to their expensive black-box nature. Most work has instead focused on the use of high-fidelity digital twins for monitoring and/or validation of approximate design methodologies, which does not take full advantage of their powerful capabilities to extrapolate to completely new (unobserved) situations.

Previously, we developed the adversarially robust Bayesian optimization (ARBO) framework as a sample-efficient derivative-free robust optimization approach for black-box functions [12, 13]. ARBO extends traditional Bayesian optimization to explicitly consider the impact of uncertainties during the optimization process; its key innovation is the joint modeling of the dependence of the unknown functions on the design and uncertain variables, leading to a significant improvement in the choice of new query points at every iteration. ARBO has been successfully applied to a diverse set of problems involving digital twins, e.g., genome-scale reactor design [13], MPC tuning [14], and heat exchanger network flexibility tests [15]. However, as commonly observed in surrogate-based optimization routines, ARBO’s performance tends to degrade as the dimensionality of the problem increases. This performance degradation is a direct consequence of the assumption that the objective function is a black box; this generality comes at the cost performance and is rarely needed in practice since there is at least partial knowledge that can be exploited by the algorithm. Previous work has shown that significant performance gains can be achieved by leveraging a grey-box perspective within Bayesian optimization [16, 17]; however, these methods are applicable to single-level composite functions and do not consider any form of uncertainty. We look to develop a method that can straightforwardly address both of these shortcomings here.

In this work, we propose a new algorithm called Bayesian Optimization of Network Systems under uncertAInty (BONSAI) that significantly improves the sample efficiency of ARBO by exploiting prior structural knowledge available in simulation-based robust optimization problems. Instead of assuming the objective is a black-box function, BONSAI represents the objective as a directed acyclic graph (DAG) where each node can be either a black- or white-box function, enabling it to leverage intermediate information in the graph (which is currently ignored by ARBO). Note that the DAG representation is a natural way to express information flow in most engineering problems. Take, for example, a chemical manufacturing plant that converts raw materials into a set of desired products – initial, middle, and final nodes can capture the conversion, separation, and purification processes, respectively, all of which have their own local variables and share some subset of the complete set of variables in the plant. Similar graph abstractions have been developed for nonlinear optimization; however, they require an equation-based model at every node [18]. BONSAI, on the other hand, “fills in the blanks” by modeling the black-box nodes using Gaussian processes (GPs). Since the implied posterior of the objective function obtained by passing the GPs through the DAG is non-Gaussian, we cannot compute the standard acquisition functions (such as upper confidence bound) in closed form. We overcome this challenge in BONSAI by proposing a novel (scalable) Thompson sampling-based acquisition function that can be efficiently optimized using state-of-the-art gradient-based methods. Through numerical experiments on multiple synthetic and real-world problems, we demonstrate that BONSAI can dramatically accelerate optimization compared to competing methods such as ARBO (up to multiple orders of magnitude in some cases). It is worth highlighting that such improvements come simply from incorporating information that is readily available inside of the simulators but is traditionally neglected by other methods. Therefore, BONSAI provides a promising (general) path toward unlocking the full power of computationally expensive digital twins for systematic design in the presence of uncertainty.

References:

[1] Bertsimas, D., Nohadani, O., & Teo, K. M. (2010). Robust optimization for unconstrained simulation-based problems. Operations research, 58(1), 161-178.

[2] Vázquez, F. G., Rückmann, J. J., Stein, O., & Still, G. (2008). Generalized semi-infinite programming: a tutorial. Journal of computational and applied mathematics, 217(2), 394-419.

[3] Beyer, H. G., & Sendhoff, B. (2007). Robust optimization–a comprehensive survey. Computer methods in applied mechanics and engineering, 196(33-34), 3190-3218.

[4] Žaković, S., Pantelides, C., & Rustem, B. (2000). An interior point algorithm for computing saddle points of constrained continuous minimax. Annals of Operations Research, 99, 59-77.

[5] Mitsos, A., & Tsoukalas, A. (2015). Global optimization of generalized semi-infinite programs via restriction of the right hand side. Journal of Global Optimization, 61, 1-17.

[6] Niederer, S. A., Sacks, M. S., Girolami, M., & Willcox, K. (2021). Scaling digital twins from the artisanal to the industrial. Nature Computational Science, 1(5), 313-320.

[7] Fritzson, P. (2011, July). Modelica—A cyber-physical modeling language and the OpenModelica environment. In 2011 7th International Wireless Communications and Mobile Computing Conference (pp. 1648-1653). IEEE.

[8] Hu, Y., Liu, J., Spielberg, A., Tenenbaum, J. B., Freeman, W. T., Wu, J., Rus, D., & Matusik, W. (2019, May). Chainqueen: A real-time differentiable physical simulator for soft robotics. In 2019 International conference on robotics and automation (ICRA) (pp. 6265-6271). IEEE.

[9] Jasak, H. (2009). OpenFOAM: Open source CFD in research and industry. International Journal of Naval Architecture and Ocean Engineering, 1(2), 89-94.

[10] Wetter, M., Zuo, W., Nouidui, T. S., & Pang, X. (2014). Modelica buildings library. Journal of Building Performance Simulation, 7(4), 253-270.

[11] McCamish, S., & Romano, M. (2007). Simulation of relative multiple spacecraft dynamics and control with MATLAB-SIMULINK and Satellite Tool Kit. In AIAA Modeling and Simulation Technologies Conference and Exhibit (p. 6805).

[12] Paulson, J. A., Makrygiorgos, G., & Mesbah, A. (2022). Adversarially robust Bayesian optimization for efficient auto‐tuning of generic control structures under uncertainty. AIChE Journal, 68(6), e17591.

[13] Kudva, A., Sorourifar, F., & Paulson, J. A. (2022). Constrained robust Bayesian optimization of expensive noisy black‐box functions with guaranteed regret bounds. AIChE Journal, 68(12), e17857.

[14] Kudva, A., Sorouifar, F., & Paulson, J. A. (2022, June). Efficient robust global optimization for simulation-based problems using decomposed Gaussian processes: Application to MPC calibration. In 2022 American Control Conference (ACC) (pp. 2091-2097). IEEE.

[15] Kudva, A., Tang, W. T., & Paulson, J. A. (2024). Robust Bayesian optimization for flexibility analysis of expensive simulation-based models with rigorous uncertainty bounds. Computers & Chemical Engineering, 181, 108515.

[16] Astudillo, R., & Frazier, P. (2019, May). Bayesian optimization of composite functions. In International Conference on Machine Learning (pp. 354-363). PMLR.

[17] Paulson, J. A., & Lu, C. (2022). COBALT: COnstrained Bayesian optimizAtion of computationaLly expensive grey-box models exploiting derivaTive information. Computers & Chemical Engineering, 160, 107700.

[18] Cole, D. L., Shin, S., & Zavala, V. M. (2022). A Julia Framework for Graph-Structured Nonlinear Optimization. Industrial & Engineering Chemistry Research, 61(26), 9366-9380.