(15c) Inverse Mixed-Integer Optimization for Learning Interpretable Decision Rules
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Computing and Systems Technology Division
CAST Director's Student Presentation Award Finalists (Invited Talks)
Monday, November 6, 2023 - 1:06pm to 1:24pm
To accomplish this, we apply the concept of data-driven inverse optimization (IO) (Ahuja & Orlin, 2001; Aswani, Shen, & Siddiq, 2018; Chan, Mahmood, & Zhu, 2021). Mathematical optimization is a natural model for decision-making, assuming that the decision maker aims to make the best possible decision given a set of available options. This makes IO an ideal approach for learning unknown decision-making mechanisms as it finds an optimization model that can generate decisions close to the observed data. In our previous work (Gupta & Zhang, 2022, 2023), we have shown that IO is very data-efficient in learning interpretable models, making it an even more attractive learning paradigm.
Commonly used decision rules include ranking options, bounding values, and assigning causality, many of which involve logical relationships between variables. This naturally motivates the use of disjunctions and discrete variables (Raman & Grossmann, 1994) to model these logical relationships. Therefore, in this work, we propose an approach to learning human decision rules using inverse optimization with a mixed-integer linear program (MILP) as the forward problem.
Our approach formulates an MILP with constraints defining the hypothesis space of possible decision rules. Leveraging domain knowledge and the modeling flexibility of mixed-integer optimization, we include two types of constraints: 1) inherent physical limitations of the system faced by the decision maker, and 2) a superset of choices that the decision maker needs to consider while arriving at their decisions. We then use data-driven IO to find the weights that the decision maker assigns to each of the decisions in the objective function. This approach makes our model inherently interpretable as the values of these parameters being estimated have a direct meaning that reflects the decision policy. We apply the proposed approach to a supply chain planning problem to demonstrate the accuracy and interpretability of the resulting decision-making models.
References:
Ahuja, R. K., & Orlin, J. B. (2001). Inverse optimization. Operations Research, 49(5), 771â783.
Aswani, A., Shen, Z. J. M., & Siddiq, A. (2018). Inverse optimization with noisy data. Operations Research, 66(3), 870â892.
Chan, T. C. Y., Mahmood, R., & Zhu, I. Y. (2021). Inverse Optimization: Theory and Applications. arXiv preprint arXiv:2109.03920 (2021).
Gupta, R., & Zhang, Q. (2022). Decomposition and Adaptive Sampling for Data-Driven Inverse Linear Optimization. INFORMS Journal on Computing, 34(5), 2720â2735.
Gupta, R., & Zhang, Q. (2023). Efficient learning of decision-making models: A penalty block coordinate descent algorithm for data-driven inverse optimization. Computers and Chemical Engineering, 170(December 2022).
Raman, R., & Grossmann, I. E. (1994). Modelling and computational techniques for logic based integer programming. Computers and Chemical Engineering, 18(7), 563â578.