(509cs) Combining Uncertainty Metrics to Control Neural Network Error and Accelerate Chemical Exploration
AIChE Annual Meeting
2021
2021 Annual Meeting
Catalysis and Reaction Engineering Division
Poster Session: Catalysis and Reaction Engineering (CRE) Division
Wednesday, November 10, 2021 - 3:30pm to 5:00pm
Previous work using a NN to predict the energetics of small molecules showed that a K-nearest neighbors distance in the latent space could be used to predict error more accurately than the dropout method, and had comparable performance to the ensemble method while being more computationally tractable [1]. While promising, it is unknown how well these results for small molecules translate to solid-state materials and heterogeneous catalysis. Here we will answer this question, as well as examine two new latent space uncertainty metrics we dub the âlatent densityâ and âlatent probabilityâ. We also hypothesize that combining latent space metrics with input space metrics gives an even better indicator of prediction error than using purely latent space metrics.
We analyze different proposed latent space uncertainty metrics and compare their efficacy in controlling NN error on the Open Catalyst Dataset (OC20) [2]. We train a NN on a subset of reference data from OC20 and test the error prediction performance of existing NN uncertainty metrics (Monte-Carlo dropout, ensemble method, and feature space distances), latent space metrics (K-nearest neighbors distance, latent density, and latent probability), and metrics combining latent space and feature space information. Ultimately, developing better uncertainty estimation approaches for NNs for solid-state materials will enable wider spread adoption of ML for computational chemistry research, and accelerate novel materials exploration.
References:
[1] Janet, Duan, Yang, Nandy, and Kulik. âA quantitative uncertainty metric controls error in neural network-driven chemical discoveryâ, Chem. Sci., (2019), 10, 7913
[2] Ulissi et al. âThe Open Catalyst 2020 (OC20) Dataset and Community Challengesâ arXiv, (2021), 2010.09990