Machine Learning for Predicting Cradle-to-Gate Life Cycle Inventory (LCI) Data
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Annual Student Conference: Competitions & Events
Undergraduate Student Poster Session: Computing and Process Control
Monday, November 6, 2023 - 10:00am to 12:30pm
In this work, a ML algorithm was developed to predict cradle-to-gate life cycle inventory (LCI) data for both pre-existing and novel chemicals. The LCI data predicted by the algorithm focused on four different environmental metrics: human health (HH), ecosystem quality (EQ), global warming potential (GWP), and recourse utilization (RU). Two variants of the algorithm were developed: one employed Artificial Neural Networks (ANN), and the other employed eXtreme Gradient Boosting (XGBoost). Both variants used over 350 data points, split between a training, testing, and validation set. Data was sourced from EcoInvent, a chemical information database, and the feature set included 200 molecular descriptors and 16 thermodynamic properties for each datapoint. A stepwise feature selection process was used to reduce the number of features from 216 to 10. Following hyperparameter tuning, the performance of both variants was assessed on the test set using the R-squared and Root-Mean-Squared-Error (RMSE) values. Following this analysis, it was determined that the XGBoost variant was more effective for the HH, GWP, and EQ metrics, producing RMSE values equaling 2.73, 1.18, and 0.574, respectively. In addition to the algorithm itself, a case study representing the extraction of polyphenols from wine pomace using acetone solvent was also implemented to demonstrate the utility of the LCI prediction algorithm in enabling the creation of a detailed cradle-to-grave Life Cycle Analysis of the entire process.