(116b) Leveraging Small Data for Better Chemistry
AIChE Spring Meeting and Global Congress on Process Safety
2023
2023 Spring Meeting and 19th Global Congress on Process Safety
Industry 4.0 Topical Conference
Data-Driven and Hybrid Approaches to Development of New Products I
Wednesday, March 15, 2023 - 8:30am to 9:00am
Data-driven machine-learning (ML) offers the possibility to dramatically reduce the resources needed for reaction optimization through supervised-learning models that predict reaction performance, catalytic behavior, and optimal reaction conditions. However, most ML algorithms require large datasets (i.e. 100âs-1000âs of points) to provide accurate predictions, while experimental data collection is expensive in human and material resources. The challenge of building large datasets can be averted with Bayesian Optimization frameworks that guide experimental campaigns through informed and smart decisions. Predictions are thus based on carefully selected training data, which ultimately reduces the number of experiments required to achieve reliable understanding of variable effects and reaction behavior.
Addressing fundamental questions regarding the limitations of a modified Bayesian Optimization approach, we elucidate a pathway for the design and implementation of intelligent experimental campaigns using an organic electrocatalytic cross-coupling transformation as a model reaction. The approach has been further validated on catalytic biomass conversion, dehydrogenation processes, and mechanocatalytic transformations, among others. Working at the intersection of ML and chemistry, we created SuntheticsML, an online, user-friendly, ML platform that leverages very small data (starting with only 5 data points) to enable the development of new chemical processes, materials, and formulations up to 15 times faster
without the need of coding or ML skills.
The insights from this work innovate on the future of reaction engineering, material design, and process optimization while lowering the barrier to the implementation of new chemical technologies. SuntheticsML accelerates lengthy and complex traditional optimization campaigns, harnessing the power of ML and small data to fast-track innovation, sustainability, and digitalization in the chemical industry, further reducing development waste, emissions, and resource consumption by up to 95%