(372ah) Similarity-Based Machine Learning for Small Datasets; Application in Predicting Bio-Lubricant Properties

Conference

AIChE Annual Meeting

Year

2024

Proceeding

2024 AIChE Annual Meeting

Group

Computing and Systems Technology Division

Session

10B: Interactive Session: Systems and Process Control

Time

Tuesday, October 29, 2024 - 3:30pm to 5:00pm

Authors

Kim, J. Y. - Presenter, University of Delaware

Khan, S. A., University of Delaware

Vlachos, D., University of Delaware - Catalysis Center For Ener

Machine learning (ML) has been successfully applied to learn patterns in experimentally generated chemical data to predict molecular properties. However, experimental measurements can be expensive and, as a result, experimental data for several properties is scarce. Several ML methods face challenges when trained with limited data. Here, we introduce a similarity-based ML approach to efficiently train ML models on small datasets. We group molecules with similar structures, represented by molecular fingerprints, and use these groups to train separate ML models. We apply the methodology to predict kinematic viscosity of bio-lubricant base oil molecules at 40 °C (KV40). Our method shows noticeable improvement in model performance compared to transfer learning (TL) and standard Random Forest (RF) approach. Our methodology provides a robust framework for scenarios with limited data and can be readily generalized to a diverse range of molecular datasets.

Topics

Biorefineries

Physical Properties

Other Sites & Tools

Technical Groups

Technical

Professional/Personal Growth

Societal Needs

Leadership

2025 Spring Meeting and 21st Global Congress on Process Safety

2025 AIChE Annual Meeting

Upcoming Conferences & Events

CEP: January 2025

CEP: December 2024

Explore Areas of Advancement:

Learning Center:

Want to be an Entrepreneur? Personal Stories From Three Successful Entrepreneurs Who Have Traveled This Path.

(372ah) Similarity-Based Machine Learning for Small Datasets; Application in Predicting Bio-Lubricant Properties

AIChE Annual Meeting

2024

2024 AIChE Annual Meeting

Computing and Systems Technology Division

10B: Interactive Session: Systems and Process Control

Tuesday, October 29, 2024 - 3:30pm to 5:00pm

Authors

Topics

More Conference Links

Cancelation Policy

Code of Conduct

Beware of Hotel and Attendee-list Scams