(346bl) Understanding the Chemical Machine Learning Design Space Using a Property Graph Database

Conference

AIChE Annual Meeting

Year

2020

Proceeding

2020 Virtual AIChE Annual Meeting

Group

Computational Molecular Science and Engineering Forum

Session

Poster Session: Computational Molecular Science and Engineering Forum (CoMSEF)

Time

Wednesday, November 18, 2020 - 8:00am to 9:00am

Authors

Luxon, A. - Presenter, Virginia Commonwealth University

Le, Q., Virginia Commonwealth University

Ferri, J. K., Virginia Commonwealth University

McQuade, T., Virginia Commonwealth University

Machine learning has been used extensively to predict molecular properties and design molecules [1-6]. When initializing a machine learning model, the modeler must make a series of decisions about how the model will operate. One must choose a learning algorithm (e.g., random forest vs. neural network), a featurization method (how the molecules in the data set will be described to the learning algorithm), training set size, hyperparameter values, validation method, etc. These decisions are not independent, and they affect both the cost and the efficacy of the machine learning model. In this work, we trained a series of machine learning models spanning a wide range of these choices. For example, one instance could be a random forest model with 10-fold cross-validation and Morgan molecular fingerprint featurization to predict logP. Each model was used to predict molecular properties and was evaluated based on error and prediction uncertainty. The model parameters, performance, and molecular datasets were stored in a property graph database (PGDB). Graph topology algorithms were used to identify model features, including molecular fragments, that most impact a model's performance. The PGDB enhances the explainability of machine learning models by enabling visualization and efficient queries of relationships between modeling choices, data, and model performance.
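
The idea of recording modeling choices as a graph can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name a database product or schema, so the `PropertyGraph` class, the relationship names (`USES_ALGORITHM`, `USES_FEATURIZER`, `VALIDATED_BY`, `PREDICTS`), and the option lists below are all hypothetical. A small in-memory structure stands in for a real PGDB, and no performance values are included because none are reported here.

```python
from itertools import product

# Hypothetical design-space axes. The abstract names these categories of
# choice, but this particular grid is an assumption made for illustration.
ALGORITHMS = ["random_forest", "neural_network"]
FEATURIZERS = ["morgan_fingerprint", "descriptor_set"]
VALIDATIONS = ["10_fold_cv", "holdout"]


class PropertyGraph:
    """Tiny in-memory stand-in for a property graph database.

    Nodes and edges both carry key/value properties, which is the
    defining trait of the property graph model.
    """

    def __init__(self):
        self.nodes = {}  # node_id -> {"label": str, "props": dict}
        self.edges = []  # (source_id, rel_type, target_id, props)

    def add_node(self, node_id, label, **props):
        # Reuse the node if it already exists so shared choices stay shared.
        self.nodes.setdefault(node_id, {"label": label, "props": props})
        return node_id

    def add_edge(self, source, rel_type, target, **props):
        self.edges.append((source, rel_type, target, props))

    def sources(self, node_id, rel_type):
        """Return nodes that point at node_id through rel_type."""
        return [s for s, r, t, _ in self.edges if t == node_id and r == rel_type]


def build_design_space(graph, target="logP"):
    """Add one ModelRun node per combination of modeling choices."""
    graph.add_node(target, "Property")
    combos = product(ALGORITHMS, FEATURIZERS, VALIDATIONS)
    for i, (alg, feat, val) in enumerate(combos):
        run = graph.add_node(f"run_{i}", "ModelRun")
        graph.add_edge(run, "USES_ALGORITHM", graph.add_node(alg, "Algorithm"))
        graph.add_edge(run, "USES_FEATURIZER", graph.add_node(feat, "Featurizer"))
        graph.add_edge(run, "VALIDATED_BY", graph.add_node(val, "Validation"))
        graph.add_edge(run, "PREDICTS", target)
    return graph


graph = build_design_space(PropertyGraph())

# Query: which runs used Morgan fingerprint featurization?
morgan_runs = graph.sources("morgan_fingerprint", "USES_FEATURIZER")
```

Because each modeling choice is a single shared node, asking which runs used a given featurizer is a one-hop traversal rather than a scan over a flat results table. Measured error and uncertainty could be attached as properties on each `ModelRun` node once models are trained.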

Topics

Computational Molecular Engineering

Physical Properties
