(451g) A Transferable Diffusion Model for Coarse-Grained Backmapping

Conference

AIChE Annual Meeting

Year

2023

Proceeding

2023 AIChE Annual Meeting

Group

Computational Molecular Science and Engineering Forum

Session

Machine Learning for Soft and Hard Materials

Time

Tuesday, November 7, 2023 - 9:12am to 9:24am

Authors

Jones, M. - Presenter

Shmilovich, K., University of Chicago

Ferguson, A., University of Chicago

Coarse-grained molecular models of proteins permit access to length and time scales unattainable by all-atom models and enable the simulation of important processes that occur on long time scales such as aggregation and folding. The reduced resolution of the coarse-grained models enables realization of computational accelerations, but sacrifices the atomistic resolution that can be vital for a complete understanding of the mechanistic details. Backmapping is the process of restoring the all-atom details to coarse-grained molecular representations in order to recover atomistic-level insight. Conventional backmapping approaches generate initial all-atom structures based on geometric rules and then apply energy relaxation to eliminate aphysical high-energy overlaps and produce stable all-atom configurations. The need for energy minimization makes these procedures typically quite expensive and slow. Recently, data-driven approaches have demonstrated great promise in furnishing trainable models to efficiently perform backmapping of small molecules and proteins. In this work, we report a novel backmapping approach based on autoregressive denoising diffusion probability models to restore all-atom details to coarse-grained simulations represented only by C-alpha coordinates. The generation process is conditioned on the coarse-grained protein configuration and any previously backmapped side chains in an autoregressive fashion in order to avoid steric clashes. As an inherently transferable and local model, it is scalable to proteins of arbitrary size with linear scaling. We train the model on over 100K proteins in the SidechainNet training data set and demonstrate state-of-the-art performance on systems including DE Shaw training trajectories of fast-folding mini-proteins, ensembles of intrinsically-disordered proteins, and randomly sampled selections from the Protein Data Bank. Furthermore, we demonstrate that fine-tuning the transferable model on a given system can further improve performance in recapitulating protein-specific sidechain distributions. We make the backmapping tool available as a free, open source Python package.

Topics

Computational Molecular Engineering

Other Sites & Tools

Technical Groups

Technical

Professional/Personal Growth

Societal Needs

Leadership

2024 Annual Safety in Ammonia Plants and Related Facilities Symposium

4th Optogenetic Technologies and Applications Conference

Upcoming Conferences & Events

Procesa 2024: 6th AIChE Latin America Student Regional Conference

2024 Indonesia Student Regional Conference

CCPS Workshop on Process Safety Metrics: API-RP-754 Implementation

University of Houston Student Process Safety Bootcamp

2024 Annual Safety in Ammonia Plants and Related Facilities Symposium

9th CCPS Canadian Regional Meeting

4th Optogenetic Technologies and Applications Conference

tcbiomass 2024

AIChE 2024 Virtual Career Fair for Professionals

CEP: August 2024

CEP: July 2024

Explore Areas of Advancement:

Learning Center:

Want to be an Entrepreneur? Personal Stories From Three Successful Entrepreneurs Who Have Traveled This Path.

(451g) A Transferable Diffusion Model for Coarse-Grained Backmapping

AIChE Annual Meeting

2023

2023 AIChE Annual Meeting

Computational Molecular Science and Engineering Forum

Machine Learning for Soft and Hard Materials

Tuesday, November 7, 2023 - 9:12am to 9:24am

Authors

Topics

More Conference Links

Visit Orlando

Universal Studios Offer

Cancellation Policy

Code of Conduct

Beware of Hotel and Attendee-list Scams

Code of Conduct

Beware of Hotel and Attendee-list Scams