(346ay) How Can Machine Learning Accelerate the Sampling and Interpretation of Molecular Dynamics Simulations?
AIChE Annual Meeting
2020
2020 Virtual AIChE Annual Meeting
Computational Molecular Science and Engineering Forum
Poster Session: Computational Molecular Science and Engineering Forum (CoMSEF)
Wednesday, November 18, 2020 - 8:00am to 9:00am
to key metastable states that the protein adopts. Second, membrane protein simulations require access to large computational resources, of the order of several 100,000 hours on a single Nvidia GTX1080 GPU using a popular molecular simulation package such as NAMD. Finally, it is difficult to extract valuable insights from the resulting high-dimensional simulation data (several terabytes).
In this study, we aim at developing efficient computational tools to accelerate the sampling and interpretation of molecular dynamics simulations. To address the absence of structural information, we developed a machine learning based algorithm (FingerprintContacts) to quickly predict multiple protein structures by combining agglomerative clustering and co-evolutionary information. We have demonstrated the capabilities of FingerprintContacts on eight proteins with varying conformational motions. To enhance the sampling efficiency, we proposed that evolutionary couplings can be used as reaction coordinates to efficiently guide the sampling of complex conformational free energy landscapes. To interpret the resulting high-dimensional simulation data, we developed a genetic algorithm based method to automatically select features for dimensionality reduction. The integration of the developed algorithms and all-atom molecular dynamics simulations has allowed us to characterize long timescale conformational transitions and the complete substrate translocation cycle of two nitrogen transporters. This work would establish efficient computational frameworks for understanding long timescale biophysical processes.