(622e) Sigma Profiles in Deep Learning: Towards a Universal Molecular Descriptor
AIChE Annual Meeting
2022
2022 Annual Meeting
Computational Molecular Science and Engineering Forum
Machine Learning for Soft Materials I
Thursday, November 17, 2022 - 2:00pm to 2:15pm
By virtue of being unnormalized histograms of screened charges, Ï-profiles encode a great deal of chemical information (charge density, polarity, etc.) and their size does not change with the size of the molecule, mitigating the disadvantages of the descriptors mentioned above. As such, this work showcases, for the first time, the ability of Ï-profiles to function as universal molecular descriptors in deep learning. To do so, the Ï-profiles of 1432 compounds were used to train convolutional neural networks (CNNs) that accurately correlate and predict a wide range of physicochemical properties (molar masses, normal boiling temperatures, vapor pressures, densities, refractive indexes, and aqueous solubilities). To boost their performance, the architecture and hyperparameters of each CNN were optimized using a battery of algorithms, particularly Bayesian Optimization and Local Search. Furthermore, it was shown that thermodynamic conditions, namely temperature, can also be used as additional inputs to broaden the applicability of the models. Among all other advantages mentioned, this work shows that Ï-profiles can extend the use of deep learning methodologies to areas where datasets are relatively small and scarce.