(197ai) Effect of Molecular Structure on the HOMO-LUMO GAP Using Atomic Signatures As Molecular Descriptors
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Computational Molecular Science and Engineering Forum
Poster Session: Computational Molecular Science and Engineering Forum
Monday, November 6, 2023 - 3:30pm to 5:00pm
Compounds with low HOMOâLUMO gap Egap are in need as they are kinetically unstable making them reactive and desirable for different applications. However, the search for new organic compounds with a low Egap is an expensive endeavor due to the exponentially increasing pool of virtual compounds. Thus, there is a need to construct a robust, predictive, and computationally inexpensive quantitate structure property relationship (QSPR) to identify and screen for structures possessing low Egap. Herein, a QSPR is proposed that utilizes atomic Signatures as molecular descriptors correlating the molecular structure with the Egap. A dataset consisting of 112 organic compounds was used to construct the QSPR using forward stepping multilinear regression (FSMLR) with leave-one-out cross validation (LOOCV), achieving a regression coefficient (r2) and predictability (q2) of 0.86 and 0.76, respectively. The QSPR was able to infer relations between different structural motifs (i.e. atomic Signatures) and the Egap. It was found that the atomic Signature encapsulating aromatic bonds, [C](p[C]p[C][H]), was able to explain around 50% of the variance in the data, showing its significance in affecting the property of interest. The five most significant Signatures were found to negatively affect the Egap, due to them encapsulating Ïâbonds, heteroatoms, and aromatics. This is attributed to high electron density, making these structural motifs reactive sites, specifically due to Ï â Ï interactions, lone pair electrons, and/or delocalized Ïâbonds. In constant the presence of terminal nitrogen or internal alkane chain groups causes an increase in the Egap, which can be attributed to steric hindrance and nonâconjugated bonding. Finally, an external test set were used to further elucidate the predictive ability of the model â an r2 of 0.91 was achieved. Overall, the constructed QSPR can be used as a reliable initial screening tool to identify potential candidates possessing low Egap values