| Literature DB >> 32668151 |
Abstract
Coulomb matrix eigenvalues (CMEs) are global 3D representations of molecular structure, which have been previously used to predict atomization energies, prioritize geometry searches, and interpret rotational spectra. The properties of the CME representation and its relationship to molecular structure are established using the Gershgorin circle theorem. Numerical bounds are studied using a data set of 309 000 conformational samples of all constitutional isomers of acyclic alkanes, CnH2n+2, from methane (n = 1) to undecane (n = 11), to establish the extent to which the CME preserves chemical intuitions about isomer and conformer similarity and its ability to distinguish constitutional isomers. Neither supervised nor unsupervised machine-learning algorithms can perfectly distinguish constitutional isomers as the molecular size increases, but the misclassification rate can be kept below 1%.Entities:
Mesh:
Year: 2020 PMID: 32668151 DOI: 10.1021/acs.jcim.0c00631
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956