Literature DB >> 28429572

Finding Chemical Structures Corresponding to a Set of Coordinates in Chemical Descriptor Space.

Tomoyuki Miyao1, Kimito Funatsu1.   

Abstract

When chemical structures are searched based on descriptor values, or descriptors are interpreted based on values, it is important that corresponding chemical structures actually exist. In order to consider the existence of chemical structures located in a specific region in the chemical space, we propose to search them inside training data domains (TDDs), which are dense areas of a training dataset in the chemical space. We investigated TDDs' features using diverse and local datasets, assuming that GDB11 is the chemical universe. These two analyses showed that considering TDDs gives higher chance of finding chemical structures than a random search-based method, and that novel chemical structures actually exist inside TDDs. In addition to those findings, we tested the hypothesis that chemical structures were distributed on the limited areas of chemical space. This hypothesis was confirmed by the fact that distances among chemical structures in several descriptor spaces were much shorter than those among randomly generated coordinates in the training data range.
© 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  chemical structure existence; inverse QSPR/QSAR; machine-learning

Mesh:

Substances:

Year:  2017        PMID: 28429572     DOI: 10.1002/minf.201700030

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  1 in total

1.  CReM: chemically reasonable mutations framework for structure generation.

Authors:  Pavel Polishchuk
Journal:  J Cheminform       Date:  2020-04-22       Impact factor: 5.514

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.