Literature DB >> 15807504

LINGO, an efficient holographic text based method to calculate biophysical properties and intermolecular similarities.

David Vidal1, Michael Thormann, Miquel Pons.   

Abstract

SMILES strings are the most compact text based molecular representations. Implicitly they contain the information needed to compute all kinds of molecular structures and, thus, molecular properties derived from these structures. We show that this implicit information can be accessed directly at SMILES string level without the need to apply explicit time-consuming conversion of the SMILES strings into molecular graphs or 3D structures with subsequent 2D or 3D QSPR calculations. Our method is based on the fragmentation of SMILES strings into overlapping substrings of a defined size that we call LINGOs. The integral set of LINGOs derived from a given SMILES string, the LINGO profile, is a hologram of the SMILES representation of the molecule described. LINGO profiles provide input for QSPR models and the calculation of intermolecular similarities at very low computational cost. The octanol/water partition coefficient (LlogP) QSPR model achieved a correlation coefficient R2=0.93, a root-mean-square error RRMS=0.49 log units, a goodness of prediction correlation coefficient Q2=0.89 and a QRMS=0.61 log units. The intrinsic aqueous solubility (LlogS) QSPR model achieved correlation coefficient values of R2=0.91, Q2=0.82, and RRMS=0.60 and QRMS=0.89 log units. Integral Tanimoto coefficients computed from LINGO profiles provided sharp discrimination between random and bioisoster pairs extracted from Accelrys Bioster Database. Average similarities (LINGOsim) were 0.07 for the random pairs and 0.36 for the bioisosteric pairs.

Entities:  

Mesh:

Year:  2005        PMID: 15807504     DOI: 10.1021/ci0496797

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  31 in total

1.  CORAL: QSPR models for solubility of [C60] and [C70] fullerene derivatives.

Authors:  Alla P Toropova; Andrey A Toropov; Emilio Benfenati; Giuseppina Gini; Danuta Leszczynska; Jerzy Leszczynski
Journal:  Mol Divers       Date:  2010-03-27       Impact factor: 2.943

Review 2.  Methods for Similarity-based Virtual Screening.

Authors:  Thomas G Kristensen; Jesper Nielsen; Christian N S Pedersen
Journal:  Comput Struct Biotechnol J       Date:  2013-03-03       Impact factor: 7.271

3.  QSAR-modeling of toxicity of organometallic compounds by means of the balance of correlations for InChI-based optimal descriptors.

Authors:  A A Toropov; A P Toropova; E Benfenati
Journal:  Mol Divers       Date:  2009-05-19       Impact factor: 2.943

4.  QSAR modelling of carcinogenicity by balance of correlations.

Authors:  A A Toropov; A P Toropova; E Benfenati; A Manganaro
Journal:  Mol Divers       Date:  2009-02-04       Impact factor: 2.943

5.  An improved scoring function for suboptimal polar ligand complexes.

Authors:  Giovanni Cincilla; David Vidal; Miquel Pons
Journal:  J Comput Aided Mol Des       Date:  2008-10-09       Impact factor: 3.686

6.  QSAR modelling of the toxicity to Tetrahymena pyriformis by balance of correlations.

Authors:  A A Toropov; A P Toropova; E Benfenati; A Manganaro
Journal:  Mol Divers       Date:  2009-08-14       Impact factor: 2.943

7.  Error bounds on the SCISSORS approximation method.

Authors:  Imran S Haque; Vijay S Pande
Journal:  J Chem Inf Model       Date:  2011-09-08       Impact factor: 4.956

8.  Transformer-CNN: Swiss knife for QSAR modeling and interpretation.

Authors:  Pavel Karpov; Guillaume Godin; Igor V Tetko
Journal:  J Cheminform       Date:  2020-03-18       Impact factor: 5.514

9.  Shallow Representation Learning via Kernel PCA Improves QSAR Modelability.

Authors:  Stefano E Rensi; Russ B Altman
Journal:  J Chem Inf Model       Date:  2017-08-07       Impact factor: 4.956

10.  SCISSORS: practical considerations.

Authors:  Steven M Kearnes; Imran S Haque; Vijay S Pande
Journal:  J Chem Inf Model       Date:  2013-12-16       Impact factor: 4.956

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.