Literature DB >> 33797669

LINGO-DL: a text-based approach for molecular similarity searching.

Ammar Abdo1, Maude Pupin2.   

Abstract

The line notations of chemical structures are more compact than those of graphs and connection tables, so they can be useful for storing and transferring a large number of molecular structures. The simplified molecular input line system (SMILES) representation is the most extensively used, as it is much easier to utilise and comprehend than others, and it can be generated automatically from connection tables. A SMILES represents and encodes the molecule structure. It has been used by an existing method, LINGO, to calculate the molecular similarities and predict the structure-related properties. The LINGO method decomposes a canonical SMILES into a set of substrings of four characters referred to as LINGOs. The purpose of LINGO method is to measure the similarity between a pair of molecules by comparing the LINGOs that occur in each molecule. This paper aims to introduce an alternative version of the LINGO method using LINGOs of different lengths, called LINGO-DL. LINGO-DL is based on the fragmentation of canonical SMILES into substrings of three different lengths rather than one in LINGO method. Retrospective virtual screening experiments with MDDR, DUD, and MUV datasets show that the LINGO-DL outperforms the LINGO method, especially when the active molecules being sought have a high degree of structural heterogeneity.

Entities:  

Keywords:  Drug discovery; LINGO; Ligand-based virtual screening; Molecular fingerprints; SMILES

Mesh:

Substances:

Year:  2021        PMID: 33797669     DOI: 10.1007/s10822-021-00383-9

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  21 in total

1.  Similarity searching in files of three-dimensional chemical structures: analysis of the BIOSTER database using two-dimensional fingerprints and molecular field descriptors

Authors: 
Journal:  J Chem Inf Comput Sci       Date:  2000-03

Review 2.  Why do we need so many chemical similarity search methods?

Authors:  Robert P Sheridan; Simon K Kearsley
Journal:  Drug Discov Today       Date:  2002-09-01       Impact factor: 7.851

Review 3.  Molecular similarity: a key technique in molecular informatics.

Authors:  Andreas Bender; Robert C Glen
Journal:  Org Biomol Chem       Date:  2004-10-14       Impact factor: 3.876

Review 4.  Similarity-based virtual screening using 2D fingerprints.

Authors:  Peter Willett
Journal:  Drug Discov Today       Date:  2006-10-20       Impact factor: 7.851

Review 5.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

6.  How similar are similarity searching methods? A principal component analysis of molecular descriptor space.

Authors:  Andreas Bender; Jeremy L Jenkins; Josef Scheiber; Sai Chetan K Sukuru; Meir Glick; John W Davies
Journal:  J Chem Inf Model       Date:  2009-01       Impact factor: 4.956

7.  SYBYL line notation (SLN): a single notation to represent chemical structures, queries, reactions, and virtual libraries.

Authors:  R Webster Homer; Jon Swanson; Robert J Jilek; Tad Hurst; Robert D Clark
Journal:  J Chem Inf Model       Date:  2008-12       Impact factor: 4.956

Review 8.  Computational methods in drug discovery.

Authors:  Gregory Sliwoski; Sandeepkumar Kothiwale; Jens Meiler; Edward W Lowe
Journal:  Pharmacol Rev       Date:  2013-12-31       Impact factor: 25.468

9.  Selecting optimally diverse compounds from structure databases: a validation study of two-dimensional and three-dimensional molecular descriptors.

Authors:  H Matter
Journal:  J Med Chem       Date:  1997-04-11       Impact factor: 7.446

Review 10.  A review of ligand-based virtual screening web tools and screening algorithms in large molecular databases in the age of big data.

Authors:  Antonio-Jesús Banegas-Luna; José P Cerón-Carrasco; Horacio Pérez-Sánchez
Journal:  Future Med Chem       Date:  2018-11-30       Impact factor: 3.808

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.