Literature DB >> 15961479

Kernels for small molecules and the prediction of mutagenicity, toxicity and anti-cancer activity.

S Joshua Swamidass1, Jonathan Chen, Jocelyne Bruand, Peter Phung, Liva Ralaivola, Pierre Baldi.   

Abstract

MOTIVATION: Small molecules play a fundamental role in organic chemistry and biology. They can be used to probe biological systems and to discover new drugs and other useful compounds. As increasing numbers of large datasets of small molecules become available, it is necessary to develop computational methods that can deal with molecules of variable size and structure and predict their physical, chemical and biological properties.
RESULTS: Here we develop several new classes of kernels for small molecules using their 1D, 2D and 3D representations. In 1D, we consider string kernels based on SMILES strings. In 2D, we introduce several similarity kernels based on conventional or generalized fingerprints. Generalized fingerprints are derived by counting in different ways subpaths contained in the graph of bonds, using depth-first searches. In 3D, we consider similarity measures between histograms of pairwise distances between atom classes. These kernels can be computed efficiently and are applied to problems of classification and prediction of mutagenicity, toxicity and anti-cancer activity on three publicly available datasets. The results derived using cross-validation methods are state-of-the-art. Tradeoffs between various kernels are briefly discussed. AVAILABILITY: Datasets available from http://www.igb.uci.edu/servers/servers.html

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15961479     DOI: 10.1093/bioinformatics/bti1055

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval.

Authors:  S Joshua Swamidass; Chloé-Agathe Azencott; Kenny Daily; Pierre Baldi
Journal:  Bioinformatics       Date:  2010-04-07       Impact factor: 6.937

2.  Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sublinear time.

Authors:  S Joshua Swamidass; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2007-02-28       Impact factor: 4.956

3.  Lossless compression of chemical fingerprints using integer entropy codes improves storage and retrieval.

Authors:  Pierre Baldi; Ryan W Benz; Daniel S Hirschberg; S Joshua Swamidass
Journal:  J Chem Inf Model       Date:  2007-10-30       Impact factor: 4.956

Review 4.  Novel paradigms for drug discovery: computational multitarget screening.

Authors:  Ekachai Jenwitheesuk; Jeremy A Horst; Kasey L Rivas; Wesley C Van Voorhis; Ram Samudrala
Journal:  Trends Pharmacol Sci       Date:  2008-01-10       Impact factor: 14.819

5.  Naïve Bayesian Models for Vero Cell Cytotoxicity.

Authors:  Alexander L Perryman; Jimmy S Patel; Riccardo Russo; Eric Singleton; Nancy Connell; Sean Ekins; Joel S Freundlich
Journal:  Pharm Res       Date:  2018-06-29       Impact factor: 4.200

6.  Machine learning assisted design of highly active peptides for drug discovery.

Authors:  Sébastien Giguère; François Laviolette; Mario Marchand; Denise Tremblay; Sylvain Moineau; Xinxia Liang; Éric Biron; Jacques Corbeil
Journal:  PLoS Comput Biol       Date:  2015-04-07       Impact factor: 4.475

7.  Heterogeneous biomedical database integration using a hybrid strategy: a p53 cancer research database.

Authors:  Vadim Y Bichutskiy; Richard Colman; Rainer K Brachmann; Richard H Lathrop
Journal:  Cancer Inform       Date:  2007-02-20

Review 8.  Machine learning for in silico virtual screening and chemical genomics: new strategies.

Authors:  Jean-Philippe Vert; Laurent Jacob
Journal:  Comb Chem High Throughput Screen       Date:  2008-09       Impact factor: 1.339

9.  A constructive approach for discovering new drug leads: Using a kernel methodology for the inverse-QSAR problem.

Authors:  William Wl Wong; Forbes J Burkowski
Journal:  J Cheminform       Date:  2009-04-28       Impact factor: 5.514

10.  Estimation of the applicability domain of kernel-based machine learning models for virtual screening.

Authors:  Nikolas Fechner; Andreas Jahn; Georg Hinselmann; Andreas Zell
Journal:  J Cheminform       Date:  2010-03-11       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.