Literature DB >> 17125180

Chemoinformatics-based classification of prohibited substances employed for doping in sport.

Edward O Cannon1, Andreas Bender, David S Palmer, John B O Mitchell.   

Abstract

Representative molecules from 10 classes of prohibited substances were taken from the World Anti-Doping Agency (WADA) list, augmented by molecules from corresponding activity classes found in the MDDR database. Together with some explicitly allowed compounds, these formed a set of 5245 molecules. Five types of fingerprints were calculated for these substances. The random forest classification method was used to predict membership of each prohibited class on the basis of each type of fingerprint, using 5-fold cross-validation. We also used a k-nearest neighbors (kNN) approach, which worked well for the smallest values of k. The most successful classifiers are based on Unity 2D fingerprints and give very similar Matthews correlation coefficients of 0.836 (kNN) and 0.829 (random forest). The kNN classifiers tend to give a higher recall of positives at the expense of lower precision. A naïve Bayesian classifier, however, lies much further toward the extreme of high recall and low precision. Our results suggest that it will be possible to produce a reliable and quantitative assignment of membership or otherwise of each class of prohibited substances. This should aid the fight against the use of bioactive novel compounds as doping agents, while also protecting athletes against unjust disqualification.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17125180     DOI: 10.1021/ci0601160

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  8 in total

1.  Towards interoperable and reproducible QSAR analyses: Exchange of datasets.

Authors:  Ola Spjuth; Egon L Willighagen; Rajarshi Guha; Martin Eklund; Jarl Es Wikberg
Journal:  J Cheminform       Date:  2010-06-30       Impact factor: 5.514

2.  Predicting targets of compounds against neurological diseases using cheminformatic methodology.

Authors:  Katarina Nikolic; Lazaros Mavridis; Oscar M Bautista-Aguilera; José Marco-Contelles; Holger Stark; Maria do Carmo Carreiras; Ilaria Rossi; Paola Massarelli; Danica Agbaba; Rona R Ramsay; John B O Mitchell
Journal:  J Comput Aided Mol Des       Date:  2014-11-26       Impact factor: 3.686

3.  Predicting phospholipidosis using machine learning.

Authors:  Robert Lowe; Robert C Glen; John B O Mitchell
Journal:  Mol Pharm       Date:  2010-09-10       Impact factor: 4.939

4.  Influence relevance voting: an accurate and interpretable virtual high throughput screening method.

Authors:  S Joshua Swamidass; Chloé-Agathe Azencott; Ting-Wan Lin; Hugo Gramajo; Shiou-Chuan Tsai; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2009-04       Impact factor: 4.956

5.  Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds.

Authors:  Edward O Cannon; Ata Amini; Andreas Bender; Michael J E Sternberg; Stephen H Muggleton; Robert C Glen; John B O Mitchell
Journal:  J Comput Aided Mol Des       Date:  2007-03-27       Impact factor: 4.179

6.  A novel hybrid ultrafast shape descriptor method for use in virtual screening.

Authors:  Edward O Cannon; Florian Nigsch; John B O Mitchell
Journal:  Chem Cent J       Date:  2008-02-18       Impact factor: 4.215

7.  Predicting the protein targets for athletic performance-enhancing substances.

Authors:  Lazaros Mavridis; John Bo Mitchell
Journal:  J Cheminform       Date:  2013-06-25       Impact factor: 5.514

8.  Machine learning methods in chemoinformatics.

Authors:  John B O Mitchell
Journal:  Wiley Interdiscip Rev Comput Mol Sci       Date:  2014-09-01
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.