Literature DB >> 17238267

Contemporary QSAR classifiers compared.

Craig L Bruce1, James L Melville, Stephen D Pickett, Jonathan D Hirst.   

Abstract

We present a comparative assessment of several state-of-the-art machine learning tools for mining drug data, including support vector machines (SVMs) and the ensemble decision tree methods boosting, bagging, and random forest, using eight data sets and two sets of descriptors. We demonstrate, by rigorous multiple comparison statistical tests, that these techniques can provide consistent improvements in predictive performance over single decision trees. However, within these methods, there is no clearly best-performing algorithm. This motivates a more in-depth investigation into the properties of random forests. We identify a set of parameters for the random forest that provide optimal performance across all the studied data sets. Additionally, the tree ensemble structure of the forest may provide an interpretable model, a considerable advantage over SVMs. We test this possibility and compare it with standard decision tree models.

Mesh:

Year:  2007        PMID: 17238267     DOI: 10.1021/ci600332j

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  22 in total

1.  Prediction of carcinogenicity for diverse chemicals based on substructure grouping and SVM modeling.

Authors:  Kazutoshi Tanabe; Bono Lučić; Dragan Amić; Takio Kurita; Mikio Kaihara; Natsuo Onodera; Takahiro Suzuki
Journal:  Mol Divers       Date:  2010-02-26       Impact factor: 2.943

2.  Reliably assessing prediction reliability for high dimensional QSAR data.

Authors:  Jianping Huang; Xiaohui Fan
Journal:  Mol Divers       Date:  2012-12-19       Impact factor: 2.943

3.  Representing descriptors derived from multiple conformations as uncertain features for machine learning.

Authors:  Ulf Norinder; Henrik Boström
Journal:  J Mol Model       Date:  2013-03-12       Impact factor: 1.810

4.  Brainstorming: weighted voting prediction of inhibitors for protein targets.

Authors:  Dariusz Plewczynski
Journal:  J Mol Model       Date:  2010-09-21       Impact factor: 1.810

5.  Models for anti-tumor activity of bisphosphonates using refined topochemical descriptors.

Authors:  Rakesh K Goyal; G Singh; A K Madan
Journal:  Naturwissenschaften       Date:  2011-09-04

Review 6.  Considerations and recent advances in QSAR models for cytochrome P450-mediated drug metabolism prediction.

Authors:  Haiyan Li; Jin Sun; Xiaowen Fan; Xiaofan Sui; Lan Zhang; Yongjun Wang; Zhonggui He
Journal:  J Comput Aided Mol Des       Date:  2008-06-24       Impact factor: 3.686

7.  A model-based ensembling approach for developing QSARs.

Authors:  Qianyi Zhang; Jacqueline M Hughes-Oliver; Raymond T Ng
Journal:  J Chem Inf Model       Date:  2009-08       Impact factor: 4.956

8.  AZOrange - High performance open source machine learning for QSAR modeling in a graphical programming environment.

Authors:  Jonna C Stålring; Lars A Carlsson; Pedro Almeida; Scott Boyer
Journal:  J Cheminform       Date:  2011-07-28       Impact factor: 5.514

9.  The influence of the inactives subset generation on the performance of machine learning methods.

Authors:  Sabina Smusz; Rafał Kurczab; Andrzej J Bojarski
Journal:  J Cheminform       Date:  2013-04-05       Impact factor: 5.514

10.  QSAR workbench: automating QSAR modeling to drive compound design.

Authors:  Richard Cox; Darren V S Green; Christopher N Luscombe; Noj Malcolm; Stephen D Pickett
Journal:  J Comput Aided Mol Des       Date:  2013-04-25       Impact factor: 3.686

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.