Literature DB >> 14632445

Random forest: a classification and regression tool for compound classification and QSAR modeling.

Vladimir Svetnik1, Andy Liaw, Christopher Tong, J Christopher Culberson, Robert P Sheridan, Bradley P Feuston.   

Abstract

A new classification and regression tool, Random Forest, is introduced and investigated for predicting a compound's quantitative or categorical biological activity based on a quantitative description of the compound's molecular structure. Random Forest is an ensemble of unpruned classification or regression trees created by using bootstrap samples of the training data and random feature selection in tree induction. Prediction is made by aggregating (majority vote or averaging) the predictions of the ensemble. We built predictive models for six cheminformatics data sets. Our analysis demonstrates that Random Forest is a powerful tool capable of delivering performance that is among the most accurate methods to date. We also present three additional features of Random Forest: built-in performance assessment, a measure of relative importance of descriptors, and a measure of compound similarity that is weighted by the relative importance of descriptors. It is the combination of relatively high prediction accuracy and its collection of desired features that makes Random Forest uniquely suited for modeling in cheminformatics.

Year:  2003        PMID: 14632445     DOI: 10.1021/ci034160g

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  404 in total

1.  Computational analysis of HIV-1 protease protein binding pockets.

Authors:  Gene M Ko; A Srinivas Reddy; Sunil Kumar; Barbara A Bailey; Rajni Garg
Journal:  J Chem Inf Model       Date:  2010-10-25       Impact factor: 4.956

2.  Can we really do computer-aided drug design?

Authors:  Matthew Segall
Journal:  J Comput Aided Mol Des       Date:  2011-12-11       Impact factor: 3.686

3.  A seven-gene expression panel distinguishing clonal expansions of pre-leukemic and chronic lymphocytic leukemia B cells from normal B lymphocytes.

Authors:  Brian A McCarthy; Sophia Yancopoulos; Mike Tipping; Xiao-Jie Yan; Xue Ping Wang; Fiona Bennett; Wentian Li; Martin Lesser; Santanu Paul; Erin Boyle; Carolina Moreno; Rosa Catera; Bradley T Messmer; Giovanna Cutrona; Manlio Ferrarini; Jonathan E Kolitz; Steven L Allen; Kanti R Rai; Andrew C Rawstron; Nicholas Chiorazzi
Journal:  Immunol Res       Date:  2015-12       Impact factor: 2.829

4.  Discovery of Influenza A virus neuraminidase inhibitors using support vector machine and Naïve Bayesian models.

Authors:  Wenwen Lian; Jiansong Fang; Chao Li; Xiaocong Pang; Ai-Lin Liu; Guan-Hua Du
Journal:  Mol Divers       Date:  2015-12-21       Impact factor: 2.943

5.  Shallow Representation Learning via Kernel PCA Improves QSAR Modelability.

Authors:  Stefano E Rensi; Russ B Altman
Journal:  J Chem Inf Model       Date:  2017-08-07       Impact factor: 4.956

6.  Segmenting patients and physicians using preferences from discrete choice experiments.

Authors:  Ken Deal
Journal:  Patient       Date:  2014       Impact factor: 3.883

7.  Ensemble of trees approaches to risk adjustment for evaluating a hospital's performance.

Authors:  Yang Liu; Mikhail Traskin; Scott A Lorch; Edward I George; Dylan Small
Journal:  Health Care Manag Sci       Date:  2014-04-29

Review 8.  Machine learning in chemoinformatics and drug discovery.

Authors:  Yu-Chen Lo; Stefano E Rensi; Wen Torng; Russ B Altman
Journal:  Drug Discov Today       Date:  2018-05-08       Impact factor: 7.851

Review 9.  Computational approaches to analyse and predict small molecule transport and distribution at cellular and subcellular levels.

Authors:  Kyoung Ah Min; Xinyuan Zhang; Jing-yu Yu; Gus R Rosania
Journal:  Biopharm Drug Dispos       Date:  2013-12-10       Impact factor: 1.627

10.  Multiplex protein analysis to determine fibrosis stage and progression in patients with chronic hepatitis C.

Authors:  Keyur Patel; Katja S Remlinger; Terence G Walker; Peter Leitner; Joseph E Lucas; Stephen D Gardner; John G McHutchison; Will Irving; Indra Neil Guha
Journal:  Clin Gastroenterol Hepatol       Date:  2014-05-09       Impact factor: 11.382

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.