Literature DB >> 22458505

Decision tree models for data mining in hit discovery.

Felix Hammann1, Juergen Drewe.   

Abstract

INTRODUCTION: Decision tree induction (DTI) is a powerful means of modeling data without much prior preparation. Models are readable by humans, robust and easily applied in real-world applications, features that are mutually exclusive in other commonly used machine learning paradigms. While DTI is widely used in disciplines ranging from economics to medicine, they are an intriguing option in pharmaceutical research, especially when dealing with large data stores. AREAS COVERED: This review covers the automated technologies available for creating decision trees and other rules efficiently, even from large datasets such as chemical libraries. The authors discuss the need for properly documented and validated models. Lastly, the authors cover several case studies in hit discovery, drug metabolism and toxicology, and drug surveillance, and compare them with other established techniques. EXPERT OPINION: DTI is a competitive and easy-to-use tool in basic research as well as in hit and drug discovery. Its strengths lie in its ability to handle all sorts of different data formats, the visual nature of the models, and the small computational effort needed for implementation in real-world systems. Limitations include lack of robustness and over-fitted models for certain types of data. As with any modeling technique, proper validation and quality measures are of utmost importance.
© 2012 Informa UK, Ltd.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22458505     DOI: 10.1517/17460441.2012.668182

Source DB:  PubMed          Journal:  Expert Opin Drug Discov        ISSN: 1746-0441            Impact factor:   6.098


  3 in total

1.  Impact of Anesthetic Predictors on Postpartum Hospital Length of Stay and Adverse Events Following Cesarean Delivery: A Retrospective Study in 840 Consecutive Parturients.

Authors:  Ting Ting Oh; Colleen G Martel; Allison G Clark; Melissa B Russo; Bobby D Nossaman
Journal:  Ochsner J       Date:  2015

2.  Data Mining and Computational Modeling of High-Throughput Screening Datasets.

Authors:  Sean Ekins; Alex M Clark; Krishna Dole; Kellan Gregory; Andrew M Mcnutt; Anna Coulon Spektor; Charlie Weatherall; Nadia K Litterman; Barry A Bunin
Journal:  Methods Mol Biol       Date:  2018

3.  Computational prediction of blood-brain barrier permeability using decision tree induction.

Authors:  Claudia Suenderhauf; Felix Hammann; Jörg Huwyler
Journal:  Molecules       Date:  2012-08-31       Impact factor: 4.411

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.