Literature DB >> 17238253

Classification of highly unbalanced CYP450 data of drugs using cost sensitive machine learning techniques.

T Eitrich1, A Kless, C Druska, W Meyer, J Grotendorst.   

Abstract

In this paper, we study the classifications of unbalanced data sets of drugs. As an example we chose a data set of 2D6 inhibitors of cytochrome P450. The human cytochrome P450 2D6 isoform plays a key role in the metabolism of many drugs in the preclinical drug discovery process. We have collected a data set from annotated public data and calculated physicochemical properties with chemoinformatics methods. On top of this data, we have built classifiers based on machine learning methods. Data sets with different class distributions lead to the effect that conventional machine learning methods are biased toward the larger class. To overcome this problem and to obtain sensitive but also accurate classifiers we combine machine learning and feature selection methods with techniques addressing the problem of unbalanced classification, such as oversampling and threshold moving. We have used our own implementation of a support vector machine algorithm as well as the maximum entropy method. Our feature selection is based on the unsupervised McCabe method. The classification results from our test set are compared structurally with compounds from the training set. We show that the applied algorithms enable the effective high throughput in silico classification of potential drug candidates.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17238253     DOI: 10.1021/ci6002619

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  13 in total

1.  Exploration of the binding of curcumin analogues to human P450 2C9 based on docking and molecular dynamics simulation.

Authors:  Rongwei Shi; Yin Wang; Xiaolei Zhu; Xiaohua Lu
Journal:  J Mol Model       Date:  2011-11-12       Impact factor: 1.810

2.  Exploring different strategies for imbalanced ADME data problem: case study on Caco-2 permeability modeling.

Authors:  Hai Pham-The; Gerardo Casañola-Martin; Teresa Garrigues; Marival Bermejo; Isabel González-Álvarez; Nam Nguyen-Hai; Miguel Ángel Cabrera-Pérez; Huong Le-Thi-Thu
Journal:  Mol Divers       Date:  2015-12-07       Impact factor: 2.943

3.  In silico prediction of pesticide aquatic toxicity with chemical category approaches.

Authors:  Fuxing Li; Defang Fan; Hao Wang; Hongbin Yang; Weihua Li; Yun Tang; Guixia Liu
Journal:  Toxicol Res (Camb)       Date:  2017-07-31       Impact factor: 3.524

4.  Supervised Methods for Biomarker Detection from Microarray Experiments.

Authors:  Angela Serra; Luca Cattelani; Michele Fratello; Vittorio Fortino; Pia Anneli Sofia Kinaret; Dario Greco
Journal:  Methods Mol Biol       Date:  2022

5.  Virtual screening of bioassay data.

Authors:  Amanda C Schierz
Journal:  J Cheminform       Date:  2009-12-22       Impact factor: 5.514

Review 6.  Computational prediction of metabolism: sites, products, SAR, P450 enzyme dynamics, and mechanisms.

Authors:  Johannes Kirchmair; Mark J Williamson; Jonathan D Tyzack; Lu Tan; Peter J Bond; Andreas Bender; Robert C Glen
Journal:  J Chem Inf Model       Date:  2012-02-17       Impact factor: 4.956

7.  Synthesis, antibiotic modifying activity, ADMET study and molecular docking of chalcone (E)-3-(2,4-dichlorophenyl)-1-(2-hydroxyphenyl)prop-2-en-1-one in strains of Staphylococcus aureus carrying MepA efflux pumps.

Authors:  Janaína Esmeraldo Rocha; Thiago Sampaio de Freitas; Jayze da Cunha Xavier; Raimundo Luiz Silva Pereira; Francisco Nascimento Pereira; Carlos Emídio Sampaio Nogueira; Márcia Machado Marinho; Paulo Nogueira Bandeira; Maria Alyce Albuquerque Fernandes; Emmanuel Silva Marinho; Alexandre Magno Rodrigues Teixeira; Hélcio Silva Dos Santos; Henrique Douglas Melo Coutinho
Journal:  Arch Microbiol       Date:  2021-12-23       Impact factor: 2.552

8.  Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets.

Authors:  Vinita Periwal; Shireesha Kishtapuram; Vinod Scaria
Journal:  BMC Pharmacol       Date:  2012-03-31

9.  Asymmetric bagging and feature selection for activities prediction of drug molecules.

Authors:  Guo-Zheng Li; Hao-Hua Meng; Wen-Cong Lu; Jack Y Yang; Mary Qu Yang
Journal:  BMC Bioinformatics       Date:  2008-05-28       Impact factor: 3.169

10.  A chemoinformatics approach to the discovery of lead-like molecules from marine and microbial sources en route to antitumor and antibiotic drugs.

Authors:  Florbela Pereira; Diogo A R S Latino; Susana P Gaudêncio
Journal:  Mar Drugs       Date:  2014-01-27       Impact factor: 5.118

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.