Exploring different strategies for imbalanced ADME data problem: case study on Caco-2 permeability modeling.

Hai Pham-The, Gerardo Casañola-Martin, Teresa Garrigues, Marival Bermejo, Isabel González-Álvarez, Nam Nguyen-Hai, Miguel Ángel Cabrera-Pérez, Huong Le-Thi-Thu.

Abstract

In many absorption, distribution, metabolism, and excretion (ADME) modeling problems, imbalanced data can degrade the classification performance of machine learning algorithms. Solutions for handling imbalanced datasets have been proposed, but their application to ADME modeling tasks is underexplored. In this paper, various strategies, including cost-sensitive learning and resampling methods, were studied to tackle the moderate imbalance problem of a large Caco-2 cell permeability database. Simple physicochemical molecular descriptors were used for data modeling. Support vector machine classifiers were constructed and compared using multiple comparison tests. The models built with resampling strategies performed better than the cost-sensitive classification models, especially with oversampled data, where the minority-class misclassification rates were 0.11 and 0.14 for the training and test sets, respectively. A consensus model with an enhanced applicability domain was subsequently constructed and showed improved performance. This model was used to predict a set of randomly selected high-permeability reference drugs according to the biopharmaceutics classification system. Overall, this study compares numerous rebalancing strategies and demonstrates the effectiveness of oversampling methods for imbalanced permeability data problems.
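As an illustration of the two quantities the abstract refers to, the following is a minimal sketch of random oversampling and of the minority-class misclassification rate. It is not the authors' pipeline: the abstract does not say which resampling algorithms or SVM settings were used, so plain random duplication is assumed here, and the function names `random_oversample` and `minority_error_rate` as well as the toy data are hypothetical.

```python
import random
from collections import Counter


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen rows of the smaller classes until every
    class has as many samples as the largest one (assumed strategy)."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        # Indices of the original samples that belong to this class.
        idx = [i for i, lab in enumerate(y) if lab == label]
        for _ in range(target - n):
            j = rng.choice(idx)
            X_out.append(X[j])
            y_out.append(label)
    return X_out, y_out


def minority_error_rate(y_true, y_pred, minority_label):
    """Fraction of true minority-class samples predicted as another class."""
    pairs = [(t, p) for t, p in zip(y_true, y_pred) if t == minority_label]
    if not pairs:
        raise ValueError("no samples of the minority class")
    return sum(t != p for t, p in pairs) / len(pairs)


# Toy data (hypothetical): four majority samples, two minority samples.
X = [[0.1], [0.2], [0.3], [0.4], [0.5], [0.9]]
y = [0, 0, 0, 0, 1, 1]

X_bal, y_bal = random_oversample(X, y)
balanced_counts = Counter(y_bal)

# One of the two true minority samples is misclassified here.
rate = minority_error_rate([1, 1, 0, 0], [1, 0, 0, 0], minority_label=1)
```

Oversampling should be applied to the training set only; the test set keeps its original class distribution so that the reported error rates remain meaningful.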

Keywords:  ADME modeling; Biopharmaceutics classification system; Caco-2 cell permeability; Cost-sensitive learning; Resampling technique; Support vector machine

Year: 2015    PMID: 26643659    DOI: 10.1007/s11030-015-9649-4

Source DB: PubMed    Journal: Mol Divers    ISSN: 1381-1991    Impact factor: 2.943

