Literature DB >> 21504223

A comprehensive support vector machine binary hERG classification model based on extensive but biased end point hERG data sets.

Meng-Yu Shen1, Bo-Han Su, Emilio Xavier Esposito, Anton J Hopfinger, Yufeng J Tseng.   

Abstract

The human ether-a-go-go related gene (hERG) potassium ion channel plays a key role in cardiotoxicity and is therefore a key target as part of preclinical drug discovery toxicity screening. The PubChem hERG Bioassay data set, composed of 1668 compounds, was used to construct an in silico screening model. The corresponding trial models were constructed from a descriptor pool composed of 4D fingerprints (4D-FP) and traditional 2D and 3D VolSurf-like molecular descriptors. A final binary classification model was constructed via a support vector machine (SVM). The resultant model was then validated using the PubChem hERG Bioassay data set (AID 376) and an external hERG data set by evaluating the model's ability to determine hERG blockers from nonblockers. The external data set (the test set) consisted of 356 compounds collected from available literature data and consisting of 287 actives and 69 inactives. Four different sampling protocols and a 10-fold cross-correlation analysis--used in the validation process to evaluate classification models--explored the impact of the active--inactive data imbalance distribution of the PubChem high-throughput data set. Four different data sets were explored, and the one employing Lipinski's rule-of-five coupled with measures of relative molecular lipophilicity performed the best in the 10-fold cross-correlation validation of the training data set as well as overall prediction accuracy of the external test sets. The linear SVM binary classification model building strategy was applied to different combinations of MOE (traditional 2D, "21/2D", and 3D VolSurf-like) and 4D-FP molecular descriptors to further explore and refine previously proposed key descriptors, identify new significant features that contribute to the prediction of hERG toxicity, and construct the optimal SVM binary classification model from a shrunken descriptor pool. The accuracy, sensitivity, and specificity of the best model determined from 10-fold cross-validation are 95, 90, and 96%, respectively; the overall accuracy is near 87% for the external set. The models constructed in this study demonstrate the following: (i) robustness based upon performance in accuracy across the structural diversity of the training set, (ii) ability to predict a compound's "predisposition" to block hERG ion channels, and (iii) define and illustrate structural features that can be overlaid onto the chemical structures to aid in the 3D structure-activity interpretation of the hERG blocking effect.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21504223     DOI: 10.1021/tx200099j

Source DB:  PubMed          Journal:  Chem Res Toxicol        ISSN: 0893-228X            Impact factor:   3.739


  14 in total

1.  QSAR classification of metabolic activation of chemicals into covalently reactive species.

Authors:  Chin Yee Liew; Chuen Pan; Andre Tan; Ke Xin Magneline Ang; Chun Wei Yap
Journal:  Mol Divers       Date:  2012-02-28       Impact factor: 2.943

2.  The great descriptor melting pot: mixing descriptors for the common good of QSAR models.

Authors:  Yufeng J Tseng; Anton J Hopfinger; Emilio Xavier Esposito
Journal:  J Comput Aided Mol Des       Date:  2011-12-27       Impact factor: 3.686

3.  Prediction of skin sensitization potency using machine learning approaches.

Authors:  Qingda Zang; Michael Paris; David M Lehmann; Shannon Bell; Nicole Kleinstreuer; David Allen; Joanna Matheson; Abigail Jacobs; Warren Casey; Judy Strickland
Journal:  J Appl Toxicol       Date:  2017-01-10       Impact factor: 3.446

4.  Novel Bayesian classification models for predicting compounds blocking hERG potassium channels.

Authors:  Li-li Liu; Jing Lu; Yin Lu; Ming-yue Zheng; Xiao-min Luo; Wei-liang Zhu; Hua-liang Jiang; Kai-xian Chen
Journal:  Acta Pharmacol Sin       Date:  2014-06-30       Impact factor: 6.150

5.  In silico prediction of hERG potassium channel blockage by chemical category approaches.

Authors:  Chen Zhang; Yuan Zhou; Shikai Gu; Zengrui Wu; Wenjie Wu; Changming Liu; Kaidong Wang; Guixia Liu; Weihua Li; Philip W Lee; Yun Tang
Journal:  Toxicol Res (Camb)       Date:  2016-01-14       Impact factor: 3.524

6.  A critical assessment of combined ligand- and structure-based approaches to HERG channel blocker modeling.

Authors:  Lei Du-Cuny; Lu Chen; Shuxing Zhang
Journal:  J Chem Inf Model       Date:  2011-10-13       Impact factor: 4.956

Review 7.  Paradigm shift in toxicity testing and modeling.

Authors:  Hongmao Sun; Menghang Xia; Christopher P Austin; Ruili Huang
Journal:  AAPS J       Date:  2012-04-20       Impact factor: 4.009

Review 8.  Getting the most out of PubChem for virtual screening.

Authors:  Sunghwan Kim
Journal:  Expert Opin Drug Discov       Date:  2016-08-05       Impact factor: 6.098

Review 9.  On exploring structure-activity relationships.

Authors:  Rajarshi Guha
Journal:  Methods Mol Biol       Date:  2013

10.  Multivariate models for prediction of human skin sensitization hazard.

Authors:  Judy Strickland; Qingda Zang; Michael Paris; David M Lehmann; David Allen; Neepa Choksi; Joanna Matheson; Abigail Jacobs; Warren Casey; Nicole Kleinstreuer
Journal:  J Appl Toxicol       Date:  2016-08-02       Impact factor: 3.446

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.