Literature DB >> 15446820

Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents.

Y Xue1, Z R Li, C W Yap, L Z Sun, X Chen, Y Z Chen.   

Abstract

Statistical-learning methods have been developed for facilitating the prediction of pharmacokinetic and toxicological properties of chemical agents. These methods employ a variety of molecular descriptors to characterize structural and physicochemical properties of molecules. Some of these descriptors are specifically designed for the study of a particular type of properties or agents, and their use for other properties or agents might generate noise and affect the prediction accuracy of a statistical learning system. This work examines to what extent the reduction of this noise can improve the prediction accuracy of a statistical learning system. A feature selection method, recursive feature elimination (RFE), is used to automatically select molecular descriptors for support vector machines (SVM) prediction of P-glycoprotein substrates (P-gp), human intestinal absorption of molecules (HIA), and agents that cause torsades de pointes (TdP), a rare but serious side effect. RFE significantly reduces the number of descriptors for each of these properties thereby increasing the computational speed for their classification. The SVM prediction accuracies of P-gp and HIA are substantially increased and that of TdP remains unchanged by RFE. These prediction accuracies are comparable to those of earlier studies derived from a selective set of descriptors. Our study suggests that molecular feature selection is useful for improving the speed and, in some cases, the accuracy of statistical learning methods for the prediction of pharmacokinetic and toxicological properties of chemical agents. Copyright 2004 American Chemical Society

Entities:  

Mesh:

Year:  2004        PMID: 15446820     DOI: 10.1021/ci049869h

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  24 in total

1.  Toward better QSAR/QSPR modeling: simultaneous outlier detection and variable selection using distribution of model features.

Authors:  Dongsheng Cao; Yizeng Liang; Qingsong Xu; Yifeng Yun; Hongdong Li
Journal:  J Comput Aided Mol Des       Date:  2010-11-13       Impact factor: 3.686

2.  Integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening.

Authors:  Nobuyoshi Nagamine; Takayuki Shirakawa; Yusuke Minato; Kentaro Torii; Hiroki Kobayashi; Masaya Imoto; Yasubumi Sakakibara
Journal:  PLoS Comput Biol       Date:  2009-06-05       Impact factor: 4.475

3.  Consensus model for identification of novel PI3K inhibitors in large chemical library.

Authors:  Chin Yee Liew; Xiao Hua Ma; Chun Wei Yap
Journal:  J Comput Aided Mol Des       Date:  2010-02-11       Impact factor: 3.686

4.  Prediction of carcinogenicity for diverse chemicals based on substructure grouping and SVM modeling.

Authors:  Kazutoshi Tanabe; Bono Lučić; Dragan Amić; Takio Kurita; Mikio Kaihara; Natsuo Onodera; Takahiro Suzuki
Journal:  Mol Divers       Date:  2010-02-26       Impact factor: 2.943

5.  ADMET evaluation in drug discovery. 12. Development of binary classification models for prediction of hERG potassium channel blockage.

Authors:  Sichao Wang; Youyong Li; Junmei Wang; Lei Chen; Liling Zhang; Huidong Yu; Tingjun Hou
Journal:  Mol Pharm       Date:  2012-03-16       Impact factor: 4.939

6.  LigSeeSVM: ligand-based virtual screening using support vector machines and data fusion.

Authors:  Yen-Fu Chen; Kai-Cheng Hsu; Po-Tsun Lin; D Frank Hsu; Bruce S Kristal; Jinn-Moon Yang
Journal:  Int J Comput Biol Drug Des       Date:  2011-07-21

7.  Computational identification of potential molecular interactions in Arabidopsis.

Authors:  Mingzhi Lin; Bin Hu; Lijuan Chen; Peng Sun; Yi Fan; Ping Wu; Xin Chen
Journal:  Plant Physiol       Date:  2009-07-10       Impact factor: 8.340

8.  Novel topological descriptors for analyzing biological networks.

Authors:  Matthias M Dehmer; Nicola N Barbarini; Kurt K Varmuza; Armin A Graber
Journal:  BMC Struct Biol       Date:  2010-06-17

Review 9.  Considerations and recent advances in QSAR models for cytochrome P450-mediated drug metabolism prediction.

Authors:  Haiyan Li; Jin Sun; Xiaowen Fan; Xiaofan Sui; Lan Zhang; Yongjun Wang; Zhonggui He
Journal:  J Comput Aided Mol Des       Date:  2008-06-24       Impact factor: 3.686

10.  Prediction of human intestinal absorption by GA feature selection and support vector machine regression.

Authors:  Aixia Yan; Zhi Wang; Zongyuan Cai
Journal:  Int J Mol Sci       Date:  2008-10-20       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.