
Feature selection for descriptor based classification models. 2. Human intestinal absorption (HIA).

Jörg K Wegner, Holger Fröhlich, Andreas Zell.

Abstract

We show that the topological polar surface area (TPSA) descriptor and the radial distribution function (RDF) applied to electronic and steric atom properties, such as the conjugated electrotopological state (CETS), are the most relevant features/descriptors for predicting human intestinal absorption (HIA) out of a large set of 2934 features/descriptors. For a data set of 196 molecules with measured HIA values, 2934 features/descriptors were calculated using JOELib and MOE. We used an adaptive boosting algorithm (AdaBoost.M1) to solve the binary classification problem and Genetic Algorithms based on Shannon Entropy Cliques (GA-SEC) variants as hybrid feature selection algorithms. Features were selected with respect to the generalization ability of the classification model, avoiding a high variance for unseen molecules (overfitting).
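The boosting step named in the abstract can be illustrated with a minimal sketch of AdaBoost.M1 for a binary problem. This is not the authors' implementation: the weak learner (a one-feature decision stump), the function names, and the toy descriptor values and labels below are all assumptions made for illustration, and the GA-SEC feature selection is not reproduced here.

```python
import math

def stump_predict(stump, row):
    """Predict class 1 or 0 from a single thresholded feature."""
    feature, threshold, polarity = stump
    return 1 if polarity * (row[feature] - threshold) > 0 else 0

def train_stump(X, y, w):
    """Exhaustively pick the stump with the lowest weighted error."""
    best_err, best_stump = None, None
    for feature in range(len(X[0])):
        for threshold in sorted({row[feature] for row in X}):
            for polarity in (1, -1):
                stump = (feature, threshold, polarity)
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(stump, row) != yi)
                if best_err is None or err < best_err:
                    best_err, best_stump = err, stump
    return best_err, best_stump

def adaboost_m1(X, y, rounds=10):
    """AdaBoost.M1: reweight samples so later stumps focus on mistakes."""
    n = len(X)
    w = [1.0 / n] * n                      # start from a uniform distribution
    ensemble = []
    for _ in range(rounds):
        err, stump = train_stump(X, y, w)
        if err >= 0.5:                     # weak learner no better than chance
            break
        perfect = err == 0
        err = max(err, 1e-10)              # avoid division by zero below
        beta = err / (1.0 - err)
        ensemble.append((math.log(1.0 / beta), stump))
        if perfect:                        # nothing left to correct
            break
        # Shrink the weights of correctly classified samples, then renormalize.
        w = [wi * beta if stump_predict(stump, row) == yi else wi
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, row):
    """Weighted vote of all stumps; each vote counts log(1/beta)."""
    votes = {0: 0.0, 1: 0.0}
    for alpha, stump in ensemble:
        votes[stump_predict(stump, row)] += alpha
    return 1 if votes[1] > votes[0] else 0

# Hypothetical toy data: column 0 mimics a TPSA-like value, column 1 is noise.
# Label 1 = "well absorbed", 0 = "poorly absorbed" (made-up values).
X = [[40.0, 1.0], [60.0, 0.2], [75.0, 0.9], [90.0, 0.4],
     [150.0, 0.8], [170.0, 0.1], [200.0, 0.5], [220.0, 0.7]]
y = [1, 1, 1, 1, 0, 0, 0, 0]

ensemble = adaboost_m1(X, y)
print([predict(ensemble, row) for row in X])
```

In a feature selection setting like the one described, a classifier of this kind would be retrained on each candidate descriptor subset and scored on held-out molecules, so that subsets are ranked by generalization rather than training fit.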


Year:  2004        PMID: 15154759     DOI: 10.1021/ci034233w

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  7 in total

1.  The prediction of human oral absorption for diffusion rate-limited drugs based on heuristic method and support vector machine.

Authors:  H X Liu; R J Hu; R S Zhang; X J Yao; M C Liu; Z D Hu; B T Fan
Journal:  J Comput Aided Mol Des       Date:  2005-01       Impact factor: 3.686

2.  A radial-distribution-function approach for predicting rodent carcinogenicity.

Authors:  Aliuska Helguera Morales; Miguel Angel Cabrera Pérez; Maykel Pérez González
Journal:  J Mol Model       Date:  2006-01-19       Impact factor: 1.810

3.  Modeling kinetics of subcellular disposition of chemicals. (Review)

Authors:  Stefan Balaz
Journal:  Chem Rev       Date:  2009-05       Impact factor: 60.622

4.  Considerations and recent advances in QSAR models for cytochrome P450-mediated drug metabolism prediction. (Review)

Authors:  Haiyan Li; Jin Sun; Xiaowen Fan; Xiaofan Sui; Lan Zhang; Yongjun Wang; Zhonggui He
Journal:  J Comput Aided Mol Des       Date:  2008-06-24       Impact factor: 3.686

5.  ChemMine tools: an online service for analyzing and clustering small molecules.

Authors:  Tyler W H Backman; Yiqun Cao; Thomas Girke
Journal:  Nucleic Acids Res       Date:  2011-05-16       Impact factor: 16.971

6.  Discovery of Small-Molecule Activators for Glucose-6-Phosphate Dehydrogenase (G6PD) Using Machine Learning Approaches.

Authors:  Madhu Sudhana Saddala; Anton Lennikov; Hu Huang
Journal:  Int J Mol Sci       Date:  2020-02-23       Impact factor: 5.923

7.  Prediction of human intestinal absorption by GA feature selection and support vector machine regression.

Authors:  Aixia Yan; Zhi Wang; Zongyuan Cai
Journal:  Int J Mol Sci       Date:  2008-10-20       Impact factor: 5.923

