Literature DB >> 12767155

VSMP: a novel variable selection and modeling method based on the prediction.

Shu-Shen Liu1, Hai-Ling Liu, Chun-Sheng Yin, Lian-Sheng Wang.   

Abstract

The use of numerous descriptors that are indicative of molecular structure and topology is becoming more common in quantitative structure-activity relationship (QSAR). How to choose the adequate descriptors for QSAR studies is important but difficult because there are no absolute rules to govern this choice. A variety of variable selection techniques including stepwise, partial least squares/principal component analysis (PLS/PCA), neural network, and evolutionary algorithm such as genetic algorithm have been applied to this common problem. All-subsets regression (ASR) is capable of finding out the best variable subset from among a large pool. In this paper, a novel variable selection and modeling method based on the prediction, for short VSMP, has been developed. Here two controllable parameters, the interrelation coefficient between the pairs of the independent variables (r(int)) and the correlation coefficient (q(2)) obtained using the leave-one-out (LOO) cross-validation technique, are introduced into the ASR to improve its performances. This technique differs from the other variable selection procedures related to the ASR by two main features: (1) The search of various optimal subset search is controlled by the statistic q(2) or root-mean-square error (RMSEP) in the LOO cross-validation step rather than the correlation coefficient obtained in the modeling step (r(2)). (2) The searching speed of all optimal subsets is expedited by the statistic r(int) together with q(2). A comparison of the results of the VSMP applied to the Selwood data set (n = 31 compounds, m = 53 descriptors) with those obtained from alternative algorithms shows the good performance of the technique.

Entities:  

Year:  2003        PMID: 12767155     DOI: 10.1021/ci020377j

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  8 in total

1.  Molecular electronegativity distance vector model for the prediction of bioconcentration factors in fish.

Authors:  Shu-Shen Liu; Li-Tang Qin; Hai-Ling Liu; Da-Qiang Yin
Journal:  J Mol Model       Date:  2007-12-13       Impact factor: 1.810

2.  QSPR model for bioconcentration factors of nonpolar organic compounds using molecular electronegativity distance vector descriptors.

Authors:  Li-Tang Qin; Shu-Shen Liu; Hai-Ling Liu
Journal:  Mol Divers       Date:  2009-04-15       Impact factor: 2.943

3.  Oxidation of disinfectants with Cl-substituted structure by a Fenton-like system Cu(2+)/H2O2 and analysis on their structure-reactivity relationship.

Authors:  Jianbiao Peng; Jianhua Li; Huanhuan Shi; Zunyao Wang; Shixiang Gao
Journal:  Environ Sci Pollut Res Int       Date:  2015-09-26       Impact factor: 4.223

4.  A study on quantitative structure-activity relationship and molecular docking of metalloproteinase inhibitors based on L-tyrosine scaffold.

Authors:  Maryam Abbasi; Fatemeh Ramezani; Maryam Elyasi; Hojjat Sadeghi-Aliabadi; Massoud Amanlou
Journal:  Daru       Date:  2015-04-29       Impact factor: 3.117

5.  Descriptor Selection via Log-Sum Regularization for the Biological Activities of Chemical Structure.

Authors:  Liang-Yong Xia; Yu-Wei Wang; De-Yu Meng; Xiao-Jun Yao; Hua Chai; Yong Liang
Journal:  Int J Mol Sci       Date:  2017-12-22       Impact factor: 5.923

6.  A QSAR study of environmental estrogens based on a novel variable selection method.

Authors:  Zhongsheng Yi; Aiqian Zhang
Journal:  Molecules       Date:  2012-05-21       Impact factor: 4.411

7.  Semi-empirical topological method for prediction of the gas chromatographic relative retention times of polybrominated diphenyl ethers (PBDEs).

Authors:  Hong-Yan Liu; Shu-Shen Liu; Li-Tang Qin
Journal:  J Mol Model       Date:  2007-03-28       Impact factor: 2.172

8.  Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods.

Authors:  María Jimena Martínez; Ignacio Ponzoni; Mónica F Díaz; Gustavo E Vazquez; Axel J Soto
Journal:  J Cheminform       Date:  2015-08-19       Impact factor: 5.514

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.