Literature DB >> 15844449

The impact of variable selection on the modelling of oestrogenicity.

T Ghafourian1, M T D Cronin.   

Abstract

Many oestrogenic chemicals exert their activity via specific interactions with the oestrogen receptor (ER). The objective of the present study was to identify significant descriptors associated with the ER binding affinities of a large and diverse set of compounds to drive quantitative structure-activity relationships (QSARs). To this end, a variety of statistical methods were employed for variable selection. These included stepwise regression and partial least squares (PLS) analyses, as well as a non-linear recursive partitioning method (Formal Inference-based Recursive Modelling). A total of 157 molecular descriptors including quantum mechanical, graph theoretical, indicator variables and log P were used in the study. Furthermore, cluster analysis of variables was performed to identify groups of descriptors representing similar molecular features. Hierarchical PLS analyses were performed, where the scores of the significant components of either PLS or principle component analysis (PCA), performed separately on each cluster, were used as the variables for the top model. This reduced the number of the variables representing the larger clusters, leading to a similar number of descriptors for each distinct molecular feature. The results showed that the most important molecular properties for stronger ER binding affinity are molecular size and shape, the presence of a phenol moiety as well as other aromatic groups, hydrophobicity and presence of double bonds. The best PLS model obtained, in terms of predictive ability, was a hierarchical PLS model. However, a rigorous validation study showed that the MLR model using descriptors selected by stepwise regression has greater predictive power than the PLS models.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15844449     DOI: 10.1080/10629360412331319808

Source DB:  PubMed          Journal:  SAR QSAR Environ Res        ISSN: 1026-776X            Impact factor:   3.000


  5 in total

1.  In silico prediction of estrogen receptor subtype binding affinity and selectivity using statistical methods and molecular docking with 2-arylnaphthalenes and 2-arylquinolines.

Authors:  Zhizhong Wang; Yan Li; Chunzhi Ai; Yonghua Wang
Journal:  Int J Mol Sci       Date:  2010-09-20       Impact factor: 5.923

2.  Implementation of a dynamic intestinal gut-on-a-chip barrier model for transport studies of lipophilic dioxin congeners.

Authors:  Kornphimol Kulthong; Loes Duivenvoorde; Barbara Z Mizera; Deborah Rijkers; Guillaume Ten Dam; Gerlof Oegema; Tomasz Puzyn; Hans Bouwmeester; Meike van der Zande
Journal:  RSC Adv       Date:  2018-09-19       Impact factor: 4.036

3.  QSAR models for CXCR2 receptor antagonists based on the genetic algorithm for data preprocessing prior to application of the PLS linear regression method and design of the new compounds using in silico virtual screening.

Authors:  Tahereh Asadollahi; Shayessteh Dadfarnia; Ali Mohammad Haji Shabani; Jahan B Ghasemi; Maryam Sarkhosh
Journal:  Molecules       Date:  2011-02-25       Impact factor: 4.411

4.  Shuffling multivariate adaptive regression splines and adaptive neuro-fuzzy inference system as tools for QSAR study of SARS inhibitors.

Authors:  M Jalali-Heravi; M Asadollahi-Baboli; A Mani-Varnosfaderani
Journal:  J Pharm Biomed Anal       Date:  2009-07-14       Impact factor: 3.935

5.  On two novel parameters for validation of predictive QSAR models.

Authors:  Partha Pratim Roy; Somnath Paul; Indrani Mitra; Kunal Roy
Journal:  Molecules       Date:  2009-04-29       Impact factor: 4.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.