Literature DB >> 11410044

QSAR with few compounds and many features.

D M Hawkins1, S C Basak, X Shi.   

Abstract

Fitting quantitative structure-activity relationships (QSAR) requires different statistical methodologies and, to some degree, philosophies depending on the "shape" of the data matrix. When few features are used and there are many compounds, it is a reasonable expectation that good feature subset selection may be made and that nonlinearities and nonadditivities can be detected and diagnosed. Where there are many features and few compounds, this is unrealistic. Methods such as ridge regression RR, PLS, and principal component regression PCR, which abjure feature selection and rely on linearity may provide good predictions and fair understanding. We report a development of ridge regression for the underdetermined case by using generalized cross-validation to choose the ridge constant and perform F-tests for additional information. Conventional regression diagnostics can be used in followup to identify nonlinearities and other departures from model. We illustrate the approach with QSAR models of four data sets using calculated molecular descriptors.

Entities:  

Mesh:

Year:  2001        PMID: 11410044     DOI: 10.1021/ci0001177

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  6 in total

1.  Predicting allergic contact dermatitis: a hierarchical structure-activity relationship (SAR) approach to chemical classification using topological and quantum chemical descriptors.

Authors:  Subhash C Basak; Denise Mills; Douglas M Hawkins
Journal:  J Comput Aided Mol Des       Date:  2008-03-13       Impact factor: 3.686

2.  Generic Feature Selection with Short Fat Data.

Authors:  B Clarke; J-H Chu
Journal:  J Indian Soc Agric Stat       Date:  2014

3.  Modeling the bioconcentration factors and bioaccumulation factors of polychlorinated biphenyls with posetic quantitative super-structure/activity relationships (QSSAR).

Authors:  Teodora Ivanciuc; Ovidiu Ivanciuc; Douglas J Klein
Journal:  Mol Divers       Date:  2006-05-19       Impact factor: 2.943

Review 4.  Towards reproducible computational drug discovery.

Authors:  Nalini Schaduangrat; Samuel Lampa; Saw Simeon; Matthew Paul Gleeson; Ola Spjuth; Chanin Nantasenamat
Journal:  J Cheminform       Date:  2020-01-28       Impact factor: 5.514

5.  Structure-activity models of oral clearance, cytotoxicity, and LD50: a screen for promising anticancer compounds.

Authors:  John C Boik; Robert A Newman
Journal:  BMC Pharmacol       Date:  2008-06-13

6.  OOMMPPAA: a tool to aid directed synthesis by the combined analysis of activity and structural data.

Authors:  Anthony R Bradley; Ian D Wall; Darren V S Green; Charlotte M Deane; Brian D Marsden
Journal:  J Chem Inf Model       Date:  2014-10-09       Impact factor: 4.956

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.