Literature DB >> 12650589

Megavariate analysis of hierarchical QSAR data.

Lennart Eriksson1, Erik Johansson, Fredrik Lindgren, Michael Sjöström, Svante Wold.   

Abstract

Multivariate PCA- and PLS-models involving many variables are often difficult to interpret, because plots and lists of loadings, coefficients, VIPs, etc, rapidly become messy and hard to overview. There may then be a strong temptation to eliminate variables to obtain a smaller data set. Such a reduction of variables, however, often removes information and makes the modelling efforts less reliable. Model interpretation may be misleading and predictive power may deteriorate. A better alternative is usually to partition the variables into blocks of logically related variables and apply hierarchical data analysis. Such blocked data may be analyzed by PCA and PLS. This modelling forms the base-level of the hierarchical modelling set-up. On the base-level in-depth information is extracted for the different blocks. The score vectors formed on the base-level, here called 'super variables', may be linked together in new matrices on the top-level. On the top-level superficial relationships between the X- and the Y-data are investigated. In this paper the basic principles of hierarchical modelling by means of PCA and PLS are reviewed. One objective of the paper is to disseminate this concept to a broader QSAR audience. The hierarchical methods are used to analyze a set of 10 haloalkanes for which K = 30 chemical descriptors and M = 255 biological responses have been gathered. Due to the complexity of the biological data, they are sub-divided in four blocks. All the modelling steps on the base-level and the top-level are reported and the final QSAR model is interpreted thoroughly.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 12650589     DOI: 10.1023/a:1022450725545

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  4 in total

1.  Statistical molecular design of building blocks for combinatorial chemistry.

Authors:  A Linusson; J Gottfries; F Lindgren; S Wold
Journal:  J Med Chem       Date:  2000-04-06       Impact factor: 7.446

2.  A strategy for ranking environmentally occurring chemicals. Part VI. QSARs for the mutagenic effects of halogenated aliphatics.

Authors:  L Eriksson; S Hellberg; E Johansson; J Jonsson; M Sjöström; S Wold; R Berglind; B Karlsson
Journal:  Acta Chem Scand       Date:  1991-10

3.  Alignment of flexible molecules at their receptor site using 3D descriptors and Hi-PCA.

Authors:  A Berglund; M C De Rosa; S Wold
Journal:  J Comput Aided Mol Des       Date:  1997-11       Impact factor: 3.686

4.  Multivariate data analysis using D-optimal designs, partial least squares, and response surface modeling: A directional approach for the analysis of farnesyltransferase inhibitors.

Authors:  E Giraud; C Luttmann; F Lavelle; J F Riou; P Mailliet; A Laoui
Journal:  J Med Chem       Date:  2000-05-04       Impact factor: 7.446

  4 in total
  11 in total

1.  A support vector machine approach to classify human cytochrome P450 3A4 inhibitors.

Authors:  Jan M Kriegl; Thomas Arnhold; Bernd Beck; Thomas Fox
Journal:  J Comput Aided Mol Des       Date:  2005-03       Impact factor: 3.686

2.  Megavariate analysis of environmental QSAR data. Part II--investigating very complex problem formulations using hierarchical, non-linear and batch-wise extensions of PCA and PLS.

Authors:  Lennart Eriksson; Patrik L Andersson; Erik Johansson; Mats Tysklind
Journal:  Mol Divers       Date:  2006-06-27       Impact factor: 2.943

3.  Megavariate analysis of environmental QSAR data. Part I--a basic framework founded on principal component analysis (PCA), partial least squares (PLS), and statistical molecular design (SMD).

Authors:  Lennart Eriksson; Patrik L Andersson; Erik Johansson; Mats Tysklind
Journal:  Mol Divers       Date:  2006-06-13       Impact factor: 2.943

4.  Development, interpretation and temporal evaluation of a global QSAR of hERG electrophysiology screening data.

Authors:  Claire L Gavaghan; Catrin Hasselgren Arnby; Niklas Blomberg; Gert Strandlund; Scott Boyer
Journal:  J Comput Aided Mol Des       Date:  2007-03-24       Impact factor: 3.686

5.  On the interpretation and interpretability of quantitative structure-activity relationship models.

Authors:  Rajarshi Guha
Journal:  J Comput Aided Mol Des       Date:  2008-09-11       Impact factor: 3.686

6.  Meta-heuristics on quantitative structure-activity relationships: study on polychlorinated biphenyls.

Authors:  Lorentz Jäntschi; Sorana D Bolboacă; Radu E Sestraş
Journal:  J Mol Model       Date:  2009-07-17       Impact factor: 1.810

Review 7.  Methods for reliability and uncertainty assessment and for applicability evaluations of classification- and regression-based QSARs.

Authors:  Lennart Eriksson; Joanna Jaworska; Andrew P Worth; Mark T D Cronin; Robert M McDowell; Paola Gramatica
Journal:  Environ Health Perspect       Date:  2003-08       Impact factor: 9.031

8.  Gene expression in acute Stanford type A dissection: a comparative microarray study.

Authors:  Barbara Theresia Weis-Müller; Olga Modlich; Irina Drobinskaya; Derya Unay; Rita Huber; Hans Bojar; Jochen D Schipke; Peter Feindt; Emmeran Gams; Wolfram Müller; Timm Goecke; Wilhelm Sandmann
Journal:  J Transl Med       Date:  2006-07-06       Impact factor: 5.531

9.  Lipidomic Signature of Progression of Chronic Kidney Disease in the Chronic Renal Insufficiency Cohort.

Authors:  Farsad Afshinnia; Thekkelnaycke M Rajendiran; Alla Karnovsky; Tanu Soni; Xue Wang; Dawei Xie; Wei Yang; Tariq Shafi; Matthew R Weir; Jiang He; Carolyn S Brecklin; Eugene P Rhee; Jeffrey R Schelling; Akinlolu Ojo; Harold Feldman; George Michailidis; Subramaniam Pennathur
Journal:  Kidney Int Rep       Date:  2016-08-18

10.  Metabolic profiling of hepatitis B virus-related hepatocellular carcinoma with diverse differentiation grades.

Authors:  Lei Zhang; Ya Huang; Mingjian Lian; Zhijuan Fan; Yaqiong Tian; Yufan Wang; Hua Kang; Shuang Liu; Shuye Liu; Tong Li; Zhongqiang Shan
Journal:  Oncol Lett       Date:  2017-01-11       Impact factor: 2.967

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.