Literature DB >> 21492133

Simultaneous analysis of coupled data matrices subject to different amounts of noise.

Tom F Wilderjans1, Eva Ceulemans, Iven Van Mechelen, Robert A van den Berg.   

Abstract

In many areas of science, research questions imply the analysis of a set of coupled data blocks, with, for instance, each block being an experimental unit by variable matrix, and the variables being the same in all matrices. To obtain an overall picture of the mechanisms that play a role in the different data matrices, the information in these matrices needs to be integrated. This may be achieved by applying a data-analytic strategy in which a global model is fitted to all data matrices simultaneously, as in some forms of simultaneous component analysis (SCA). Since such a strategy implies that all data entries, regardless the matrix they belong to, contribute equally to the analysis, it may obfuscate the overall picture of the mechanisms underlying the data when the different data matrices are subject to different amounts of noise. One way out is to downweight entries from noisy data matrices in favour of entries from less noisy matrices. Information regarding the amount of noise that is present in each matrix, however, is, in most cases, not available. To deal with these problems, in this paper a novel maximum-likelihood-based simultaneous component analysis method, referred to as MxLSCA, is proposed. Being a stochastic extension of SCA, in MxLSCA the amount of noise in each data matrix is estimated and entries from noisy data matrices are downweighted. Both in an extensive simulation study and in an application to data stemming from cross-cultural emotion psychology, it is shown that the novel MxLSCA strategy outperforms the SCA strategy with respect to disclosing the mechanisms underlying the coupled data. ©2010 The British Psychological Society.

Mesh:

Year:  2011        PMID: 21492133     DOI: 10.1348/000711010X513263

Source DB:  PubMed          Journal:  Br J Math Stat Psychol        ISSN: 0007-1102            Impact factor:   3.380


  5 in total

1.  Modeling differences in the dimensionality of multiblock data by means of clusterwise simultaneous component analysis.

Authors:  Kim De Roover; Eva Ceulemans; Marieke E Timmerman; John B Nezlek; Patrick Onghena
Journal:  Psychometrika       Date:  2013-01-25       Impact factor: 2.500

2.  Principal Covariates Clusterwise Regression (PCCR): Accounting for Multicollinearity and Population Heterogeneity in Hierarchically Organized Data.

Authors:  Tom Frans Wilderjans; Eva Vande Gaer; Henk A L Kiers; Iven Van Mechelen; Eva Ceulemans
Journal:  Psychometrika       Date:  2016-11-30       Impact factor: 2.500

3.  A flexible framework for sparse simultaneous component based data integration.

Authors:  Katrijn Van Deun; Tom F Wilderjans; Robert A van den Berg; Anestis Antoniadis; Iven Van Mechelen
Journal:  BMC Bioinformatics       Date:  2011-11-15       Impact factor: 3.169

4.  RegularizedSCA: Regularized simultaneous component analysis of multiblock data in R.

Authors:  Zhengguo Gu; Katrijn Van Deun
Journal:  Behav Res Methods       Date:  2019-10

5.  Fusing metabolomics data sets with heterogeneous measurement errors.

Authors:  Sandra Waaijenborg; Oksana Korobko; Ko Willems van Dijk; Mirjam Lips; Thomas Hankemeier; Tom F Wilderjans; Age K Smilde; Johan A Westerhuis
Journal:  PLoS One       Date:  2018-04-26       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.