Literature DB >> 19222376

Sparse canonical correlation analysis with application to genomic data integration.

Elena Parkhomenko1, David Tritchler, Joseph Beyene.   

Abstract

Large scale genomic studies with multiple phenotypic or genotypic measures may require the identification of complex multivariate relationships. In multivariate analysis a common way to inspect the relationship between two sets of variables based on their correlation is canonical correlation analysis, which determines linear combinations of all variables of each type with maximal correlation between the two linear combinations. However, in high dimensional data analysis, when the number of variables under consideration exceeds tens of thousands, linear combinations of the entire sets of features may lack biological plausibility and interpretability. In addition, insufficient sample size may lead to computational problems, inaccurate estimates of parameters and non-generalizable results. These problems may be solved by selecting sparse subsets of variables, i.e. obtaining sparse loadings in the linear combinations of variables of each type. In this paper we present Sparse Canonical Correlation Analysis (SCCA) which examines the relationships between two types of variables and provides sparse solutions that include only small subsets of variables of each type by maximizing the correlation between the subsets of variables of different types while performing variable selection. We also present an extension of SCCA--adaptive SCCA. We evaluate their properties using simulated data and illustrate practical use by applying both methods to the study of natural variation in human gene expression.

Entities:  

Mesh:

Year:  2009        PMID: 19222376     DOI: 10.2202/1544-6115.1406

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  85 in total

Review 1.  Statistical approaches for the analysis of DNA methylation microarray data.

Authors:  Kimberly D Siegmund
Journal:  Hum Genet       Date:  2011-04-26       Impact factor: 4.132

2.  Simultaneous analysis of multiple data types in pharmacogenomic studies using weighted sparse canonical correlation analysis.

Authors:  Prabhakar Chalise; Anthony Batzler; Ryan Abo; Liewei Wang; Brooke L Fridley
Journal:  OMICS       Date:  2012-06-26

3.  Structured sparse canonical correlation analysis for brain imaging genetics: an improved GraphNet method.

Authors:  Lei Du; Heng Huang; Jingwen Yan; Sungeun Kim; Shannon L Risacher; Mark Inlow; Jason H Moore; Andrew J Saykin; Li Shen
Journal:  Bioinformatics       Date:  2016-01-21       Impact factor: 6.937

4.  Canonical variate regression.

Authors:  Chongliang Luo; Jin Liu; Dipak K Dey; Kun Chen
Journal:  Biostatistics       Date:  2016-02-09       Impact factor: 5.899

5.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis.

Authors:  Daniela M Witten; Robert Tibshirani; Trevor Hastie
Journal:  Biostatistics       Date:  2009-04-17       Impact factor: 5.899

6.  SPARSE INTEGRATIVE CLUSTERING OF MULTIPLE OMICS DATA SETS.

Authors:  Ronglai Shen; Sijian Wang; Qianxing Mo
Journal:  Ann Appl Stat       Date:  2013-04-09       Impact factor: 2.083

7.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

8.  Detecting genetic associations with brain imaging phenotypes in Alzheimer's disease via a novel structured SCCA approach.

Authors:  Lei Du; Kefei Liu; Xiaohui Yao; Shannon L Risacher; Junwei Han; Andrew J Saykin; Lei Guo; Li Shen
Journal:  Med Image Anal       Date:  2020-01-23       Impact factor: 8.545

9.  Sparse canonical correlation analysis relates network-level atrophy to multivariate cognitive measures in a neurodegenerative population.

Authors:  Brian B Avants; David J Libon; Katya Rascovsky; Ashley Boller; Corey T McMillan; Lauren Massimo; H Branch Coslett; Anjan Chatterjee; Rachel G Gross; Murray Grossman
Journal:  Neuroimage       Date:  2013-10-02       Impact factor: 6.556

10.  Dementia induces correlated reductions in white matter integrity and cortical thickness: a multivariate neuroimaging study with sparse canonical correlation analysis.

Authors:  Brian B Avants; Philip A Cook; Lyle Ungar; James C Gee; Murray Grossman
Journal:  Neuroimage       Date:  2010-01-18       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.