Literature DB >> 19572827

Extensions of sparse canonical correlation analysis with applications to genomic data.

Daniela M Witten1, Robert J Tibshirani.   

Abstract

In recent work, several authors have introduced methods for sparse canonical correlation analysis (sparse CCA). Suppose that two sets of measurements are available on the same set of observations. Sparse CCA is a method for identifying sparse linear combinations of the two sets of variables that are highly correlated with each other. It has been shown to be useful in the analysis of high-dimensional genomic data, when two sets of assays are available on the same set of samples. In this paper, we propose two extensions to the sparse CCA methodology. (1) Sparse CCA is an unsupervised method; that is, it does not make use of outcome measurements that may be available for each observation (e.g., survival time or cancer subtype). We propose an extension to sparse CCA, which we call sparse supervised CCA, which results in the identification of linear combinations of the two sets of variables that are correlated with each other and associated with the outcome. (2) It is becoming increasingly common for researchers to collect data on more than two assays on the same set of samples; for instance, SNP, gene expression, and DNA copy number measurements may all be available. We develop sparse multiple CCA in order to extend the sparse CCA methodology to the case of more than two data sets. We demonstrate these new methods on simulated data and on a recently published and publicly available diffuse large B-cell lymphoma data set.

Entities:  

Mesh:

Year:  2009        PMID: 19572827      PMCID: PMC2861323          DOI: 10.2202/1544-6115.1470

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  16 in total

1.  Spatial smoothing and hot spot detection for CGH data using the fused lasso.

Authors:  Robert Tibshirani; Pei Wang
Journal:  Biostatistics       Date:  2007-05-18       Impact factor: 5.899

2.  Quantifying the association between gene expressions and DNA-markers by penalized canonical correlation analysis.

Authors:  Sandra Waaijenborg; Philip C Verselewel de Witt Hamer; Aeilko H Zwinderman
Journal:  Stat Appl Genet Mol Biol       Date:  2008-01-23

3.  Sparse canonical correlation analysis with application to genomic data integration.

Authors:  Elena Parkhomenko; David Tritchler; Joseph Beyene
Journal:  Stat Appl Genet Mol Biol       Date:  2009-01-06

4.  Impact of DNA amplification on gene expression patterns in breast cancer.

Authors:  Elizabeth Hyman; Päivikki Kauraniemi; Sampsa Hautaniemi; Maija Wolf; Spyro Mousses; Ester Rozenblum; Markus Ringnér; Guido Sauter; Outi Monni; Abdel Elkahloun; Olli-P Kallioniemi; Anne Kallioniemi
Journal:  Cancer Res       Date:  2002-11-01       Impact factor: 12.701

5.  Diagnosis of multiple cancer types by shrunken centroids of gene expression.

Authors:  Robert Tibshirani; Trevor Hastie; Balasubramanian Narasimhan; Gilbert Chu
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-14       Impact factor: 11.205

6.  Relative impact of nucleotide and copy number variation on gene expression phenotypes.

Authors:  Barbara E Stranger; Matthew S Forrest; Mark Dunning; Catherine E Ingle; Claude Beazley; Natalie Thorne; Richard Redon; Christine P Bird; Anna de Grassi; Charles Lee; Chris Tyler-Smith; Nigel Carter; Stephen W Scherer; Simon Tavaré; Panagiotis Deloukas; Matthew E Hurles; Emmanouil T Dermitzakis
Journal:  Science       Date:  2007-02-09       Impact factor: 47.728

7.  Genome-wide associations of gene expression variation in humans.

Authors:  Barbara E Stranger; Matthew S Forrest; Andrew G Clark; Mark J Minichiello; Samuel Deutsch; Robert Lyle; Sarah Hunt; Brenda Kahl; Stylianos E Antonarakis; Simon Tavaré; Panagiotis Deloukas; Emmanouil T Dermitzakis
Journal:  PLoS Genet       Date:  2005-12-16       Impact factor: 5.917

8.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

9.  Sparse canonical methods for biological data integration: application to a cross-platform study.

Authors:  Kim-Anh Lê Cao; Pascal G P Martin; Christèle Robert-Granié; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2009-01-26       Impact factor: 3.169

10.  Genome-wide sparse canonical correlation of gene expression with genotypes.

Authors:  David Tritchler; Joseph Beyene; Elena Parkhomenko
Journal:  BMC Proc       Date:  2007-12-18
View more
  137 in total

Review 1.  Statistical approaches for the analysis of DNA methylation microarray data.

Authors:  Kimberly D Siegmund
Journal:  Hum Genet       Date:  2011-04-26       Impact factor: 4.132

2.  Simultaneous analysis of multiple data types in pharmacogenomic studies using weighted sparse canonical correlation analysis.

Authors:  Prabhakar Chalise; Anthony Batzler; Ryan Abo; Liewei Wang; Brooke L Fridley
Journal:  OMICS       Date:  2012-06-26

3.  Structured sparse canonical correlation analysis for brain imaging genetics: an improved GraphNet method.

Authors:  Lei Du; Heng Huang; Jingwen Yan; Sungeun Kim; Shannon L Risacher; Mark Inlow; Jason H Moore; Andrew J Saykin; Li Shen
Journal:  Bioinformatics       Date:  2016-01-21       Impact factor: 6.937

4.  Canonical variate regression.

Authors:  Chongliang Luo; Jin Liu; Dipak K Dey; Kun Chen
Journal:  Biostatistics       Date:  2016-02-09       Impact factor: 5.899

5.  Modeling gene-wise dependencies improves the identification of drug response biomarkers in cancer studies.

Authors:  Olga Nikolova; Russell Moser; Christopher Kemp; Mehmet Gönen; Adam A Margolin
Journal:  Bioinformatics       Date:  2017-05-01       Impact factor: 6.937

6.  SPARSE INTEGRATIVE CLUSTERING OF MULTIPLE OMICS DATA SETS.

Authors:  Ronglai Shen; Sijian Wang; Qianxing Mo
Journal:  Ann Appl Stat       Date:  2013-04-09       Impact factor: 2.083

7.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

8.  Detecting genetic associations with brain imaging phenotypes in Alzheimer's disease via a novel structured SCCA approach.

Authors:  Lei Du; Kefei Liu; Xiaohui Yao; Shannon L Risacher; Junwei Han; Andrew J Saykin; Lei Guo; Li Shen
Journal:  Med Image Anal       Date:  2020-01-23       Impact factor: 8.545

9.  Sparse canonical correlation analysis relates network-level atrophy to multivariate cognitive measures in a neurodegenerative population.

Authors:  Brian B Avants; David J Libon; Katya Rascovsky; Ashley Boller; Corey T McMillan; Lauren Massimo; H Branch Coslett; Anjan Chatterjee; Rachel G Gross; Murray Grossman
Journal:  Neuroimage       Date:  2013-10-02       Impact factor: 6.556

10.  Sparse Principal Component based High-Dimensional Mediation Analysis.

Authors:  Yi Zhao; Martin A Lindquist; Brian S Caffo
Journal:  Comput Stat Data Anal       Date:  2019-09-03       Impact factor: 1.681

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.