A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis.

Daniela M Witten, Robert Tibshirani, Trevor Hastie.

Abstract

We present a penalized matrix decomposition (PMD), a new framework for computing a rank-$K$ approximation for a matrix. We approximate the matrix $X$ as $\hat{X} = \sum_{k=1}^{K} d_k u_k v_k^T$, where $d_k$, $u_k$, and $v_k$ minimize the squared Frobenius norm of $X - \hat{X}$, subject to penalties on $u_k$ and $v_k$. This results in a regularized version of the singular value decomposition. Of particular interest is the use of $L_1$-penalties on $u_k$ and $v_k$, which yields a decomposition of $X$ using sparse vectors. We show that when the PMD is applied using an $L_1$-penalty on $v_k$ but not on $u_k$, a method for sparse principal components results. In fact, this yields an efficient algorithm for the "SCoTLASS" proposal (Jolliffe and others 2003) for obtaining sparse principal components. This method is demonstrated on a publicly available gene expression data set. We also establish connections between the SCoTLASS method for sparse principal component analysis and the method of Zou and others (2006). In addition, we show that when the PMD is applied to a cross-products matrix, it results in a method for penalized canonical correlation analysis (CCA). We apply this penalized CCA method to simulated data and to a genomic data set consisting of gene expression and DNA copy number measurements on the same set of samples.
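
The abstract does not spell out the computation, so the following is only a minimal sketch of a rank-one PMD with L1 penalties. It assumes an alternating update in which each vector is soft-thresholded and rescaled to unit length while the other is held fixed. The function names, the bound parameters `c_u` and `c_v`, and the bisection search for the threshold are illustrative assumptions, not taken from the source.

```python
import numpy as np


def soft_threshold(a, delta):
    """Elementwise soft-thresholding: sign(a) * max(|a| - delta, 0)."""
    return np.sign(a) * np.maximum(np.abs(a) - delta, 0.0)


def l1_bounded_unit_vector(a, c, n_bisect=60):
    """Soft-threshold `a` and rescale to unit L2 norm so that the L1 norm is <= c.

    Assumes `a` is nonzero and c >= 1 (a unit vector cannot have L1 norm below 1).
    """
    a = np.asarray(a, dtype=float)
    unit = a / np.linalg.norm(a)
    if np.sum(np.abs(unit)) <= c:
        return unit  # bound already holds, so no thresholding is needed

    # Fallback that is always feasible for c >= 1: all weight on the largest entry.
    j = np.argmax(np.abs(a))
    best = np.zeros_like(a)
    best[j] = np.sign(a[j])

    # Bisection on the threshold: the L1/L2 ratio shrinks as the threshold grows.
    lo, hi = 0.0, np.abs(a[j])
    for _ in range(n_bisect):
        mid = 0.5 * (lo + hi)
        s = soft_threshold(a, mid)
        norm = np.linalg.norm(s)
        if norm == 0:
            hi = mid  # everything was zeroed out: threshold is too large
        elif np.sum(np.abs(s)) / norm <= c:
            best, hi = s / norm, mid  # feasible: try a smaller threshold
        else:
            lo = mid  # infeasible: need a larger threshold
    return best


def pmd_rank1(X, c_u, c_v, n_iter=100, tol=1e-8):
    """Rank-one sparse factor (d, u, v) of a nonzero matrix X (illustrative sketch)."""
    X = np.asarray(X, dtype=float)
    # Start from the leading right singular vector of X.
    v = np.linalg.svd(X, full_matrices=False)[2][0]
    for _ in range(n_iter):
        u = l1_bounded_unit_vector(X @ v, c_u)        # update u with v fixed
        v_new = l1_bounded_unit_vector(X.T @ u, c_v)  # update v with u fixed
        converged = np.linalg.norm(v_new - v) < tol
        v = v_new
        if converged:
            break
    d = u @ X @ v  # scalar weight of the rank-one term
    return d, u, v
```

Under these assumptions, setting `c_u` to the square root of the number of rows leaves u unpenalized, which matches the sparse principal components setting described in the abstract, and passing a cross-products matrix in place of `X` would correspond to the penalized CCA setting. Further components could be sought by repeating the procedure on the residual `X - d * np.outer(u, v)`.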

Year:  2009        PMID: 19377034      PMCID: PMC2697346          DOI: 10.1093/biostatistics/kxp008

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


