Literature DB >> 19756232

TESTING SIGNIFICANCE OF FEATURES BY LASSOED PRINCIPAL COMPONENTS.

Daniela M Witten1, Robert Tibshirani.   

Abstract

We consider the problem of testing the significance of features in high-dimensional settings. In particular, we test for differentially-expressed genes in a microarray experiment. We wish to identify genes that are associated with some type of outcome, such as survival time or cancer type. We propose a new procedure, called Lassoed Principal Components (LPC), that builds upon existing methods and can provide a sizable improvement. For instance, in the case of two-class data, a standard (albeit simple) approach might be to compute a two-sample t-statistic for each gene. The LPC method involves projecting these conventional gene scores onto the eigenvectors of the gene expression data covariance matrix and then applying an L(1) penalty in order to de-noise the resulting projections. We present a theoretical framework under which LPC is the logical choice for identifying significant genes, and we show that LPC can provide a marked reduction in false discovery rates over the conventional methods on both real and simulated data. Moreover, this flexible procedure can be applied to a variety of types of data and can be used to improve many existing methods for the identification of significant features.

Entities:  

Year:  2008        PMID: 19756232      PMCID: PMC2743444          DOI: 10.1214/08-AOAS182SUPP

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  18 in total

1.  Singular value decomposition for genome-wide expression data processing and modeling.

Authors:  O Alter; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

2.  Improved statistical tests for differential gene expression by shrinking variance components estimates.

Authors:  Xiangqin Cui; J T Gene Hwang; Jing Qiu; Natalie J Blades; Gary A Churchill
Journal:  Biostatistics       Date:  2005-01       Impact factor: 5.899

Review 3.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

4.  Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

Authors:  Gordon K Smyth
Journal:  Stat Appl Genet Mol Biol       Date:  2004-02-12

5.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

6.  The consensus coding sequences of human breast and colorectal cancers.

Authors:  Tobias Sjöblom; Siân Jones; Laura D Wood; D Williams Parsons; Jimmy Lin; Thomas D Barber; Diana Mandelker; Rebecca J Leary; Janine Ptak; Natalie Silliman; Steve Szabo; Phillip Buckhaults; Christopher Farrell; Paul Meeh; Sanford D Markowitz; Joseph Willis; Dawn Dawson; James K V Willson; Adi F Gazdar; James Hartigan; Leo Wu; Changsheng Liu; Giovanni Parmigiani; Ben Ho Park; Kurtis E Bachman; Nickolas Papadopoulos; Bert Vogelstein; Kenneth W Kinzler; Victor E Velculescu
Journal:  Science       Date:  2006-09-07       Impact factor: 47.728

7.  High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics.

Authors:  Carlos M Carvalho; Jeffrey Chang; Joseph E Lucas; Joseph R Nevins; Quanli Wang; Mike West
Journal:  J Am Stat Assoc       Date:  2008-12-01       Impact factor: 5.033

8.  The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma.

Authors:  Andreas Rosenwald; George Wright; Wing C Chan; Joseph M Connors; Elias Campo; Richard I Fisher; Randy D Gascoyne; H Konrad Muller-Hermelink; Erlend B Smeland; Jena M Giltnane; Elaine M Hurt; Hong Zhao; Lauren Averett; Liming Yang; Wyndham H Wilson; Elaine S Jaffe; Richard Simon; Richard D Klausner; John Powell; Patricia L Duffey; Dan L Longo; Timothy C Greiner; Dennis D Weisenburger; Warren G Sanger; Bhavana J Dave; James C Lynch; Julie Vose; James O Armitage; Emilio Montserrat; Armando López-Guillermo; Thomas M Grogan; Thomas P Miller; Michel LeBlanc; German Ott; Stein Kvaloy; Jan Delabie; Harald Holte; Peter Krajci; Trond Stokke; Louis M Staudt
Journal:  N Engl J Med       Date:  2002-06-20       Impact factor: 91.245

9.  Gene-expression profiles predict survival of patients with lung adenocarcinoma.

Authors:  David G Beer; Sharon L R Kardia; Chiang-Ching Huang; Thomas J Giordano; Albert M Levin; David E Misek; Lin Lin; Guoan Chen; Tarek G Gharib; Dafydd G Thomas; Michelle L Lizyness; Rork Kuick; Satoru Hayasaka; Jeremy M G Taylor; Mark D Iannettoni; Mark B Orringer; Samir Hanash
Journal:  Nat Med       Date:  2002-07-15       Impact factor: 53.440

10.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

View more
  5 in total

1.  De-correlating expression in gene-set analysis.

Authors:  Dougu Nam
Journal:  Bioinformatics       Date:  2010-09-15       Impact factor: 6.937

Review 2.  Survival analysis with high-dimensional covariates.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  Stat Methods Med Res       Date:  2009-08-04       Impact factor: 3.021

3.  Identification of significant features in DNA microarray data.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013-07

4.  Hypertrophic Cardiomyopathy Registry: The rationale and design of an international, observational study of hypertrophic cardiomyopathy.

Authors:  Christopher M Kramer; Evan Appelbaum; Milind Y Desai; Patrice Desvigne-Nickens; John P DiMarco; Matthias G Friedrich; Nancy Geller; Sarahfaye Heckler; Carolyn Y Ho; Michael Jerosch-Herold; Elizabeth A Ivey; Julianna Keleti; Dong-Yun Kim; Paul Kolm; Raymond Y Kwong; Martin S Maron; Jeanette Schulz-Menger; Stefan Piechnik; Hugh Watkins; William S Weintraub; Pan Wu; Stefan Neubauer
Journal:  Am Heart J       Date:  2015-05-22       Impact factor: 4.749

5.  Transcriptomic profiles of high and low antibody responders to smallpox vaccine.

Authors:  R B Kennedy; A L Oberg; I G Ovsyannikova; I H Haralambieva; D Grill; G A Poland
Journal:  Genes Immun       Date:  2013-04-18       Impact factor: 2.676

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.