Literature DB >> 23909925

A decision-theory approach to interpretable set analysis for high-dimensional data.

Simina M Boca1, Héctor Céorrada Bravo, Brian Caffo, Jeffrey T Leek, Giovanni Parmigiani.   

Abstract

A key problem in high-dimensional significance analysis is to find pre-defined sets that show enrichment for a statistical signal of interest; the classic example is the enrichment of gene sets for differentially expressed genes. Here, we propose a new decision-theory approach to the analysis of gene sets which focuses on estimating the fraction of non-null variables in a set. We introduce the idea of "atoms," non-overlapping sets based on the original pre-defined set annotations. Our approach focuses on finding the union of atoms that minimizes a weighted average of the number of false discoveries and missed discoveries. We introduce a new false discovery rate for sets, called the atomic false discovery rate (afdr), and prove that the optimal estimator in our decision-theory framework is to threshold the afdr. These results provide a coherent and interpretable framework for the analysis of sets that addresses the key issues of overlapping annotations and difficulty in interpreting p values in both competitive and self-contained tests. We illustrate our method and compare it to a popular existing method using simulated examples, as well as gene-set and brain ROI data analyses.
© 2013, The International Biometric Society.

Entities:  

Keywords:  Atomic false discovery rate; Gene-sets; Hypothesis testing; Set-level inference

Mesh:

Year:  2013        PMID: 23909925      PMCID: PMC3927844          DOI: 10.1111/biom.12060

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  23 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  DRAGON View: information visualization for annotated microarray data.

Authors:  Christopher M L S Bouton; Jonathan Pevsner
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

Review 3.  Computational analysis of microarray data.

Authors:  J Quackenbush
Journal:  Nat Rev Genet       Date:  2001-06       Impact factor: 53.242

4.  An automated method for neuroanatomic and cytoarchitectonic atlas-based interrogation of fMRI data sets.

Authors:  Joseph A Maldjian; Paul J Laurienti; Robert A Kraft; Jonathan H Burdette
Journal:  Neuroimage       Date:  2003-07       Impact factor: 6.556

5.  Extensions to gene set enrichment.

Authors:  Zhen Jiang; Robert Gentleman
Journal:  Bioinformatics       Date:  2006-11-24       Impact factor: 6.937

6.  Analyzing gene expression data in terms of gene sets: methodological issues.

Authors:  Jelle J Goeman; Peter Bühlmann
Journal:  Bioinformatics       Date:  2007-02-15       Impact factor: 6.937

7.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

8.  GOing Bayesian: model-based gene set analysis of genome-scale data.

Authors:  Sebastian Bauer; Julien Gagneur; Peter N Robinson
Journal:  Nucleic Acids Res       Date:  2010-02-19       Impact factor: 16.971

9.  An integrated genomic analysis of human glioblastoma multiforme.

Authors:  D Williams Parsons; Siân Jones; Xiaosong Zhang; Jimmy Cheng-Ho Lin; Rebecca J Leary; Philipp Angenendt; Parminder Mankoo; Hannah Carter; I-Mei Siu; Gary L Gallia; Alessandro Olivi; Roger McLendon; B Ahmed Rasheed; Stephen Keir; Tatiana Nikolskaya; Yuri Nikolsky; Dana A Busam; Hanna Tekleab; Luis A Diaz; James Hartigan; Doug R Smith; Robert L Strausberg; Suely Kazue Nagahashi Marie; Sueli Mieko Oba Shinjo; Hai Yan; Gregory J Riggins; Darell D Bigner; Rachel Karchin; Nick Papadopoulos; Giovanni Parmigiani; Bert Vogelstein; Victor E Velculescu; Kenneth W Kinzler
Journal:  Science       Date:  2008-09-04       Impact factor: 47.728

10.  Multiple locus linkage analysis of genomewide expression in yeast.

Authors:  John D Storey; Joshua M Akey; Leonid Kruglyak
Journal:  PLoS Biol       Date:  2005-07-26       Impact factor: 8.029

View more
  1 in total

1.  Computational Pathology: A Path Ahead.

Authors:  David N Louis; Michael Feldman; Alexis B Carter; Anand S Dighe; John D Pfeifer; Lynn Bry; Jonas S Almeida; Joel Saltz; Jonathan Braun; John E Tomaszewski; John R Gilbertson; John H Sinard; Georg K Gerber; Stephen J Galli; Jeffrey A Golden; Michael J Becich
Journal:  Arch Pathol Lab Med       Date:  2015-06-22       Impact factor: 5.534

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.