A decision-theory approach to interpretable set analysis for high-dimensional data.

Simina M Boca, Héctor Corrada Bravo, Brian Caffo, Jeffrey T Leek, Giovanni Parmigiani.

Abstract

A key problem in high-dimensional significance analysis is to find pre-defined sets that show enrichment for a statistical signal of interest; the classic example is the enrichment of gene sets for differentially expressed genes. Here, we propose a new decision-theory approach to the analysis of gene sets which focuses on estimating the fraction of non-null variables in a set. We introduce the idea of "atoms," non-overlapping sets based on the original pre-defined set annotations. Our approach focuses on finding the union of atoms that minimizes a weighted average of the number of false discoveries and missed discoveries. We introduce a new false discovery rate for sets, called the atomic false discovery rate (afdr), and prove that the optimal estimator in our decision-theory framework is to threshold the afdr. These results provide a coherent and interpretable framework for the analysis of sets that addresses the key issues of overlapping annotations and difficulty in interpreting p values in both competitive and self-contained tests. We illustrate our method and compare it to a popular existing method using simulated examples, as well as gene-set and brain ROI data analyses.
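The abstract gives no formulas, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that atoms are formed by grouping variables that share exactly the same set memberships, that each atom comes with an estimated fraction of non-null variables, and that the loss is w × (expected false discoveries) + (1 − w) × (expected missed discoveries). Under these assumptions the loss splits atom by atom, and reporting an atom is optimal exactly when its afdr (taken here as one minus its estimated non-null fraction) is at most 1 − w. The names `Atom`, `build_atoms`, `expected_loss`, `select_atoms`, and the weight `w` are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    name: str             # hypothetical label for the atom
    size: int             # number of variables in the atom
    frac_non_null: float  # assumed estimate of the fraction of non-null variables


def build_atoms(annotations):
    """Split overlapping sets into non-overlapping atoms.

    Assumption: an atom is the group of variables that belong to exactly
    the same collection of annotated sets.
    """
    groups = defaultdict(set)
    variables = set().union(*annotations.values())
    for v in variables:
        signature = frozenset(
            name for name, members in annotations.items() if v in members
        )
        groups[signature].add(v)
    return dict(groups)


def atom_afdr(atom):
    """Assumed atom-level afdr: expected fraction of null variables."""
    return 1.0 - atom.frac_non_null


def expected_loss(atoms, selected, w):
    """Weighted sum of expected false and missed discoveries (assumed loss)."""
    loss = 0.0
    for a in atoms:
        if a.name in selected:
            loss += w * a.size * (1.0 - a.frac_non_null)   # false discoveries
        else:
            loss += (1.0 - w) * a.size * a.frac_non_null   # missed discoveries
    return loss


def select_atoms(atoms, w):
    """Report the union of atoms whose afdr is at most 1 - w."""
    return {a.name for a in atoms if atom_afdr(a) <= 1.0 - w}
```

For example, with `w = 0.75` only atoms whose assumed afdr is at most 0.25 are reported, so an atom with an estimated non-null fraction of 0.9 is kept while atoms at 0.5 or 0.1 are not. A larger `w` penalizes false discoveries more heavily and therefore yields a smaller reported union.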

Keywords: Atomic false discovery rate; Gene-sets; Hypothesis testing; Set-level inference

Year: 2013 PMID: 23909925 PMCID: PMC3927844 DOI: 10.1111/biom.12060

Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 2.571
