Literature DB >> 28204549

Using predictive specificity to determine when gene set analysis is biologically meaningful.

Sara Ballouz1, Paul Pavlidis2, Jesse Gillis1.   

Abstract

Gene set analysis, which translates gene lists into enriched functions, is among the most common bioinformatic methods. Yet few would advocate taking the results at face value. Not only is there no agreement on the algorithms themselves, there is no agreement on how to benchmark them. In this paper, we evaluate the robustness and uniqueness of enrichment results as a means of assessing methods even where correctness is unknown. We show that heavily annotated (‘multifunctional’) genes are likely to appear in genomics study results and drive the generation of biologically non-specific enrichment results as well as highly fragile significances. By providing a means of determining where enrichment analyses report non-specific and non-robust findings, we are able to assess where we can be confident in their use. We find significant progress in recent bias correction methods for enrichment and provide our own software implementation. Our approach can be readily adapted to any pre-existing package.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28204549      PMCID: PMC5389513          DOI: 10.1093/nar/gkw957

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  48 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Gene function analysis in complex data sets using ErmineJ.

Authors:  Jesse Gillis; Meeta Mistry; Paul Pavlidis
Journal:  Nat Protoc       Date:  2010-06-03       Impact factor: 13.491

3.  Analyzing gene expression data in terms of gene sets: methodological issues.

Authors:  Jelle J Goeman; Peter Bühlmann
Journal:  Bioinformatics       Date:  2007-02-15       Impact factor: 6.937

4.  Ontologizer 2.0--a multifunctional tool for GO term enrichment analysis and data exploration.

Authors:  Sebastian Bauer; Steffen Grossmann; Martin Vingron; Peter N Robinson
Journal:  Bioinformatics       Date:  2008-05-29       Impact factor: 6.937

5.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

6.  Down-weighting overlapping genes improves gene set analysis.

Authors:  Adi Laurentiu Tarca; Sorin Draghici; Gaurav Bhatti; Roberto Romero
Journal:  BMC Bioinformatics       Date:  2012-06-19       Impact factor: 3.169

7.  "Guilt by association" is the exception rather than the rule in gene networks.

Authors:  Jesse Gillis; Paul Pavlidis
Journal:  PLoS Comput Biol       Date:  2012-03-29       Impact factor: 4.475

8.  A probabilistic generative model for GO enrichment analysis.

Authors:  Yong Lu; Roni Rosenfeld; Itamar Simon; Gerard J Nau; Ziv Bar-Joseph
Journal:  Nucleic Acids Res       Date:  2008-08-01       Impact factor: 16.971

9.  ErmineJ: tool for functional analysis of gene expression data sets.

Authors:  Homin K Lee; William Braynen; Kiran Keshav; Paul Pavlidis
Journal:  BMC Bioinformatics       Date:  2005-11-09       Impact factor: 3.169

10.  GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists.

Authors:  Eran Eden; Roy Navon; Israel Steinfeld; Doron Lipson; Zohar Yakhini
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

View more
  12 in total

1.  CEA: Combination-based gene set functional enrichment analysis.

Authors:  Duanchen Sun; Yinliang Liu; Xiang-Sun Zhang; Ling-Yun Wu
Journal:  Sci Rep       Date:  2018-08-30       Impact factor: 4.379

2.  Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap.

Authors:  Jüri Reimand; Ruth Isserlin; Veronique Voisin; Mike Kucera; Christian Tannus-Lopes; Asha Rostamianfar; Lina Wadi; Mona Meyer; Jeff Wong; Changjiang Xu; Daniele Merico; Gary D Bader
Journal:  Nat Protoc       Date:  2019-02       Impact factor: 13.491

3.  Monitoring changes in the Gene Ontology and their impact on genomic data analysis.

Authors:  Matthew Jacobson; Adriana Estela Sedeño-Cortés; Paul Pavlidis
Journal:  Gigascience       Date:  2018-08-01       Impact factor: 6.524

4.  Interpretation of biological experiments changes with evolution of the Gene Ontology and its annotations.

Authors:  Aurelie Tomczak; Jonathan M Mortensen; Rainer Winnenburg; Charles Liu; Dominique T Alessi; Varsha Swamy; Francesco Vallania; Shane Lofgren; Winston Haynes; Nigam H Shah; Mark A Musen; Purvesh Khatri
Journal:  Sci Rep       Date:  2018-03-23       Impact factor: 4.379

5.  The distinct effects of orally administered Lactobacillus rhamnosus GG and Lactococcus lactis subsp. lactis C59 on gene expression in the murine small intestine.

Authors:  Chise Suzuki; Ayako Aoki-Yoshida; Reiji Aoki; Keisuke Sasaki; Yoshiharu Takayama; Koko Mizumachi
Journal:  PLoS One       Date:  2017-12-08       Impact factor: 3.240

6.  Differential regulation of the immune system in a brain-liver-fats organ network during short-term fasting.

Authors:  Susie S Y Huang; Melanie Makhlouf; Eman H AbouMoussa; Mayra L Ruiz Tejada Segura; Lisa S Mathew; Kun Wang; Man C Leung; Damien Chaussabel; Darren W Logan; Antonio Scialdone; Mathieu Garand; Luis R Saraiva
Journal:  Mol Metab       Date:  2020-06-08       Impact factor: 7.422

7.  Prenatal Alcohol Exposure: Profiling Developmental DNA Methylation Patterns in Central and Peripheral Tissues.

Authors:  Alexandre A Lussier; Tamara S Bodnar; Matthew Mingay; Alexandre M Morin; Martin Hirst; Michael S Kobor; Joanne Weinberg
Journal:  Front Genet       Date:  2018-12-04       Impact factor: 4.599

8.  Mega-Analysis of Gene Expression in Mouse Models of Alzheimer's Disease.

Authors:  Beryl Zhuang; B Ogan Mancarci; Lilah Toker; Paul Pavlidis
Journal:  eNeuro       Date:  2019-12-04

9.  Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data.

Authors:  Ben D Fulcher; Aurina Arnatkeviciute; Alex Fornito
Journal:  Nat Commun       Date:  2021-05-11       Impact factor: 14.919

10.  A new method for evaluating the impacts of semantic similarity measures on the annotation of gene sets.

Authors:  Aarón Ayllón-Benítez; Fleur Mougin; Julien Allali; Rodolphe Thiébaut; Patricia Thébault
Journal:  PLoS One       Date:  2018-11-27       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.