Literature DB >> 32026945

Toward a gold standard for benchmarking gene set enrichment analysis.

Ludwig Geistlinger1, Gergely Csaba2, Mara Santarelli3, Marcel Ramos4, Lucas Schiffer5, Nitesh Turaga6, Charity Law7, Sean Davis8, Vincent Carey9, Martin Morgan, Ralf Zimmer, Levi Waldron1.   

Abstract

MOTIVATION: Although gene set enrichment analysis has become an integral part of high-throughput gene expression data analysis, the assessment of enrichment methods remains rudimentary and ad hoc. In the absence of suitable gold standards, evaluations are commonly restricted to selected datasets and biological reasoning on the relevance of resulting enriched gene sets.
RESULTS: We develop an extensible framework for reproducible benchmarking of enrichment methods based on defined criteria for applicability, gene set prioritization and detection of relevant processes. This framework incorporates a curated compendium of 75 expression datasets investigating 42 human diseases. The compendium features microarray and RNA-seq measurements, and each dataset is associated with a precompiled GO/KEGG relevance ranking for the corresponding disease under investigation. We perform a comprehensive assessment of 10 major enrichment methods, identifying significant differences in runtime and applicability to RNA-seq data, fraction of enriched gene sets depending on the null hypothesis tested and recovery of the predefined relevance rankings. We make practical recommendations on how methods originally developed for microarray data can efficiently be applied to RNA-seq data, how to interpret results depending on the type of gene set test conducted and which methods are best suited to effectively prioritize gene sets with high phenotype relevance. AVAILABILITY: http://bioconductor.org/packages/GSEABenchmarkeR. CONTACT: ludwig.geistlinger@sph.cuny.edu.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  RNA-seq; gene expression data; gene set analysis; microarray; pathway analysis

Year:  2021        PMID: 32026945      PMCID: PMC7820859          DOI: 10.1093/bib/bbz158

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  70 in total

1.  Rigorous assessment of gene set enrichment tests.

Authors:  Haroon Naeem; Ralf Zimmer; Pegah Tavakkolkhah; Robert Küffner
Journal:  Bioinformatics       Date:  2012-04-05       Impact factor: 6.937

2.  Using GOstats to test gene lists for GO term association.

Authors:  S Falcon; R Gentleman
Journal:  Bioinformatics       Date:  2006-11-10       Impact factor: 6.937

Review 3.  Microarrays, deep sequencing and the true measure of the transcriptome.

Authors:  John H Malone; Brian Oliver
Journal:  BMC Biol       Date:  2011-05-31       Impact factor: 7.431

4.  Oncogenic Signaling Pathways in The Cancer Genome Atlas.

Authors:  Francisco Sanchez-Vega; Marco Mina; Joshua Armenia; Walid K Chatila; Augustin Luna; Konnor C La; Sofia Dimitriadoy; David L Liu; Havish S Kantheti; Sadegh Saghafinia; Debyani Chakravarty; Foysal Daian; Qingsong Gao; Matthew H Bailey; Wen-Wei Liang; Steven M Foltz; Ilya Shmulevich; Li Ding; Zachary Heins; Angelica Ochoa; Benjamin Gross; Jianjiong Gao; Hongxin Zhang; Ritika Kundra; Cyriac Kandoth; Istemi Bahceci; Leonard Dervishi; Ugur Dogrusoz; Wanding Zhou; Hui Shen; Peter W Laird; Gregory P Way; Casey S Greene; Han Liang; Yonghong Xiao; Chen Wang; Antonio Iavarone; Alice H Berger; Trever G Bivona; Alexander J Lazar; Gary D Hammer; Thomas Giordano; Lawrence N Kwong; Grant McArthur; Chenfei Huang; Aaron D Tward; Mitchell J Frederick; Frank McCormick; Matthew Meyerson; Eliezer M Van Allen; Andrew D Cherniack; Giovanni Ciriello; Chris Sander; Nikolaus Schultz
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

5.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

6.  ROAST: rotation gene set tests for complex microarray experiments.

Authors:  Di Wu; Elgene Lim; François Vaillant; Marie-Liesse Asselin-Labat; Jane E Visvader; Gordon K Smyth
Journal:  Bioinformatics       Date:  2010-07-07       Impact factor: 6.937

7.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

Review 8.  Methods and approaches in the topology-based analysis of biological pathways.

Authors:  Cristina Mitrea; Zeinab Taghavi; Behzad Bokanizad; Samer Hanoudi; Rebecca Tagett; Michele Donato; Călin Voichiţa; Sorin Drăghici
Journal:  Front Physiol       Date:  2013-10-10       Impact factor: 4.566

9.  Data, information, knowledge and principle: back to metabolism in KEGG.

Authors:  Minoru Kanehisa; Susumu Goto; Yoko Sato; Masayuki Kawashima; Miho Furumichi; Mao Tanabe
Journal:  Nucleic Acids Res       Date:  2013-11-07       Impact factor: 16.971

10.  Combining multiple tools outperforms individual methods in gene set enrichment analyses.

Authors:  Monther Alhamdoosh; Milica Ng; Nicholas J Wilson; Julie M Sheridan; Huy Huynh; Michael J Wilson; Matthew E Ritchie
Journal:  Bioinformatics       Date:  2017-02-01       Impact factor: 6.937

View more
  25 in total

1.  Towards a comprehensive assessment of QSP models: what would it take?

Authors:  Ioannis P Androulakis
Journal:  J Pharmacokinet Pharmacodyn       Date:  2022-08-13       Impact factor: 2.410

2.  Microarray profiling identifies hsa_circ_0082003 as a novel tumor promoter for papillary thyroid carcinoma.

Authors:  J Ye; J-W Feng; W-X Wu; G-F Qi; F Wang; J Hu; L-Z Hong; S-Y Liu; Y Jiang
Journal:  J Endocrinol Invest       Date:  2022-09-17       Impact factor: 5.467

3.  GenomicSuperSignature facilitates interpretation of RNA-seq experiments through robust, efficient comparison to public databases.

Authors:  Sehyun Oh; Ludwig Geistlinger; Marcel Ramos; Daniel Blankenberg; Marius van den Beek; Jaclyn N Taroni; Vincent J Carey; Casey S Greene; Levi Waldron; Sean Davis
Journal:  Nat Commun       Date:  2022-06-27       Impact factor: 17.694

4.  SIGNAL: A web-based iterative analysis platform integrating pathway and network approaches optimizes hit selection from genome-scale assays.

Authors:  Samuel Katz; Jian Song; Kyle P Webb; Nicolas W Lounsbury; Clare E Bryant; Iain D C Fraser
Journal:  Cell Syst       Date:  2021-03-24       Impact factor: 11.091

5.  Network- and systems-based re-engineering of dendritic cells with non-coding RNAs for cancer immunotherapy.

Authors:  Xin Lai; Florian S Dreyer; Martina Cantone; Martin Eberhardt; Kerstin F Gerer; Tanushree Jaitly; Steffen Uebe; Christopher Lischer; Arif Ekici; Jürgen Wittmann; Hans-Martin Jäck; Niels Schaft; Jan Dörrie; Julio Vera
Journal:  Theranostics       Date:  2021-01-01       Impact factor: 11.556

6.  Popularity and performance of bioinformatics software: the case of gene set analysis.

Authors:  Chengshu Xie; Shaurya Jauhari; Antonio Mora
Journal:  BMC Bioinformatics       Date:  2021-04-15       Impact factor: 3.169

7.  GeneWalk identifies relevant gene functions for a biological context using network representation learning.

Authors:  Robert Ietswaart; Benjamin M Gyori; John A Bachman; Peter K Sorger; L Stirling Churchman
Journal:  Genome Biol       Date:  2021-02-02       Impact factor: 13.583

8.  Per-sample standardization and asymmetric winsorization lead to accurate clustering of RNA-seq expression profiles.

Authors:  Davide Risso; Stefano Maria Pagnotta
Journal:  Bioinformatics       Date:  2021-02-09       Impact factor: 6.937

9.  ReactomeGSA - Efficient Multi-Omics Comparative Pathway Analysis.

Authors:  Johannes Griss; Guilherme Viteri; Konstantinos Sidiropoulos; Vy Nguyen; Antonio Fabregat; Henning Hermjakob
Journal:  Mol Cell Proteomics       Date:  2020-09-09       Impact factor: 7.381

10.  Multiomic Integration of Public Oncology Databases in Bioconductor.

Authors:  Marcel Ramos; Ludwig Geistlinger; Sehyun Oh; Lucas Schiffer; Rimsha Azhar; Hanish Kodali; Ino de Bruijn; Jianjiong Gao; Vincent J Carey; Martin Morgan; Levi Waldron
Journal:  JCO Clin Cancer Inform       Date:  2020-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.