| Literature DB >> 36035404 |
Xiaowei Huang1, Xuanyi Lu1, Chengshu Xie1, Shaurya Jauhari1, Zihong Xie1, Songqing Mei2, Antonio Mora1.
Abstract
Gene Set Analysis (GSA) is one of the most commonly used strategies to analyze omics data. Hundreds of GSA-related papers have been published, giving birth to a GSA field in Bioinformatics studies. However, as the field grows, it is becoming more difficult to obtain a clear view of all available methods, resources, and their quality. In this paper, we introduce a web platform called "GSA Central" which, as its name indicates, acts as a focal point to centralize GSA information and tools useful to beginners, average users, and experts in the GSA field. "GSA Central" contains five different resources: A Galaxy instance containing GSA tools ("Galaxy-GSA"), a portal to educational material ("GSA Classroom"), a comprehensive database of articles ("GSARefDB"), a set of benchmarking tools ("GSA BenchmarKING"), and a blog ("GSA Blog"). We expect that "GSA Central" will become a useful resource for users looking for introductory learning, state-of-the-art updates, method/tool selection guidelines and insights, tool usage, tool integration under a Galaxy environment, tool design, and tool validation/benchmarking. Moreover, we expect this kind of platform to become an example of a "thematic platform" containing all the resources that people in the field might need, an approach that could be extended to other bioinformatics topics or scientific fields.Entities:
Keywords: benchmarking; database; education; galaxy; gene set analysis; pathway analysis; web platform
Year: 2022 PMID: 36035404 PMCID: PMC9403262 DOI: 10.3389/fmed.2022.965908
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
FIGURE 1GSA Central architecture.
A list of current servers and individual tools for Galaxy-GSA.
| Tool name | Tool type | Location | |
| 1 | Galaxy-GSA | WEB |
|
| 2 | Galaxy-GSA | Docker |
|
| 3 | Galaxy-GSA | Virtual machine |
|
| 4 | Gene set uploader | Data uploader | Toolshed |
| 5 | ReactomePA | ORA | Toolshed |
| 6 | SPIA | PT | Toolshed |
| 7 | PLAGE | SS | Toolshed |
| 8 | ZSCORE | SS | Toolshed |
| 9 | SSGSEA | SS | Toolshed |
| 10 | GSVA | SS | Toolshed |
| 11 | ChIPEnrich | GR | Toolshed |
| 12 | PolyEnrich | GR | Toolshed |
| 13 | BroadEnrich | GR | Toolshed |
| 14 | methylGSA | GR | Toolshed |
| 15 | Mogsa | INTEG | Toolshed |
| 16 | WW | FCS | Toolshed |
| 17 | KS | FCS | Toolshed |
| 18 | Agg-F | FCS | Toolshed |
| 19 | GSNCA | FCS | Toolshed |
WEB, Web platform; Docker, Docker image; VM, Virtual Machine; ORA, Over-representation analysis; FCS, Functional-class scoring; PT, Pathway topology-based; SS, Single-sample; GR, Genomic Region; INTEG, Integra-tive; Toolshed, Galaxy toolshed (https://galaxyproject.org/toolshed/ or https://toolshed.g2.bx.psu.edu/).
FIGURE 2Galaxy-GSA screenshot. (A) All installed Galaxy tools (including all Galaxy-GSA tools). (B) Input boxes for the currently open tool (here, ReactomePA). (C) Help for the currently open tool. (D) History (including intermediate and final results).
FIGURE 3GSA Classroom screenshot. In this example, we searched for the keyword “DAVID” and found all online videos where it appears either in the title or in the keyword columns.
Statistics of GSARefDB v.2.0.
| General methods and tools | Reviews and benchmarks | Genomic GSA | ncRNA GSA | MS-based GSA | Meta-omics GSA | Integromics GSA | Total | |
| Number of papers | 386 | 85 | 65 | 33 | 29 | 23 | 20 | 641 |
| Number of citations | 131,332 | 21,727 | 9,657 | 4,329 | 8,656 | 17,985 | 1,065 | 194,751 |
FIGURE 4GSARefDB screenshot. (A) Main Menu: The user can choose between the database of general methods and tools, the database of reviews and benchmarks, the specialized databases (genomic GSA, nc-RNA GSA, MS-based GSA, meta-omics GSA, and integromics GSA), general GSARefDB statistics, the archive, and FAQs. (B) The search box: Here, we search for papers with the keyword “network”. (C) The columns with all the information we can obtain from each paper: ID, publication year, tool name, first author, paper title, DOI, number of citations, type of GSA, programming language, and website info.
FIGURE 5GSA BenchmarKING screenshot. (A) A jupyter notebook with an R workflow for the benchmark of six different single-sample GSA tools using datasets related to respiratory disease. (B) A shiny app to benchmark five different single-sample GSA tools using different gold standards, target pathways, and benchmark metrics.