| Literature DB >> 28521008 |
Siwei Chen1,2,3, Juan F Beltrán1,2, Clara Esteban-Jurado4, Sebastià Franch-Expósito4, Sergi Castellví-Bel4, Steven Lipkin5, Xiaomu Wei2,5, Haiyuan Yu1,2.
Abstract
Integrative analysis of whole-genome/exome-sequencing data has been challenging, especially for the non-programming research community, as it requires simultaneously managing a large number of computational tools. Even computational biologists find it unexpectedly difficult to reproduce results from others or optimize their strategies in an end-to-end workflow. We introduce Germline Mutation Scoring Tool fOr Next-generation sEquencing data (GeMSTONE), a cloud-based variant prioritization tool with high-level customization and a comprehensive collection of bioinformatics tools and data libraries (http://gemstone.yulab.org/). GeMSTONE generates and readily accepts a shareable 'recipe' file for each run to either replicate previous results or analyze new data with identical parameters and provides a centralized workflow for prioritizing germline mutations in human disease within a streamlined workflow rather than a pool of program executions.Entities:
Mesh:
Year: 2017 PMID: 28521008 PMCID: PMC5556704 DOI: 10.1093/nar/gkx398
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.GeMSTONE pipeline overview. The schematic represents the GeMSTONE's central analysis pipeline. The fundamental backbone filter cascade can be seen in blue, prioritizing rare and putatively damaging variants and genes. Different libraries are grouped in orange, used in annotation or filtering steps throughout the workflow as indicated.
Figure 2.Recapitulation of a published colorectal cancer (CRC) study. As a proof-of-concept case study, GeMSTONE recapitulated every step in the original Colorectal-cancer prioritization workflow1, rescuing 27 out of 28 candidate variants from the ∼30,000 variants in the raw whole exome sequencing dataset and hitting all hereditary CRC and CRC GWAS variants.
Figure 3.Heatmap comparison of GeMSTONE and other variant prioritization tools. This heatmap compares with other tools that have similar objectives on the aspects of (A) raw data inputs and prioritization, (B) knowledge-based annotation from external data resources and libraries, (C) inheritance models for co-segregation analysis and (D) strategy of reproducibility. Each row represents a different tool, while each column represents a specific feature. Dark blue indicates that a tool has similar capacity for a specific function while light blue indicates that a tool has a similar feature but with less powerful functionality than GeMSTONE (see Discussion).