Literature DB >> 31309221

VariantQC: a visual quality control report for variant evaluation.

Melissa Y Yan1, Betsy Ferguson1,2,3, Benjamin N Bimber1,4.   

Abstract

SUMMARY: Large scale genomic studies produce millions of sequence variants, generating datasets far too massive for manual inspection. To ensure variant and genotype data are consistent and accurate, it is necessary to evaluate variants prior to downstream analysis using quality control (QC) reports. Variant call format (VCF) files are the standard format for representing variant data; however, generating summary statistics from these files is not always straightforward. While tools to summarize variant data exist, they generally produce simple text file tables, which still require additional processing and interpretation. VariantQC fills this gap as a user friendly, interactive visual QC report that generates and concisely summarizes statistics from VCF files. The report aggregates and summarizes variants by dataset, chromosome, sample and filter type. The VariantQC report is useful for high-level dataset summary, quality control and helps flag outliers. Furthermore, VariantQC operates on VCF files, so it can be easily integrated into many existing variant pipelines.
AVAILABILITY AND IMPLEMENTATION: DISCVRSeq's VariantQC tool is freely available as a Java program, with the compiled JAR and source code available from https://github.com/BimberLab/DISCVRSeq/. Documentation and example reports are available at https://bimberlab.github.io/DISCVRSeq/.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2019        PMID: 31309221      PMCID: PMC7963085          DOI: 10.1093/bioinformatics/btz560

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

Review 1.  Human genome sequencing in health and disease.

Authors:  Claudia Gonzaga-Jauregui; James R Lupski; Richard A Gibbs
Journal:  Annu Rev Med       Date:  2012       Impact factor: 13.739

2.  From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.

Authors:  Geraldine A Van der Auwera; Mauricio O Carneiro; Christopher Hartl; Ryan Poplin; Guillermo Del Angel; Ami Levy-Moonshine; Tadeusz Jordan; Khalid Shakir; David Roazen; Joel Thibault; Eric Banks; Kiran V Garimella; David Altshuler; Stacey Gabriel; Mark A DePristo
Journal:  Curr Protoc Bioinformatics       Date:  2013

Review 3.  Genotype and SNP calling from next-generation sequencing data.

Authors:  Rasmus Nielsen; Joshua S Paul; Anders Albrechtsen; Yun S Song
Journal:  Nat Rev Genet       Date:  2011-06       Impact factor: 53.242

4.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

5.  MultiQC: summarize analysis results for multiple tools and samples in a single report.

Authors:  Philip Ewels; Måns Magnusson; Sverker Lundin; Max Käller
Journal:  Bioinformatics       Date:  2016-06-16       Impact factor: 6.937

6.  Effective filtering strategies to improve data quality from population-based whole exome sequencing studies.

Authors:  Andrew R Carson; Erin N Smith; Hiroko Matsui; Sigrid K Brækkan; Kristen Jepsen; John-Bjarne Hansen; Kelly A Frazer
Journal:  BMC Bioinformatics       Date:  2014-05-02       Impact factor: 3.169

  6 in total
  1 in total

1.  A quality control portal for sequencing data deposited at the European genome-phenome archive.

Authors:  Dietmar Fernández-Orth; Manuel Rueda; Babita Singh; Mauricio Moldes; Aina Jene; Marta Ferri; Claudia Vasallo; Lauren A Fromont; Arcadi Navarro; Jordi Rambla
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.