| Literature DB >> 29881786 |
Márton Münz1, Shazia Mahamdallie1,2, Shawn Yost1,2, Andrew Rimmer1, Emma Poyastro-Pearson1,2, Ann Strydom1,2, Sheila Seal1,2, Elise Ruark1,2, Nazneen Rahman1,2,3.
Abstract
Quality assurance and quality control are essential for robust next generation sequencing (NGS). Here we present CoverView, a fast, flexible, user-friendly quality evaluation tool for NGS data. CoverView processes mapped sequencing reads and user-specified regions to report depth of coverage, base and mapping quality metrics with increasing levels of detail from a chromosome-level summary to per-base profiles. CoverView can flag regions that do not fulfil user-specified quality requirements, allowing suboptimal data to be systematically and automatically presented for review. It also provides an interactive graphical user interface (GUI) that can be opened in a web browser and allows intuitive exploration of results. We have integrated CoverView into our accredited clinical cancer predisposition gene testing laboratory that uses the TruSight Cancer Panel (TSCP). CoverView has been invaluable for optimisation and quality control of our testing pipeline, providing transparent, consistent quality metric information and automatic flagging of regions that fall below quality thresholds. We demonstrate this utility with TSCP data from the Genome in a Bottle reference sample, which CoverView analysed in 13 seconds. CoverView uses data routinely generated by NGS pipelines, reads standard input formats, and rapidly creates easy-to-parse output text (.txt) files that are customised by a simple configuration file. CoverView can therefore be easily integrated into any NGS pipeline. CoverView and detailed documentation for its use are freely available at github.com/RahmanTeamDevelopment/CoverView/releases and www.icr.ac.uk/CoverView.Entities:
Keywords: NGS; QSM; Quality Sequencing Minimimum; base quality; depth of coverage; genetic testing; mapping quality; quality assurance; quality control
Year: 2018 PMID: 29881786 PMCID: PMC5964631 DOI: 10.12688/wellcomeopenres.14306.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Position specific metrics reported as per-base profiles in the specified genomic regions.
| Position specific metrics | Definition |
|---|---|
| Coverage (COV) | The number of mapped reads covering the position |
| Median Base Quality (MEDBQ) | Median base quality of all read bases mapping to the position |
| Fraction of Low Base Quality (FLBQ) | Fraction of read bases mapping to the position with a base quality
|
| Median Mapping Quality (MEDMQ) | Median mapping quality of all reads covering the position |
| Fraction of Low Mapping Quality (FLMQ) | Fraction of reads covering the position with a mapping quality
|
| Quality Coverage (QCOV) | Number of mapped reads covering the position with read mapping
|
Summary metrics for the specified genomic regions.
| Summary metrics of region | Definition |
|---|---|
| Read count (RC) | Total number of mapped reads overlapping the region |
| Median coverage (MEDCOV) | Median of COV values across all positions in the region |
| Minimum coverage (MINCOV) | Minimum of COV values across all positions in the region |
| Median quality coverage (MEDQCOV) | Median of QCOV values across all positions in the region |
| Minimum quality coverage (MINQCOV) | Minimum of QCOV values across all positions in the region |
| Maximum fraction of low mapping quality (MAXFLMQ) | Maximum of FLMQ values across all positions in the region |
| Maximum fraction of low base quality (MAXFLBQ) | Maximum of FLBQ values across all positions in the region |
Figure 1. CoverView GUI Genes View for TSCP analysis in GIAB sample.
CoverView GUI screenshot of the Genes View for TruSight Cancer Panel (TSCP) data generated for the Genome in a Bottle (GIAB) sample shows a bar plot of per-chromosome read counts and the list of flagged regions. Nine regions in eight genes were flagged for falling below MINQCOV ≥50 in the GIAB sample.
Figure 2. CoverView GUI Regions View for PMS2 in GIAB sample.
CoverView GUI screenshot of Regions View for PMS2 data in the GIAB sample showing the summary metrics per region. Metrics that fall below user-defined thresholds are shown in red and flagged. The MINQCOV for PMS2 exons 12 and 15 are below MINQCOV ≥50.
Figure 3. CoverView GUI Profiles View showing PMS2 exon 12 COV and QCOV data in GIAB sample.
CoverView GUI screenshot of Profiles View for PMS2 exon 12 data in the GIAB sample showing the quality coverage QCOV (blue), per-base coverage COV (red) and the minimum QCOV threshold as dashed horizontal lines, across the region. The useful (quality) coverage is only a small proportion of the total coverage.
Figure 4. CoverView GUI Profiles View showing PMS2 exon 12 QCOV and FLMQ data in GIAB sample.
CoverView GUI screenshot of Profiles View for PMS2 exon 12 data in the GIAB sample showing the quality coverage QCOV (blue), fraction of read bases with low mapping quality FMLQ (red) and the minimum QCOV threshold as dashed horizontal lines, across the region. This shows that the majority of reads mapping to this region have low mapping quality.