| Literature DB >> 27158454 |
Elise Ruark1, Anthony Renwick1, Matthew Clarke1, Katie Snape1, Emma Ramsay1, Anna Elliott1, Sandra Hanks1, Ann Strydom1, Sheila Seal1, Nazneen Rahman1,2.
Abstract
To provide a useful community resource for orthogonal assessment of NGS analysis software, we present the ICR142 NGS validation series. The dataset includes high-quality exome sequence data from 142 samples together with Sanger sequence data at 704 sites; 416 sites with variants and 288 sites at which variants were called by an NGS analysis tool, but no variant is present in the corresponding Sanger sequence. The dataset includes 293 indel variants and 247 negative indel sites, and thus the ICR142 validation dataset is of particular utility in evaluating indel calling performance. The FASTQ files and Sanger sequence results can be accessed in the European Genome-phenome Archive under the accession number EGAS00001001332.Entities:
Keywords: NGS; Variant calling; exome; indel; next-generation sequencing; validation
Year: 2016 PMID: 27158454 PMCID: PMC4857748 DOI: 10.12688/f1000research.8219.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Description of variant sites evaluated by Sanger sequencing in the ICR142 NGS validation series.