| Literature DB >> 32541955 |
Justin M Zook1, Nancy F Hansen2, Nathan D Olson3, Lesley Chapman3, James C Mullikin2, Chunlin Xiao4, Stephen Sherry4, Sergey Koren2, Adam M Phillippy2, Paul C Boutros5, Sayed Mohammad E Sahraeian6, Vincent Huang7, Alexandre Rouette8, Noah Alexander9, Christopher E Mason10,11,12,13, Iman Hajirasouliha10, Camir Ricketts10, Joyce Lee14, Rick Tearle15, Ian T Fiddes16, Alvaro Martinez Barrio16, Jeremiah Wala17, Andrew Carroll18, Noushin Ghaffari19, Oscar L Rodriguez20, Ali Bashir20, Shaun Jackman21, John J Farrell22, Aaron M Wenger23, Can Alkan24, Arda Soylev25, Michael C Schatz26, Shilpa Garg27, George Church27, Tobias Marschall28, Ken Chen29, Xian Fan30, Adam C English31, Jeffrey A Rosenfeld32,33, Weichen Zhou34, Ryan E Mills34, Jay M Sage35, Jennifer R Davis35, Michael D Kaiser35, John S Oliver35, Anthony P Catalano35, Mark J P Chaisson36, Noah Spies37, Fritz J Sedlazeck38, Marc Salit37.
Abstract
New technologies and analysis methods are enabling genomic structural variants (SVs) to be detected with ever-increasing accuracy, resolution and comprehensiveness. To help translate these methods to routine research and clinical practice, we developed a sequence-resolved benchmark set for identification of both false-negative and false-positive germline large insertions and deletions. To create this benchmark for a broadly consented son in a Personal Genome Project trio with broadly available cells and DNA, the Genome in a Bottle Consortium integrated 19 sequence-resolved variant calling methods from diverse technologies. The final benchmark set contains 12,745 isolated, sequence-resolved insertion (7,281) and deletion (5,464) calls ≥50 base pairs (bp). The Tier 1 benchmark regions, for which any extra calls are putative false positives, cover 2.51 Gbp and 5,262 insertions and 4,095 deletions supported by ≥1 diploid assembly. We demonstrate that the benchmark set reliably identifies false negatives and false positives in high-quality SV callsets from short-, linked- and long-read sequencing and optical mapping.Entities:
Year: 2020 PMID: 32541955 DOI: 10.1038/s41587-020-0538-8
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908