Literature DB >> 24531798

Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.

Justin M Zook1, Brad Chapman2, Jason Wang3, David Mittelman4, Oliver Hofmann2, Winston Hide2, Marc Salit1.   

Abstract

Clinical adoption of human genome sequencing requires methods that output genotypes with known accuracy at millions or billions of positions across a genome. Because of substantial discordance among calls made by existing sequencing methods and algorithms, there is a need for a highly accurate set of genotypes across a genome that can be used as a benchmark. Here we present methods to make high-confidence, single-nucleotide polymorphism (SNP), indel and homozygous reference genotype calls for NA12878, the pilot genome for the Genome in a Bottle Consortium. We minimize bias toward any method by integrating and arbitrating between 14 data sets from five sequencing technologies, seven read mappers and three variant callers. We identify regions for which no confident genotype call could be made, and classify them into different categories based on reasons for uncertainty. Our genotype calls are publicly available on the Genome Comparison and Analytic Testing website to enable real-time benchmarking of any method.

Entities:  

Mesh:

Year:  2014        PMID: 24531798     DOI: 10.1038/nbt.2835

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  23 in total

1.  Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing.

Authors:  Joke Reumers; Peter De Rijk; Hui Zhao; Anthony Liekens; Dominiek Smeets; John Cleary; Peter Van Loo; Maarten Van Den Bossche; Kirsten Catthoor; Bernard Sabbe; Evelyn Despierre; Ignace Vergote; Brian Hilbush; Diether Lambrechts; Jurgen Del-Favero
Journal:  Nat Biotechnol       Date:  2011-12-18       Impact factor: 54.908

2.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

3.  First FDA authorization for next-generation sequencer.

Authors:  Francis S Collins; Margaret A Hamburg
Journal:  N Engl J Med       Date:  2013-11-19       Impact factor: 91.245

4.  Accurate and comprehensive sequencing of personal genomes.

Authors:  Subramanian S Ajay; Stephen C J Parker; Hatice Ozel Abaan; Karin V Fuentes Fajardo; Elliott H Margulies
Journal:  Genome Res       Date:  2011-07-19       Impact factor: 9.043

5.  A comprehensive catalogue of somatic mutations from a human cancer genome.

Authors:  Erin D Pleasance; R Keira Cheetham; Philip J Stephens; David J McBride; Sean J Humphray; Chris D Greenman; Ignacio Varela; Meng-Lay Lin; Gonzalo R Ordóñez; Graham R Bignell; Kai Ye; Julie Alipaz; Markus J Bauer; David Beare; Adam Butler; Richard J Carter; Lina Chen; Anthony J Cox; Sarah Edkins; Paula I Kokko-Gonzales; Niall A Gormley; Russell J Grocock; Christian D Haudenschild; Matthew M Hims; Terena James; Mingming Jia; Zoya Kingsbury; Catherine Leroy; John Marshall; Andrew Menzies; Laura J Mudie; Zemin Ning; Tom Royce; Ole B Schulz-Trieglaff; Anastassia Spiridou; Lucy A Stebbings; Lukasz Szajkowski; Jon Teague; David Williamson; Lynda Chin; Mark T Ross; Peter J Campbell; David R Bentley; P Andrew Futreal; Michael R Stratton
Journal:  Nature       Date:  2009-12-16       Impact factor: 49.962

6.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

7.  Comprehensive molecular characterization of human colon and rectal cancer.

Authors: 
Journal:  Nature       Date:  2012-07-18       Impact factor: 49.962

8.  Genomic dark matter: the reliability of short read mapping illustrated by the genome mappability score.

Authors:  Hayan Lee; Michael C Schatz
Journal:  Bioinformatics       Date:  2012-06-04       Impact factor: 6.937

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  Low concordance of multiple variant-calling pipelines: practical implications for exome and genome sequencing.

Authors:  Jason O'Rawe; Tao Jiang; Guangqing Sun; Yiyang Wu; Wei Wang; Jingchu Hu; Paul Bodily; Lifeng Tian; Hakon Hakonarson; W Evan Johnson; Zhi Wei; Kai Wang; Gholson J Lyon
Journal:  Genome Med       Date:  2013-03-27       Impact factor: 11.117

View more
  324 in total

1.  FermiKit: assembly-based variant calling for Illumina resequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2015-07-27       Impact factor: 6.937

2.  16GT: a fast and sensitive variant caller using a 16-genotype probabilistic model.

Authors:  Ruibang Luo; Michael C Schatz; Steven L Salzberg
Journal:  Gigascience       Date:  2017-07-01       Impact factor: 6.524

3.  Quality control and integration of genotypes from two calling pipelines for whole genome sequence data in the Alzheimer's disease sequencing project.

Authors:  Adam C Naj; Honghuang Lin; Badri N Vardarajan; Simon White; Daniel Lancour; Yiyi Ma; Michael Schmidt; Fangui Sun; Mariusz Butkiewicz; William S Bush; Brian W Kunkle; John Malamon; Najaf Amin; Seung Hoan Choi; Kara L Hamilton-Nelson; Sven J van der Lee; Namrata Gupta; Daniel C Koboldt; Mohamad Saad; Bowen Wang; Alejandro Q Nato; Harkirat K Sohi; Amanda Kuzma; Li-San Wang; L Adrienne Cupples; Cornelia van Duijn; Sudha Seshadri; Gerard D Schellenberg; Eric Boerwinkle; Joshua C Bis; Josée Dupuis; William J Salerno; Ellen M Wijsman; Eden R Martin; Anita L DeStefano
Journal:  Genomics       Date:  2018-05-29       Impact factor: 5.736

4.  Determining Performance Metrics for Targeted Next-Generation Sequencing Panels Using Reference Materials.

Authors:  Megan H Cleveland; Justin M Zook; Marc Salit; Peter M Vallone
Journal:  J Mol Diagn       Date:  2018-06-26       Impact factor: 5.568

5.  Identification of large rearrangements in cancer genomes with barcode linked reads.

Authors:  Li C Xia; John M Bell; Christina Wood-Bouwens; Jiamin J Chen; Nancy R Zhang; Hanlee P Ji
Journal:  Nucleic Acids Res       Date:  2018-02-28       Impact factor: 16.971

6.  Extensive sequencing of seven human genomes to characterize benchmark reference materials.

Authors:  Justin M Zook; David Catoe; Jennifer McDaniel; Lindsay Vang; Noah Spies; Arend Sidow; Ziming Weng; Yuling Liu; Christopher E Mason; Noah Alexander; Elizabeth Henaff; Alexa B R McIntyre; Dhruva Chandramohan; Feng Chen; Erich Jaeger; Ali Moshrefi; Khoa Pham; William Stedman; Tiffany Liang; Michael Saghbini; Zeljko Dzakula; Alex Hastie; Han Cao; Gintaras Deikus; Eric Schadt; Robert Sebra; Ali Bashir; Rebecca M Truty; Christopher C Chang; Natali Gulbahce; Keyan Zhao; Srinka Ghosh; Fiona Hyland; Yutao Fu; Mark Chaisson; Chunlin Xiao; Jonathan Trow; Stephen T Sherry; Alexander W Zaranek; Madeleine Ball; Jason Bobe; Preston Estep; George M Church; Patrick Marks; Sofia Kyriazopoulou-Panagiotopoulou; Grace X Y Zheng; Michael Schnall-Levin; Heather S Ordonez; Patrice A Mudivarti; Kristina Giorda; Ying Sheng; Karoline Bjarnesdatter Rypdal; Marc Salit
Journal:  Sci Data       Date:  2016-06-07       Impact factor: 6.444

7.  The road from next-generation sequencing to personalized medicine.

Authors:  Manuel L Gonzalez-Garay
Journal:  Per Med       Date:  2014       Impact factor: 2.512

8.  QVZ: lossy compression of quality values.

Authors:  Greg Malysa; Mikel Hernaez; Idoia Ochoa; Milind Rao; Karthik Ganesan; Tsachy Weissman
Journal:  Bioinformatics       Date:  2015-05-28       Impact factor: 6.937

9.  Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma.

Authors:  Saud H AlDubayan; Jake R Conway; Sabrina Y Camp; Leora Witkowski; Eric Kofman; Brendan Reardon; Seunghun Han; Nicholas Moore; Haitham Elmarakeby; Keyan Salari; Hani Choudhry; Abdullah M Al-Rubaish; Abdulsalam A Al-Sulaiman; Amein K Al-Ali; Amaro Taylor-Weiner; Eliezer M Van Allen
Journal:  JAMA       Date:  2020-11-17       Impact factor: 56.272

10.  Good laboratory practice for clinical next-generation sequencing informatics pipelines.

Authors:  Amy S Gargis; Lisa Kalman; David P Bick; Cristina da Silva; David P Dimmock; Birgit H Funke; Sivakumar Gowrisankar; Madhuri R Hegde; Shashikant Kulkarni; Christopher E Mason; Rakesh Nagarajan; Karl V Voelkerding; Elizabeth A Worthey; Nazneen Aziz; John Barnes; Sarah F Bennett; Himani Bisht; Deanna M Church; Zoya Dimitrova; Shaw R Gargis; Nabil Hafez; Tina Hambuch; Fiona C L Hyland; Ruth Ann Luna; Duncan MacCannell; Tobias Mann; Megan R McCluskey; Timothy K McDaniel; Lilia M Ganova-Raeva; Heidi L Rehm; Jeffrey Reid; David S Campo; Richard B Resnick; Perry G Ridge; Marc L Salit; Pavel Skums; Lee-Jun C Wong; Barbara A Zehnbauer; Justin M Zook; Ira M Lubin
Journal:  Nat Biotechnol       Date:  2015-07       Impact factor: 54.908

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.