Literature DB >> 27271295

Extensive sequencing of seven human genomes to characterize benchmark reference materials.

Justin M Zook1, David Catoe1, Jennifer McDaniel1, Lindsay Vang1, Noah Spies1,2, Arend Sidow2, Ziming Weng2, Yuling Liu2, Christopher E Mason3, Noah Alexander3, Elizabeth Henaff3, Alexa B R McIntyre3, Dhruva Chandramohan3, Feng Chen4, Erich Jaeger4, Ali Moshrefi4, Khoa Pham5, William Stedman5, Tiffany Liang5, Michael Saghbini5, Zeljko Dzakula5, Alex Hastie5, Han Cao5, Gintaras Deikus6, Eric Schadt6, Robert Sebra6, Ali Bashir6, Rebecca M Truty7, Christopher C Chang7, Natali Gulbahce7, Keyan Zhao8, Srinka Ghosh8, Fiona Hyland8, Yutao Fu8, Mark Chaisson9, Chunlin Xiao10, Jonathan Trow10, Stephen T Sherry10, Alexander W Zaranek11, Madeleine Ball11, Jason Bobe6,11, Preston Estep11,12, George M Church11,12, Patrick Marks13, Sofia Kyriazopoulou-Panagiotopoulou13, Grace X Y Zheng13, Michael Schnall-Levin13, Heather S Ordonez13, Patrice A Mudivarti13, Kristina Giorda13, Ying Sheng14, Karoline Bjarnesdatter Rypdal14, Marc Salit1,2.   

Abstract

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.

Entities:  

Mesh:

Year:  2016        PMID: 27271295      PMCID: PMC4896128          DOI: 10.1038/sdata.2016.25

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


  11 in total

1.  Computational techniques for human genome resequencing using mated gapped reads.

Authors:  Paolo Carnevali; Jonathan Baccash; Aaron L Halpern; Igor Nazarenko; Geoffrey B Nilsen; Krishna P Pant; Jessica C Ebert; Anushka Brownley; Matt Morenzoni; Vitali Karpinchyk; Bruce Martin; Dennis G Ballinger; Radoje Drmanac
Journal:  J Comput Biol       Date:  2011-12-16       Impact factor: 1.479

2.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

3.  Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.

Authors:  Justin M Zook; Brad Chapman; Jason Wang; David Mittelman; Oliver Hofmann; Winston Hide; Marc Salit
Journal:  Nat Biotechnol       Date:  2014-02-16       Impact factor: 54.908

4.  Highly efficient DNA synthesis by the phage phi 29 DNA polymerase. Symmetrical mode of DNA replication.

Authors:  L Blanco; A Bernad; J M Lázaro; G Martín; C Garmendia; M Salas
Journal:  J Biol Chem       Date:  1989-05-25       Impact factor: 5.157

5.  Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays.

Authors:  Radoje Drmanac; Andrew B Sparks; Matthew J Callow; Aaron L Halpern; Norman L Burns; Bahram G Kermani; Paolo Carnevali; Igor Nazarenko; Geoffrey B Nilsen; George Yeung; Fredrik Dahl; Andres Fernandez; Bryan Staker; Krishna P Pant; Jonathan Baccash; Adam P Borcherding; Anushka Brownley; Ryan Cedeno; Linsu Chen; Dan Chernikoff; Alex Cheung; Razvan Chirita; Benjamin Curson; Jessica C Ebert; Coleen R Hacker; Robert Hartlage; Brian Hauser; Steve Huang; Yuan Jiang; Vitali Karpinchyk; Mark Koenig; Calvin Kong; Tom Landers; Catherine Le; Jia Liu; Celeste E McBride; Matt Morenzoni; Robert E Morey; Karl Mutch; Helena Perazich; Kimberly Perry; Brock A Peters; Joe Peterson; Charit L Pethiyagoda; Kaliprasad Pothuraju; Claudia Richter; Abraham M Rosenbaum; Shaunak Roy; Jay Shafto; Uladzislau Sharanhovich; Karen W Shannon; Conrad G Sheppy; Michel Sun; Joseph V Thakuria; Anne Tran; Dylan Vu; Alexander Wait Zaranek; Xiaodi Wu; Snezana Drmanac; Arnold R Oliphant; William C Banyai; Bruce Martin; Dennis G Ballinger; George M Church; Clifford A Reid
Journal:  Science       Date:  2009-11-05       Impact factor: 47.728

6.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

7.  Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells.

Authors:  Brock A Peters; Bahram G Kermani; Andrew B Sparks; Oleg Alferov; Peter Hong; Andrei Alexeev; Yuan Jiang; Fredrik Dahl; Y Tom Tang; Juergen Haas; Kimberly Robasky; Alexander Wait Zaranek; Je-Hyuk Lee; Madeleine Price Ball; Joseph E Peterson; Helena Perazich; George Yeung; Jia Liu; Linsu Chen; Michael I Kennemer; Kaliprasad Pothuraju; Karel Konvicka; Mike Tsoupko-Sitnikov; Krishna P Pant; Jessica C Ebert; Geoffrey B Nilsen; Jonathan Baccash; Aaron L Halpern; George M Church; Radoje Drmanac
Journal:  Nature       Date:  2012-07-11       Impact factor: 49.962

8.  SAMBLASTER: fast duplicate marking and structural variant read extraction.

Authors:  Gregory G Faust; Ira M Hall
Journal:  Bioinformatics       Date:  2014-05-07       Impact factor: 6.937

9.  Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology.

Authors:  Hongzhi Cao; Alex R Hastie; Dandan Cao; Ernest T Lam; Yuhui Sun; Haodong Huang; Xiao Liu; Liya Lin; Warren Andrews; Saki Chan; Shujia Huang; Xin Tong; Michael Requa; Thomas Anantharaman; Anders Krogh; Huanming Yang; Han Cao; Xun Xu
Journal:  Gigascience       Date:  2014-12-30       Impact factor: 6.524

10.  svclassify: a method to establish benchmark structural variant calls.

Authors:  Hemang Parikh; Marghoob Mohiyuddin; Hugo Y K Lam; Hariharan Iyer; Desu Chen; Mark Pratt; Gabor Bartha; Noah Spies; Wolfgang Losert; Justin M Zook; Marc Salit
Journal:  BMC Genomics       Date:  2016-01-16       Impact factor: 3.969

View more
  205 in total

1.  Assessment of human diploid genome assembly with 10x Linked-Reads data.

Authors:  Lu Zhang; Xin Zhou; Ziming Weng; Arend Sidow
Journal:  Gigascience       Date:  2019-11-01       Impact factor: 6.524

2.  Discovery and quality analysis of a comprehensive set of structural variants and short tandem repeats.

Authors:  David Jakubosky; Erin N Smith; Matteo D'Antonio; Marc Jan Bonder; William W Young Greenwald; Agnieszka D'Antonio-Chronowska; Hiroko Matsui; Oliver Stegle; Stephen B Montgomery; Christopher DeBoever; Kelly A Frazer
Journal:  Nat Commun       Date:  2020-06-10       Impact factor: 14.919

3.  Determining Performance Metrics for Targeted Next-Generation Sequencing Panels Using Reference Materials.

Authors:  Megan H Cleveland; Justin M Zook; Marc Salit; Peter M Vallone
Journal:  J Mol Diagn       Date:  2018-06-26       Impact factor: 5.568

4.  MsPAC: a tool for haplotype-phased structural variant detection.

Authors:  Oscar L Rodriguez; Anna Ritz; Andrew J Sharp; Ali Bashir
Journal:  Bioinformatics       Date:  2020-02-01       Impact factor: 6.937

Review 5.  Reference standards for next-generation sequencing.

Authors:  Simon A Hardwick; Ira W Deveson; Tim R Mercer
Journal:  Nat Rev Genet       Date:  2017-06-19       Impact factor: 53.242

6.  Deep sequencing of 10,000 human genomes.

Authors:  Amalio Telenti; Levi C T Pierce; William H Biggs; Julia di Iulio; Emily H M Wong; Martin M Fabani; Ewen F Kirkness; Ahmed Moustafa; Naisha Shah; Chao Xie; Suzanne C Brewerton; Nadeem Bulsara; Chad Garner; Gary Metzker; Efren Sandoval; Brad A Perkins; Franz J Och; Yaron Turpaz; J Craig Venter
Journal:  Proc Natl Acad Sci U S A       Date:  2016-10-04       Impact factor: 11.205

7.  Statistical Binning for Barcoded Reads Improves Downstream Analyses.

Authors:  Ariya Shajii; Ibrahim Numanagić; Christopher Whelan; Bonnie Berger
Journal:  Cell Syst       Date:  2018-08-22       Impact factor: 10.304

8.  TranSurVeyor: an improved database-free algorithm for finding non-reference transpositions in high-throughput sequencing data.

Authors:  Ramesh Rajaby; Wing-Kin Sung
Journal:  Nucleic Acids Res       Date:  2018-11-16       Impact factor: 16.971

9.  Biocompute Objects-A Step towards Evaluation and Validation of Biomedical Scientific Computations.

Authors:  Vahan Simonyan; Jeremy Goecks; Raja Mazumder
Journal:  PDA J Pharm Sci Technol       Date:  2016-12-14

10.  FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods.

Authors:  Timothy Becker; Wan-Ping Lee; Joseph Leone; Qihui Zhu; Chengsheng Zhang; Silvia Liu; Jack Sargent; Kritika Shanker; Adam Mil-Homens; Eliza Cerveira; Mallory Ryan; Jane Cha; Fabio C P Navarro; Timur Galeev; Mark Gerstein; Ryan E Mills; Dong-Guk Shin; Charles Lee; Ankit Malhotra
Journal:  Genome Biol       Date:  2018-03-20       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.