Literature DB >> 31711268

Improved assembly and variant detection of a haploid human genome using single-molecule, high-fidelity long reads.

Mitchell R Vollger1, Glennis A Logsdon1, Peter A Audano1, Arvis Sulovari1, David Porubsky1, Paul Peluso2, Aaron M Wenger2, Gregory T Concepcion2, Zev N Kronenberg2, Katherine M Munson1, Carl Baker1, Ashley D Sanders3, Diana C J Spierings4, Peter M Lansdorp4,5,6, Urvashi Surti7, Michael W Hunkapiller2, Evan E Eichler1,8.   

Abstract

The sequence and assembly of human genomes using long-read sequencing technologies has revolutionized our understanding of structural variation and genome organization. We compared the accuracy, continuity, and gene annotation of genome assemblies generated from either high-fidelity (HiFi) or continuous long-read (CLR) datasets from the same complete hydatidiform mole human genome. We find that the HiFi sequence data assemble an additional 10% of duplicated regions and more accurately represent the structure of tandem repeats, as validated with orthogonal analyses. As a result, an additional 5 Mbp of pericentromeric sequences are recovered in the HiFi assembly, resulting in a 2.5-fold increase in the NG50 within 1 Mbp of the centromere (HiFi 480.6 kbp, CLR 191.5 kbp). Additionally, the HiFi genome assembly was generated in significantly less time with fewer computational resources than the CLR assembly. Although the HiFi assembly has significantly improved continuity and accuracy in many complex regions of the genome, it still falls short of the assembly of centromeric DNA and the largest regions of segmental duplication using existing assemblers. Despite these shortcomings, our results suggest that HiFi may be the most effective standalone technology for de novo assembly of human genomes.
© 2019 John Wiley & Sons Ltd/University College London.

Entities:  

Keywords:  genome assembly; long-read sequencing; segmental duplications; structural variation; tandem repeats

Mesh:

Substances:

Year:  2019        PMID: 31711268      PMCID: PMC7015760          DOI: 10.1111/ahg.12364

Source DB:  PubMed          Journal:  Ann Hum Genet        ISSN: 0003-4800            Impact factor:   1.670


  29 in total

1.  Gepard: a rapid and sensitive tool for creating dotplots on genome scale.

Authors:  Jan Krumsiek; Roland Arnold; Thomas Rattei
Journal:  Bioinformatics       Date:  2007-02-19       Impact factor: 6.937

2.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

3.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

4.  DNA template strand sequencing of single-cells maps genomic rearrangements at high resolution.

Authors:  Ester Falconer; Mark Hills; Ulrike Naumann; Steven S S Poon; Elizabeth A Chavez; Ashley D Sanders; Yongjun Zhao; Martin Hirst; Peter M Lansdorp
Journal:  Nat Methods       Date:  2012-10-07       Impact factor: 28.547

5.  The structure and evolution of centromeric transition regions within the human genome.

Authors:  Xinwei She; Julie E Horvath; Zhaoshi Jiang; Ge Liu; Terrence S Furey; Laurie Christ; Royden Clark; Tina Graves; Cassy L Gulden; Can Alkan; Jeff A Bailey; Cenk Sahinalp; Mariano Rocchi; David Haussler; Richard K Wilson; Webb Miller; Stuart Schwartz; Evan E Eichler
Journal:  Nature       Date:  2004-08-19       Impact factor: 49.962

6.  Resolving the complexity of the human genome using single-molecule sequencing.

Authors:  Mark J P Chaisson; John Huddleston; Megan Y Dennis; Peter H Sudmant; Maika Malig; Fereydoun Hormozdiari; Francesca Antonacci; Urvashi Surti; Richard Sandstrom; Matthew Boitano; Jane M Landolin; John A Stamatoyannopoulos; Michael W Hunkapiller; Jonas Korlach; Evan E Eichler
Journal:  Nature       Date:  2014-11-10       Impact factor: 49.962

7.  Nanopore sequencing and assembly of a human genome with ultra-long reads.

Authors:  Miten Jain; Sergey Koren; Karen H Miga; Josh Quick; Arthur C Rand; Thomas A Sasani; John R Tyson; Andrew D Beggs; Alexander T Dilthey; Ian T Fiddes; Sunir Malla; Hannah Marriott; Tom Nieto; Justin O'Grady; Hugh E Olsen; Brent S Pedersen; Arang Rhie; Hollian Richardson; Aaron R Quinlan; Terrance P Snutch; Louise Tee; Benedict Paten; Adam M Phillippy; Jared T Simpson; Nicholas J Loman; Matthew Loose
Journal:  Nat Biotechnol       Date:  2018-01-29       Impact factor: 54.908

8.  Characterizing the Major Structural Variant Alleles of the Human Genome.

Authors:  Peter A Audano; Arvis Sulovari; Tina A Graves-Lindsay; Stuart Cantsilieris; Melanie Sorensen; AnneMarie E Welch; Max L Dougherty; Bradley J Nelson; Ankeeta Shah; Susan K Dutcher; Wesley C Warren; Vincent Magrini; Sean D McGrath; Yang I Li; Richard K Wilson; Evan E Eichler
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

9.  De novo assembly of haplotype-resolved genomes with trio binning.

Authors:  Sergey Koren; Arang Rhie; Brian P Walenz; Alexander T Dilthey; Derek M Bickhart; Sarah B Kingan; Stefan Hiendleder; John L Williams; Timothy P L Smith; Adam M Phillippy
Journal:  Nat Biotechnol       Date:  2018-10-22       Impact factor: 54.908

10.  Long-read sequence assembly of the gorilla genome.

Authors:  David Gordon; John Huddleston; Mark J P Chaisson; Christopher M Hill; Zev N Kronenberg; Katherine M Munson; Maika Malig; Archana Raja; Ian Fiddes; LaDeana W Hillier; Christopher Dunn; Carl Baker; Joel Armstrong; Mark Diekhans; Benedict Paten; Jay Shendure; Richard K Wilson; David Haussler; Chen-Shan Chin; Evan E Eichler
Journal:  Science       Date:  2016-04-01       Impact factor: 47.728

View more
  34 in total

1.  Mining the gaps of chromosome 8.

Authors:  Glennis A Logsdon; Evan E Eichler
Journal:  Nature       Date:  2021-05-14       Impact factor: 49.962

2.  HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.

Authors:  Sergey Nurk; Brian P Walenz; Arang Rhie; Mitchell R Vollger; Glennis A Logsdon; Robert Grothe; Karen H Miga; Evan E Eichler; Adam M Phillippy; Sergey Koren
Journal:  Genome Res       Date:  2020-08-14       Impact factor: 9.043

Review 3.  Centromere studies in the era of 'telomere-to-telomere' genomics.

Authors:  Karen H Miga
Journal:  Exp Cell Res       Date:  2020-06-03       Impact factor: 3.905

Review 4.  Long-read human genome sequencing and its applications.

Authors:  Glennis A Logsdon; Mitchell R Vollger; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2020-06-05       Impact factor: 53.242

5.  Sex-Based Analysis of De Novo Variants in Neurodevelopmental Disorders.

Authors:  Tychele N Turner; Amy B Wilfert; Trygve E Bakken; Raphael A Bernier; Micah R Pepper; Zhancheng Zhang; Rebecca I Torene; Kyle Retterer; Evan E Eichler
Journal:  Am J Hum Genet       Date:  2019-11-27       Impact factor: 11.025

6.  Human-specific tandem repeat expansion and differential gene expression during primate evolution.

Authors:  Arvis Sulovari; Ruiyang Li; Peter A Audano; David Porubsky; Mitchell R Vollger; Glennis A Logsdon; Wesley C Warren; Alex A Pollen; Mark J P Chaisson; Evan E Eichler
Journal:  Proc Natl Acad Sci U S A       Date:  2019-10-28       Impact factor: 11.205

Review 7.  Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology.

Authors:  Leho Tedersoo; Mads Albertsen; Sten Anslan; Benjamin Callahan
Journal:  Appl Environ Microbiol       Date:  2021-08-11       Impact factor: 4.792

8.  Haplotype-resolved diverse human genomes and integrated analysis of structural variation.

Authors:  Peter Ebert; Peter A Audano; Qihui Zhu; Bernardo Rodriguez-Martin; Charles Lee; Jan O Korbel; Tobias Marschall; Evan E Eichler; David Porubsky; Marc Jan Bonder; Arvis Sulovari; Jana Ebler; Weichen Zhou; Rebecca Serra Mari; Feyza Yilmaz; Xuefang Zhao; PingHsun Hsieh; Joyce Lee; Sushant Kumar; Jiadong Lin; Tobias Rausch; Yu Chen; Jingwen Ren; Martin Santamarina; Wolfram Höps; Hufsah Ashraf; Nelson T Chuang; Xiaofei Yang; Katherine M Munson; Alexandra P Lewis; Susan Fairley; Luke J Tallon; Wayne E Clarke; Anna O Basile; Marta Byrska-Bishop; André Corvelo; Uday S Evani; Tsung-Yu Lu; Mark J P Chaisson; Junjie Chen; Chong Li; Harrison Brand; Aaron M Wenger; Maryam Ghareghani; William T Harvey; Benjamin Raeder; Patrick Hasenfeld; Allison A Regier; Haley J Abel; Ira M Hall; Paul Flicek; Oliver Stegle; Mark B Gerstein; Jose M C Tubio; Zepeng Mu; Yang I Li; Xinghua Shi; Alex R Hastie; Kai Ye; Zechen Chong; Ashley D Sanders; Michael C Zody; Michael E Talkowski; Ryan E Mills; Scott E Devine
Journal:  Science       Date:  2021-02-25       Impact factor: 47.728

Review 9.  Applying genomic and transcriptomic advances to mitochondrial medicine.

Authors:  William L Macken; Jana Vandrovcova; Michael G Hanna; Robert D S Pitceathly
Journal:  Nat Rev Neurol       Date:  2021-02-23       Impact factor: 42.937

Review 10.  Genomic Tackling of Human Satellite DNA: Breaking Barriers through Time.

Authors:  Mariana Lopes; Sandra Louzada; Margarida Gama-Carvalho; Raquel Chaves
Journal:  Int J Mol Sci       Date:  2021-04-29       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.