Literature DB >> 32686750

Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes.

Kishwar Shafin1, Trevor Pesout1, Ryan Lorig-Roach1, Marina Haukness1, Hugh E Olsen1, Colleen Bosworth1, Joel Armstrong1, Kristof Tigyi1,2, Nicholas Maurer1, Sergey Koren3, Fritz J Sedlazeck4, Tobias Marschall5, Simon Mayes6, Vania Costa6, Justin M Zook7, Kelvin J Liu8, Duncan Kilburn8, Melanie Sorensen9, Katy M Munson9, Mitchell R Vollger9, Jean Monlong1, Erik Garrison1, Evan E Eichler2,9, Sofie Salama1,2, David Haussler1,2, Richard E Green1, Mark Akeson1, Adam Phillippy3, Karen H Miga1, Paolo Carnevali10, Miten Jain11, Benedict Paten12.   

Abstract

De novo assembly of a human genome using nanopore long-read sequences has been reported, but it used more than 150,000 CPU hours and weeks of wall-clock time. To enable rapid human genome assembly, we present Shasta, a de novo long-read assembler, and polishing algorithms named MarginPolish and HELEN. Using a single PromethION nanopore sequencer and our toolkit, we assembled 11 highly contiguous human genomes de novo in 9 d. We achieved roughly 63× coverage, 42-kb read N50 values and 6.5× coverage in reads >100 kb using three flow cells per sample. Shasta produced a complete haploid human genome assembly in under 6 h on a single commercial compute node. MarginPolish and HELEN polished haploid assemblies to more than 99.9% identity (Phred quality score QV = 30) with nanopore reads alone. Addition of proximity-ligation sequencing enabled near chromosome-level scaffolds for all 11 genomes. We compare our assembly performance to existing methods for diploid, haploid and trio-binned human samples and report superior accuracy and speed.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32686750      PMCID: PMC7483855          DOI: 10.1038/s41587-020-0503-6

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  46 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Hi-C: a comprehensive technique to capture the conformation of genomes.

Authors:  Jon-Matthew Belton; Rachel Patton McCord; Johan Harmen Gibcus; Natalia Naumova; Ye Zhan; Job Dekker
Journal:  Methods       Date:  2012-05-29       Impact factor: 3.608

3.  Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.

Authors:  Keith R Bradnam; Joseph N Fass; Anton Alexandrov; Paul Baranay; Michael Bechner; Inanç Birol; Sébastien Boisvert; Jarrod A Chapman; Guillaume Chapuis; Rayan Chikhi; Hamidreza Chitsaz; Wen-Chi Chou; Jacques Corbeil; Cristian Del Fabbro; T Roderick Docking; Richard Durbin; Dent Earl; Scott Emrich; Pavel Fedotov; Nuno A Fonseca; Ganeshkumar Ganapathy; Richard A Gibbs; Sante Gnerre; Elénie Godzaridis; Steve Goldstein; Matthias Haimel; Giles Hall; David Haussler; Joseph B Hiatt; Isaac Y Ho; Jason Howard; Martin Hunt; Shaun D Jackman; David B Jaffe; Erich D Jarvis; Huaiyang Jiang; Sergey Kazakov; Paul J Kersey; Jacob O Kitzman; James R Knight; Sergey Koren; Tak-Wah Lam; Dominique Lavenier; François Laviolette; Yingrui Li; Zhenyu Li; Binghang Liu; Yue Liu; Ruibang Luo; Iain Maccallum; Matthew D Macmanes; Nicolas Maillet; Sergey Melnikov; Delphine Naquin; Zemin Ning; Thomas D Otto; Benedict Paten; Octávio S Paulo; Adam M Phillippy; Francisco Pina-Martins; Michael Place; Dariusz Przybylski; Xiang Qin; Carson Qu; Filipe J Ribeiro; Stephen Richards; Daniel S Rokhsar; J Graham Ruby; Simone Scalabrin; Michael C Schatz; David C Schwartz; Alexey Sergushichev; Ted Sharpe; Timothy I Shaw; Jay Shendure; Yujian Shi; Jared T Simpson; Henry Song; Fedor Tsarev; Francesco Vezzi; Riccardo Vicedomini; Bruno M Vieira; Jun Wang; Kim C Worley; Shuangye Yin; Siu-Ming Yiu; Jianying Yuan; Guojie Zhang; Hao Zhang; Shiguo Zhou; Ian F Korf
Journal:  Gigascience       Date:  2013-07-22       Impact factor: 6.524

Review 4.  Strand-seq: a unifying tool for studies of chromosome segregation.

Authors:  Ester Falconer; Peter M Lansdorp
Journal:  Semin Cell Dev Biol       Date:  2013-05-08       Impact factor: 7.727

5.  A universal SNP and small-indel variant caller using deep neural networks.

Authors:  Ryan Poplin; Pi-Chuan Chang; David Alexander; Scott Schwartz; Thomas Colthurst; Alexander Ku; Dan Newburger; Jojo Dijamco; Nam Nguyen; Pegah T Afshar; Sam S Gross; Lizzie Dorfman; Cory Y McLean; Mark A DePristo
Journal:  Nat Biotechnol       Date:  2018-09-24       Impact factor: 54.908

6.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

Review 7.  Genome structural variation discovery and genotyping.

Authors:  Can Alkan; Bradley P Coe; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2011-03-01       Impact factor: 53.242

8.  Improved data analysis for the MinION nanopore sequencer.

Authors:  Miten Jain; Ian T Fiddes; Karen H Miga; Hugh E Olsen; Benedict Paten; Mark Akeson
Journal:  Nat Methods       Date:  2015-02-16       Impact factor: 28.547

9.  Direct determination of diploid genome sequences.

Authors:  Neil I Weisenfeld; Vijay Kumar; Preyas Shah; Deanna M Church; David B Jaffe
Journal:  Genome Res       Date:  2017-04-05       Impact factor: 9.043

10.  An open resource for accurately benchmarking small variant and reference calls.

Authors:  Justin M Zook; Jennifer McDaniel; Nathan D Olson; Justin Wagner; Hemang Parikh; Haynes Heaton; Sean A Irvine; Len Trigg; Rebecca Truty; Cory Y McLean; Francisco M De La Vega; Chunlin Xiao; Stephen Sherry; Marc Salit
Journal:  Nat Biotechnol       Date:  2019-04-01       Impact factor: 54.908

View more
  81 in total

1.  Weighted minimizer sampling improves long read mapping.

Authors:  Chirag Jain; Arang Rhie; Haowen Zhang; Claudia Chu; Brian P Walenz; Sergey Koren; Adam M Phillippy
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

2.  HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads.

Authors:  Sergey Nurk; Brian P Walenz; Arang Rhie; Mitchell R Vollger; Glennis A Logsdon; Robert Grothe; Karen H Miga; Evan E Eichler; Adam M Phillippy; Sergey Koren
Journal:  Genome Res       Date:  2020-08-14       Impact factor: 9.043

Review 3.  Long-read human genome sequencing and its applications.

Authors:  Glennis A Logsdon; Mitchell R Vollger; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2020-06-05       Impact factor: 53.242

4.  Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.

Authors:  Timofey Prodanov; Vikas Bansal
Journal:  Nucleic Acids Res       Date:  2020-11-04       Impact factor: 16.971

5.  High-quality carnivoran genomes from roadkill samples enable comparative species delineation in aardwolf and bat-eared fox.

Authors:  Rémi Allio; Marie-Ka Tilak; Celine Scornavacca; Nico L Avenant; Andrew C Kitchener; Erwan Corre; Benoit Nabholz; Frédéric Delsuc
Journal:  Elife       Date:  2021-02-18       Impact factor: 8.140

Review 6.  Advances in Genomic Discovery and Implications for Personalized Prevention and Medicine: Estonia as Example.

Authors:  Bram Peter Prins; Liis Leitsalu; Katri Pärna; Krista Fischer; Andres Metspalu; Toomas Haller; Harold Snieder
Journal:  J Pers Med       Date:  2021-04-29

7.  Haplotype diversity and sequence heterogeneity of human telomeres.

Authors:  Kirill Grigorev; Jonathan Foox; Daniela Bezdan; Daniel Butler; Jared J Luxton; Jake Reed; Miles J McKenna; Lynn Taylor; Kerry A George; Cem Meydan; Susan M Bailey; Christopher E Mason
Journal:  Genome Res       Date:  2021-06-23       Impact factor: 9.043

8.  Progressive Cactus is a multiple-genome aligner for the thousand-genome era.

Authors:  Joel Armstrong; Glenn Hickey; Mark Diekhans; Ian T Fiddes; Adam M Novak; Alden Deran; Qi Fang; Duo Xie; Shaohong Feng; Josefin Stiller; Diane Genereux; Jeremy Johnson; Voichita Dana Marinescu; Jessica Alföldi; Robert S Harris; Kerstin Lindblad-Toh; David Haussler; Elinor Karlsson; Erich D Jarvis; Guojie Zhang; Benedict Paten
Journal:  Nature       Date:  2020-11-11       Impact factor: 49.962

9.  Overcoming uncollapsed haplotypes in long-read assemblies of non-model organisms.

Authors:  Nadège Guiglielmoni; Antoine Houtain; Alessandro Derzelle; Karine Van Doninck; Jean-François Flot
Journal:  BMC Bioinformatics       Date:  2021-06-05       Impact factor: 3.169

Review 10.  Towards population-scale long-read sequencing.

Authors:  Wouter De Coster; Matthias H Weissensteiner; Fritz J Sedlazeck
Journal:  Nat Rev Genet       Date:  2021-05-28       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.