Literature DB >> 31984131

Benchmarking of long-read assemblers for prokaryote whole genome sequencing.

Ryan R Wick1, Kathryn E Holt1,2.   

Abstract

Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific Biosciences) allow for most prokaryote genomes to be completely assembled - one contig per chromosome or plasmid. However, the high per-read error rate of long-read sequencing necessitates different approaches to assembly than those used for short-read sequencing. Multiple assembly tools (assemblers) exist, which use a variety of algorithms for long-read assembly.
Methods:  We used 500 simulated read sets and 120 real read sets to assess the performance of eight long-read assemblers (Canu, Flye, Miniasm/Minipolish, NECAT, NextDenovo/NextPolish, Raven, Redbean and Shasta) across a wide variety of genomes and read parameters. Assemblies were assessed on their structural accuracy/completeness, sequence identity, contig circularisation and computational resources used.
Results:  Canu v2.1 produced reliable assemblies and was good with plasmids, but it performed poorly with circularisation and had the longest runtimes of all assemblers tested. Flye v2.8 was also reliable and made the smallest sequence errors, though it used the most RAM. Miniasm/Minipolish v0.3/v0.1.3 was the most likely to produce clean contig circularisation. NECAT v20200803 was reliable and good at circularisation but tended to make larger sequence errors. NextDenovo/NextPolish v2.3.1/v1.3.1 was reliable with chromosome assembly but bad with plasmid assembly. Raven v1.3.0 was reliable for chromosome assembly, though it did not perform well on small plasmids and had circularisation issues. Redbean v2.5 and Shasta v0.7.0 were computationally efficient but more likely to produce incomplete assemblies. Conclusions: Of the assemblers tested, Flye, Miniasm/Minipolish, NextDenovo/NextPolish and Raven performed best overall. However, no single tool performed well on all metrics, highlighting the need for continued development on long-read assembly algorithms. Copyright:
© 2021 Wick RR and Holt KE.

Entities:  

Keywords:  Assembly; Oxford Nanopore Technologies; Pacific Biosciences; benchmarking; long-read sequencing; microbial genomics

Year:  2019        PMID: 31984131      PMCID: PMC6966772.4          DOI: 10.12688/f1000research.21782.4

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  45 in total

1.  Identification and Characterization of Mycobacterial Species Using Whole-Genome Sequences.

Authors:  Marco A Riojas; Andrew M Frank; Samuel R Greenfield; Stephen P King; Conor J Meehan; Michael Strong; Alice R Wattam; Manzour Hernando Hazbón
Journal:  Methods Mol Biol       Date:  2021

2.  Long-read Sequencing Data Reveals Dynamic Evolution of Mitochondrial Genome Size and the Phylogenetic Utility of Mitochondrial DNA in Hercules Beetles (Dynastes; Scarabaeidae).

Authors:  Brett Morgan; Tzi-Yuan Wang; Yi-Zhen Chen; Victor Moctezuma; Oscar Burgos; My Hanh Le; Jen-Pan Huang
Journal:  Genome Biol Evol       Date:  2022-10-07       Impact factor: 4.065

3.  The clove (Syzygium aromaticum) genome provides insights into the eugenol biosynthesis pathway.

Authors:  Sonia Ouadi; Nicolas Sierro; Simon Goepfert; Lucien Bovet; Gaetan Glauser; Armelle Vallat; Manuel C Peitsch; Felix Kessler; Nikolai V Ivanov
Journal:  Commun Biol       Date:  2022-07-09

4.  Comparative genomics of the Western Hemisphere soft tick-borne relapsing fever borreliae highlights extensive plasmid diversity.

Authors:  Alexander R Kneubehl; Aparna Krishnavajhala; Sebastián Muñoz Leal; Adam J Replogle; Luke C Kingry; Sergio E Bermúdez; Marcelo B Labruna; Job E Lopez
Journal:  BMC Genomics       Date:  2022-05-31       Impact factor: 4.547

5.  Complete Genome Sequence of a Pseudomonas simiae Strain with Biocontrol Potential against Aphanomyces Root Rot.

Authors:  Ashebir T Godebo; Keith D MacKenzie; Fran L Walley; James J Germida; Christopher K Yost
Journal:  Microbiol Resour Announc       Date:  2021-05-06

6.  LoReTTA, a user-friendly tool for assembling viral genomes from PacBio sequence data.

Authors:  Ahmed Al Qaffas; Jenna Nichols; Andrew J Davison; Amine Ourahmane; Laura Hertel; Michael A McVoy; Salvatore Camiolo
Journal:  Virus Evol       Date:  2021-04-23

7.  Genomic epidemiology of Escherichia coli isolates from a tertiary referral center in Lilongwe, Malawi.

Authors:  Gerald Tegha; Emily J Ciccone; Robert Krysiak; James Kaphatika; Tarsizio Chikaonda; Isaac Ndhlovu; David van Duin; Irving Hoffman; Jonathan J Juliano; Jeremy Wang
Journal:  Microb Genom       Date:  2021-01

8.  Complete microbial genomes for public health in Australia and the Southwest Pacific.

Authors:  Sarah L Baines; Anders Gonçalves da Silva; Glen P Carter; Amy Jennison; Irani Rathnayake; Rikki M Graham; Vitali Sintchenko; Qinning Wang; Rebecca J Rockett; Verlaine J Timms; Elena Martinez; Susan Ballard; Takehiro Tomita; Nicole Isles; Kristy A Horan; William Pitchers; Timothy P Stinear; Deborah A Williamson; Benjamin P Howden; Torsten Seemann
Journal:  Microb Genom       Date:  2020-11-12

9.  MicroPIPE: validating an end-to-end workflow for high-quality complete bacterial genome construction.

Authors:  Valentine Murigneux; Leah W Roberts; Brian M Forde; Minh-Duy Phan; Nguyen Thi Khanh Nhu; Adam D Irwin; Patrick N A Harris; David L Paterson; Mark A Schembri; David M Whiley; Scott A Beatson
Journal:  BMC Genomics       Date:  2021-06-25       Impact factor: 3.969

10.  Highly contiguous assemblies of 101 drosophilid genomes.

Authors:  Bernard Y Kim; Jeremy R Wang; Daniel R Matute; Dmitri A Petrov; Danny E Miller; Olga Barmina; Emily Delaney; Ammon Thompson; Aaron A Comeault; David Peede; Emmanuel R R D'Agostino; Julianne Pelaez; Jessica M Aguilar; Diler Haji; Teruyuki Matsunaga; Ellie E Armstrong; Molly Zych; Yoshitaka Ogawa; Marina Stamenković-Radak; Mihailo Jelić; Marija Savić Veselinović; Marija Tanasković; Pavle Erić; Jian-Jun Gao; Takehiro K Katoh; Masanori J Toda; Hideaki Watabe; Masayoshi Watada; Jeremy S Davis; Leonie C Moyle; Giulia Manoli; Enrico Bertolini; Vladimír Košťál; R Scott Hawley; Aya Takahashi; Corbin D Jones; Donald K Price; Noah Whiteman; Artyom Kopp
Journal:  Elife       Date:  2021-07-19       Impact factor: 8.713

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.