Abstract
Nanopore sequencing produces long reads and offers unique advantages over next-generation sequencing, especially for assembling draft bacterial genomes with improved completeness. However, assembly errors can arise from both the characteristics of the data and the assembly algorithms. To address these issues, we developed MAECI, a pipeline that generates consensus sequences from multiple assemblies of the same nanopore sequencing data and then performs error correction. Systematic evaluation showed that MAECI is an efficient and effective pipeline for improving the accuracy and completeness of bacterial genome assemblies. The code and implementation are available at https://github.com/langjidong/MAECI.
Year: 2022 PMID: 35594250 PMCID: PMC9122195 DOI: 10.1371/journal.pone.0267066
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Overview of the MAECI assembly pipeline.
Fig 2. A) Comparison of the performance of Canu, Flye, Wtdbg2, and MAECI in the assembly of 10 simulated ONT datasets of Enterobacter cloacae strain GGT036 (NZ_CP009756.1) before and after polishing with simulated NGS data. B) Box plots of the coefficients of variation (CVs) of total length, GC content, mismatches per 100 kb, and indels per 100 kb across the 10 datasets of each of the nine samples. An asterisk (*) indicates that small CVs were scaled up for display: the CVs of total length were multiplied by 100, and the CVs of GC content were multiplied by 1,000.
Fig 3. A) Performance comparison of Canu, Flye, Wtdbg2, and MAECI on real datasets of A. faecalis PGB1 published by Lang et al. B) Comparison of MAECI and Trycycler on real ONT datasets of three strains published by Wick et al.
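
The coefficient of variation shown in Fig 2B, together with the display scaling described in its caption, can be illustrated with a minimal Python sketch. This is not taken from the MAECI code base: the function names, the metric keys, and the example values are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the caption does not state which estimator was used.

```python
from statistics import mean, stdev

# Display scaling factors from the Fig 2B caption; other metrics are left unscaled.
# The metric key names themselves are hypothetical.
DISPLAY_SCALE = {"total_length": 100, "gc_content": 1000}


def coefficient_of_variation(values):
    """Return standard deviation divided by mean for at least two numbers.

    Assumption: sample standard deviation (n - 1 denominator).
    """
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return stdev(values) / m


def scaled_cv(metric, values):
    """Return the CV multiplied by the display factor for this metric, if any."""
    return coefficient_of_variation(values) * DISPLAY_SCALE.get(metric, 1)


# Hypothetical total assembly lengths (bp) from three replicate datasets.
example_lengths = [4_800_100, 4_800_350, 4_799_900]
print(scaled_cv("total_length", example_lengths))
```

Scaling only changes how the values are drawn on a shared axis; comparisons between tools within one metric are unaffected.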
Statistics of the simulated data for three bacterial strains.
| Sample | Paper_data-Rapid (after Filtlong): read count | Paper_data-Rapid (after Filtlong): total size (bp) | Paper_data-Rapid (after Filtlong): N50 length (bp) | Simulated data: read count | Simulated data: total size (bp) | Simulated data: N50 length (bp) |
|---|---|---|---|---|---|---|
| | 69,135 | 741,274,205 | 15,130 | 70,000 | 543,179,080 | 9,385 |
| | 69,978 | 768,514,620 | 15,181 | 70,000 | 543,767,980 | 9,411 |
| | 85,440 | 948,228,758 | 15,364 | 90,000 | 698,317,203 | 9,363 |
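
For readers unfamiliar with the N50 length reported in the table, the following Python sketch shows the usual definition: the length of the shortest read in the smallest set of longest reads that together cover at least half of the total bases. The function name and the example read lengths are hypothetical, and the sketch is not the tool used to produce the table.

```python
def n50(lengths):
    """Return the N50 of a collection of read (or contig) lengths.

    Reads are visited from longest to shortest; the N50 is the length at
    which the running total first reaches half of the overall total.
    """
    if not lengths:
        raise ValueError("N50 is undefined for an empty collection")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical read lengths in bp.
print(n50([2_000, 3_000, 4_000, 5_000, 6_000]))
```

Because N50 is weighted by bases rather than by read count, two datasets with similar read counts can differ substantially in N50, as the two halves of the table illustrate.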