| Literature DB >> 33281796 |
Jared T Broddrick1, Richard Szubin2, Charles J Norsigian2, Jonathan M Monk2, Bernhard O Palsson2, Mary N Parenteau1.
Abstract
Advances in nanopore-based sequencing techniques have enabled rapid characterization of genomes and transcriptomes. An emerging application of this sequencing technology is point-of-care characterization of pathogenic bacteria. However, genome assessments alone are unable to provide a complete understanding of the pathogenic phenotype. Genome-scale metabolic reconstruction and analysis is a bottom-up Systems Biology technique that has elucidated the phenotypic nuances of antimicrobial resistant (AMR) bacteria and other human pathogens. Combining these genome-scale models (GEMs) with point-of-care nanopore sequencing is a promising strategy for combating the emerging health challenge of AMR pathogens. However, the sequencing errors inherent to the nanopore technique may negatively affect the quality, and therefore the utility, of GEMs reconstructed from nanopore assemblies. Here we describe and validate a workflow for rapid construction of GEMs from nanopore (MinION) derived assemblies. Benchmarking the pipeline against a high-quality reference GEM of Escherichia coli K-12 resulted in nanopore-derived models that were >99% complete even at sequencing depths of less than 10× coverage. Applying the pipeline to clinical isolates of pathogenic bacteria resulted in strain-specific GEMs that identified canonical AMR genome content and enabled simulations of strain-specific microbial growth. Additionally, we show that treating the sequencing run as a mock metagenome did not degrade the quality of models derived from metagenome assemblies. Taken together, this study demonstrates that combining nanopore sequencing with GEM construction pipelines enables rapid, in situ characterization of microbial metabolism.Entities:
Keywords: MinION long-read sequencing; MinION nanopore device®; antimicrobial resistance (AMR); constraint-based model; metabolic model reconstruction; nanopore sequencing
Year: 2020 PMID: 33281796 PMCID: PMC7688782 DOI: 10.3389/fmicb.2020.596626
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1A schematic of the genome assembly and model construction pipeline used in this study.
Summary of assembly statistics for E. coli strain K12 substr. BOP27.
| Basecaller | Coverage | Read N50 (bp) | Genome size (Mbp) | Contigs | SNPs | InDels | Basecalling time (min) | Assembly time (min) | |
| HAC+ | 307× | 22k | 4.640 | 1 | 32.0 | 21 | 598 | 177 | 207 |
| HAC− | 306× | 22k | 4.639 | 1 | 27.3 | 4452 | 901 | 176 | 207 |
| Fast | 260× | 22k | 4.638 | 1 | 24.3 | 10621 | 4316 | 53 | 197 |
Assembly quality for E. coli strain K12 substr. BOP27 at different steps in the pipeline for reads basecalled with the high accuracy algorithm with methylation calling enabled.
| Genome size (Mbp) | SNPs | InDels | ||
| MiniASM_R1 | 4.636 | 25.6 | 470 | 9279 |
| Flye | 4.649 | 25.3 | 76 | 10482 |
| Flye + Medaka | 4.641 | 30.7 | 27 | 1660 |
| Flye + R1 + M | 4.639 | 30.9 | 32 | 654 |
| Flye + R2 + M | 4.640 | 32.0 | 21 | 598 |
| Flye + R4 + M | 4.640 | 32.0 | 25 | 596 |
FIGURE 2Assembly statistics versus coverage depth. (A) Assembly time versus genome coverage depth. (B) Assembly accuracy versus coverage depth.
FIGURE 3Assembly accuracy versus the number of coding DNA sequences annotated.
Statistics of genome-scale metabolic reconstructions built from the assembly and annotation pipeline.
| Basecaller | Coverage | # contigs | Split CDS recovered (%) | ORFs recovered | Genes | Reactions | Growth rate (hr–1) |
| Fast | 6× | 24 | 4155 (76%) | 547 | 1502 | 2712 | 0.877 |
| 66× | 1 | 1725 (84%) | 40 | 1507 | 2712 | 0.877 | |
| 260× | 1 | 1576 (83%) | 32 | 1511 | 2712 | 0.877 | |
| HAC+mod | 7× | 5 | 2902 (82%) | 233 | 1510 | 2710 | 0.877 |
| 58× | 1 | 360 (93%) | 5 | 1515 | 2712 | 0.877 | |
| 307× | 1 | 176 (96%) | 3 | 1516 | 2712 | 0.877 | |
| iML1515 | N/A | N/A | N/A | N/A | 1516 | 2712 | 0.877 |
Assembly statistics for clinical isolates of pathogenic bacteria characterized in this study.
| HAC+ | HAC− | |||||||
| Barcode | Species ID | Size (Mbp) | Cov. | ORFs | Mean ORF length (bp) | ORFs | Mean ORF length (bp) | Assembly time (min) |
| BC07 | 3.93 | 13× | 5826 | 561 | 5376 | 622 | 13 | |
| BC09 | 2.89 | 46× | 3595 | 669 | 3217 | 762 | 34 | |
| BC11 | 2.84 | 75× | 2961 | 803 | 2788 | 859 | 27 | |
FIGURE 4Comparison of S. aureus USA300 TCH1516 versus the S. aureus clinical isolate based on outputs of the genome-scale metabolic reconstruction pipeline.
FIGURE 5Comparison of A. baumannii AYE versus the A. baumannii clinical isolate based on outputs of the genome-scale metabolic reconstruction pipeline.
Draft metabolic reconstructions of E. faecium clinical isolate using different reference genome-scale reconstructions.
| Reference Strain | Ref model genes | Model Genes | Unique | No BBH hits | |
| 517 | 342 | 35 | 1435 | ||
| 844 | 319 | 21 | 1492 | ||
| 854 | 448 | 77 | 1545 | ||
| 1515 | 434 | 97 | 1848 |
Assembly statistics for the MinION mock metagenome assembly.
| Contigs | Contig ID | Mean coverage | # contigs | Mean GC content (%) | Size (bp) |
| Circular contigs | 16 | 1 | 39 | 3924200 | |
| 125 | 1 | 38 | 19664 | ||
| 5 | 1 | 49 | 7257 | ||
| 23 | 1 | 55 | 2035 | ||
| 50 | 1 | 48 | 92091 | ||
| 57 | 1 | 38 | 2877951 | ||
| 275 | 1 | 36 | 57023 | ||
| 610 | 1 | 35 | 48025 | ||
| 3164 | 1 | 39 | 2014 | ||
| 80 | 1 | 33 | 2826253 | ||
| 33 | 1 | 60 | 7842 | ||
| Bin 1 | 17 | 3 | 50 | 737975 | |
| Bin 2 | 39 | 1 | 49 | 55052 | |
| Bin 3 | 829 | 2 | 44 | 3300 | |
| Bin 4 | 12 | 1 | 53 | 43757 | |
| Bin 5 | 27 | 3* | 50 | 4626033 | |
| Bin 6 | 71 | 1 | 36 | 164487 |
FIGURE 6Phylogenomic analysis of genome scale models in the BiGG Models database.