| Literature DB >> 29906261 |
Arnold Bainomugisa1,2, Tania Duarte1, Evelyn Lavu3, Sushil Pandey4, Chris Coulter4, Ben J Marais5, Lachlan M Coin1.
Abstract
A better understanding of the genomic changes that facilitate the emergence and spread of drug-resistant Mycobacterium tuberculosis strains is currently required. Here, we report the use of the MinION nanopore sequencer (Oxford Nanopore Technologies) to sequence and assemble an extensively drug-resistant (XDR) isolate, which is part of a modern Beijing sub-lineage strain, prevalent in Western Province, Papua New Guinea. Using 238-fold coverage obtained from a single flow-cell, de novo assembly of nanopore reads resulted into one contiguous assembly with 99.92 % assembly accuracy. Incorporation of complementary short read sequences (Illumina) as part of consensus error correction resulted in a 4 404 064 bp genome with 99.98 % assembly accuracy. This assembly had an average nucleotide identity of 99.7 % relative to the reference genome, H37Rv. We assembled nearly all GC-rich repetitive PE/PPE family genes (166/168) and identified variants within these genes. With an estimated genotypic error rate of 5.3 % from MinION data, we demonstrated identification of variants to include the conventional drug resistance mutations, and those that contribute to the resistance phenotype (efflux pumps/transporter) and virulence. Reference-based alignment of the assembly allowed detection of deletions and insertions. MinION sequencing provided a fully annotated assembly of a transmissible XDR strain from an endemic setting and showed its utility to provide further understanding of genomic processes within Mycobacterium tuberculosis.Entities:
Keywords: Mycobacterium tuberculosis; Oxford Nanopore Technologies
Mesh:
Year: 2018 PMID: 29906261 PMCID: PMC6113869 DOI: 10.1099/mgen.0.000188
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Bar graph showing the unimodal sequence length distribution from ONT.
CheckM quality evaluation and assembly accuracy of draft genomes obtained from the three consensus tools, relative to the SPAdes assembly
Racon tool used ONT reads only, Pilon used Illumina reads only and Pilon+Racon used a combination of ONT and Illumina reads.
| SPAdes assembly (4 346 566 bp) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Tool | Genome size (bp) | Completeness (%) | Contamination | Strain heterogeneity | No. of indels | No. of SNPs | Total errors | Accuracy (%) |
| Racon | 4 405 333 | 83.2 | 0.00 | 0.00 | 2421 | 1026 | 3447 | 99.92 |
| Pilon | 4 404 947 | 97.74 | 0.00 | 0.00 | 1048 | 236 | 1284 | 99.97 |
| Pilon+Racon | 4 404 064 | 100 | 0.00 | 0.00 | 807 | 217 | 1024 | 99.98 |
Fig. 2.Circular representation of a WP-XDR genome (modern Beijing sub-lineage 2.2.1.1 strain) with gene annotations. From the inner to outer ring: blue, GC content of coding sequence; scattered grey, SNPs relative to H37Rv; grey bars, reverse and forward rRNA; green, reverse coding sequences; red, forward coding sequences; yellow bars, unique regions compared to H37Rv; grey outside ring, assembled contig. Mb, million base pairs.
Mutations in candidate drug resistance genes identified from the WP-XDR assembly
| Drug | Investigated genes | Genes (mutation) | |
|---|---|---|---|
| Isoniazid | Resistant | ||
| Rifampicin | Resistant | ||
| Ethambutol | Resistant | ||
| Pyrazinamide | Resistant | ||
| Streptomycin | Resistant | ||
| Ethionamide | Resistant | ||
| Fluoroquinolone | Resistant | ||
| Amikacin | Susceptible | Nil | |
| Kanamycin | Susceptible | Nil | |
| Capreomycin | Resistant | ||
| PAS | Susceptible | Nil | |
| Cycloserine | Susceptible | Nil |
PAS, p-aminosalicylic acid.
Mutations in putative efflux pump/transporter genes identified from the WP-XDR assembly
| Genes investigated | Gene (mutation) identified | |
|---|---|---|
| p.Met74Thr | ||
| p.Pro1098Leu | ||
| p.Asp699Glu | ||
| p.Thr54Ser | ||
| p.Asp67His | ||
| Val127Leu | ||
| p.Met243Leu | ||
| p.Gln243Arg | ||
| p.Arg278Gly | ||
| p.Pro156Thr | ||
Characteristics of identified regions deleted from the WP-XDR assembly relative to the reference genome H37Rv
| Start | Stop | Size (bp) | Region of difference | Genes deleted | RNA deleted |
|---|---|---|---|---|---|
| 79 557 | 83 036 | 3479 | 105 | Rv0071, Rv0072, Rv0073 | ncRv10071, ncRv10071c |
| 1 779 278 | 1 788 514 | 9 236 | 149 | Rv1573, Rv1574, Rv1575, Rv1576c, Rv1577c, Rv1578c, Rv1579c, Rv1580c, Rv1581c, Rv1582c, Rv1583c, Rv1584c, Rv1585c, Rv1586c | Nil |
| 1 986 636 | 1 998 623 | 11 987 | 152 | plcD, Rv1756c, Rv1757c, cut1, Rv1759c, Rv1760, Rv1761c, Rv1762c, Rv1763, Rv1764, Rv1765c | Nil |
| 2 365 414 | 2 366 769 | 1 355 | Unknown* | Rv2105, Rv2106 | Nil |
| 2 535 431 | 2 536 142 | 711 | 181 | Rv2262c, Rv2263 | Nil |
| 3 122 553 | 3 127 944 | 5 391 | 207 | Rv2816c, Rv2817c, Rv2818c, Rv2819c, Rv2820c | Nil |
| 3 551 229 | 3 552 585 | 1 356 | Unknown* | Rv3184, Rv3185 | Nil |
| 3 552 710 | 3 554 025 | 1 315 | Unknown* | Rv3186, Rv3187 | Nil |
*Unknown deletions in the study strain.
Fig. 3.Integrative Genomic Viewer (IGV) of Illumina reads from different M. tuberculosis lineages (A, M. tuberculosis H37Rv; B, Indo-Oceanic; C, ancient Beijing; D, modern Beijing; E, East African Indian; F and G, Euro American) mapped on the ONT assembled genome highlighting the large insertions within the assembled genome. On the left: 4490 bp insertion (2 207 042–2 211 532) spanning seven annotated genes (blue checked arrows, NADH-dependent oxidoreducatse, iron-regulated elongation factor-tu, PE family protein and hypothetical/uncharacterized proteins). On the right: 390 bp insertion relative to H37Rv (3 488 211–3 488 601) spanning 323 bp (checked) at the end of a 654 bp coding sequence. ONT, Oxford Nanopore Technologies.