| Literature DB >> 31820709 |
Ivo Elliott1,2, Elizabeth M Batty3,1,4, Damien Ming2, Matthew T Robinson1,2, Pruksa Nawtaisong2, Mariateresa de Cesare4, Paul N Newton1,2, Rory Bowden4.
Abstract
The infrastructure challenges and costs of next-generation sequencing have been largely overcome, for many sequencing applications, by Oxford Nanopore Technologies' portable MinION sequencer. However, the question remains open whether MinION-based bacterial whole genome sequencing is by itself sufficient for the accurate assessment of phylogenetic and epidemiological relationships between isolates and whether such tasks can be undertaken in resource-limited settings. To investigate this question, we sequenced the genome of an isolate of Rickettsia typhi, an important and neglected cause of fever across much of the tropics and subtropics, for which only three genomic sequences previously existed. We prepared and sequenced libraries on a MinION in Vientiane, Lao PDR, using v9.5 chemistry, and in parallel, we sequenced the same isolate on the Illumina platform in a genomics laboratory in the United Kingdom. The MinION sequence reads yielded a single contiguous assembly, in which the addition of Illumina data revealed 226 base-substitution and 5,856 indel errors. The combined assembly represents the first complete genome sequence of a human R. typhi isolate collected in the last 50 years and differed from the genomes of existing strains collected over a 90-year time period at very few sites, with no rearrangements. Filtering based on the known error profile of MinION data improved the accuracy of the nanopore-only assembly. However, the frequency of false-positive errors remained greater than true sequence divergence from recorded sequences. Although nanopore-only sequencing cannot yet recover phylogenetic signals in R. typhi, such an approach may be applicable for more diverse organisms.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31820709 PMCID: PMC7008338 DOI: 10.4269/ajtmh.19-0383
Source DB: PubMed Journal: Am J Trop Med Hyg ISSN: 0002-9637 Impact factor: 2.345
Strain information
| Strain | Source | Accession | Genome length (bp) | Predicted genes | GC percentage |
|---|---|---|---|---|---|
| Wilmington | Human, North Carolina, USA. 1928 | NC_006142.1 | 1,111,496 | 865 | 0.289 |
| B9991CWPP | Bandicoot rat, Myanmar. 1970 | NC_017062.1 | 1,112,957 | 865 | 0.289 |
| TH1527 | Human, Chiang Rai, Thailand. 1965 | NC_017066.1 | 1,112,372 | 865 | 0.289 |
| TM2540 | Human, Bolikhamxay, Laos. 2012 | ERZ497871 | 1,111,939 | 866 | 0.289 |
The Wilmington, B991CWPP, and TH1527 strains are previously assembled strains of Rickettsia typhi; TM2540 is the assembly produced in this study.
Figure 1.Pavian visualization showing the proportions of Oxford Nanopore Technologies′ reads assigned up to the genus level[32]. This figure appears in color at
Figure 2.The observed distance from a homopolymer run for the errors corrected by Pilon in the draft genome, compared with the expected distribution if errors were randomly distributed across the genome. This figure appears in color at
Proportion of BUSCO bacterial gene set found as complete genes, fragmented genes, or missing for different assemblies of the TM2540 strain, and the reference Wilmington assembly
| Complete (%) | Fragmented (%) | Missing (%) | |
|---|---|---|---|
| Canu only | 0.00 | 3.40 | 96.60 |
| Canu + Nanopolish | 14.20 | 32.40 | 53.40 |
| Canu + Nanopolish + Pilon (1 round) | 85.10 | 2.70 | 12.20 |
| Canu + Nanopolish + Pilon (2 rounds) | 88.50 | 0.70 | 10.80 |
| Canu + Nanopolish + Pilon (3 rounds) | 88.50 | 0.70 | 10.80 |
| Canu + Nanopolish + Pilon (4 rounds) | 88.50 | 0.70 | 10.80 |
| Unicycler (Illumina only) | 88.50 | 0.70 | 10.80 |
| Reference genome (Wilmington) | 88.50 | 0.70 | 10.80 |
All complete genes were single copy.
Pairwise differences between strains based on the whole genome alignment
| Wilmington | B9991CWPP | TH1527 | TM2540 | |
|---|---|---|---|---|
| Wilmington | 0 | |||
| B9991CWPP | 27 | 0 | ||
| TH1527 | 18 | 21 | 0 | |
| TM2540 | 40 | 45 | 36 | 0 |