| Literature DB >> 35750759 |
Mathias Vandenbogaert1, Aurélia Kwasiborski1, Ella Gonofio2, Stéphane Descorps-Declère3, Benjamin Selekon2, Andriniaina Andy Nkili Meyong4, Rita Sem Ouilibona2, Antoine Gessain5, Jean-Claude Manuguerra1, Valérie Caro1, Emmanuel Nakoune2, Nicolas Berthet6,7.
Abstract
Monkeypox is an emerging and neglected zoonotic disease whose number of reported cases has been gradually increasing in Central Africa since 1980. This disease is caused by the monkeypox virus (MPXV), which belongs to the genus Orthopoxvirus in the family Poxviridae. Obtaining molecular data is particularly useful for establishing the relationships between the viral strains involved in outbreaks in countries affected by this disease. In this study, we evaluated the use of the MinION real-time sequencer as well as different polishing tools on MinION-sequenced genome for sequencing the MPXV genome originating from a pustular lesion in the context of an epidemic in a remote area of the Central African Republic. The reads corresponding to the MPXV genome were identified using two taxonomic classifiers, Kraken2 and Kaiju. Assembly of these reads led to a complete sequence of 196,956 bases, which is 6322 bases longer than the sequence previously obtained with Illumina sequencing from the same sample. The comparison of the two sequences showed mainly indels at the homopolymeric regions. However, the combined use of Canu with specific polishing tools such as Medaka and Homopolish was the best combination that reduced their numbers without adding mismatches. Although MinION sequencing is known to introduce a number of characteristic errors compared to Illumina sequencing, the new polishing tools allow a better-quality MinION-sequenced genome, thus to be used to help determine strain origin through phylogenetic analysis.Entities:
Mesh:
Year: 2022 PMID: 35750759 PMCID: PMC9232561 DOI: 10.1038/s41598-022-15073-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Workflow for the analysis of raw data, from real-time acquisition to the listing of SNPs and indels.
Figure 2Map of the Central African Republic in Africa. The map was edited in Microsoft PowerPoint (Version 2020).
Summary of the taxonomic assignments of the reads for each of the two classifiers used (Kraken2 and Kaiju).
| Kraken2 | Kaiju | |
|---|---|---|
| Number of reads | Number of reads | |
| 134,702 | 63,192 | |
| Other | 0 | 13,730 |
| Total | 134,702 (92.86%) | 76,922 (64.02%) |
| Proteobacteria | 1384 | 24,088 |
| Terrabacteria group | 23 | 7555 |
| Chlamydiae | 0 | 1077 |
| Other | 0 | 3382 |
| Total | 1407 (0.97%) | 36,102 (30.02%) |
| Family | 2168 | 2195 |
| Other | – | 127 |
| Total | 2198 (1.51%) | 2322 (1.93%) |
| Unassigned | – | 4841 (4.03%) |
| Bacterial metagenomes | 6761 (4.66%) | – |
| Total number of reads assigned to a taxon | 145,068 (100%) | 120,187 (100%) |
Figure 3Genome sequencing coverage and location of A/T homopolymers in the monkeypox virus (MPXV) genome. Homopolymer repeat length at each homopolymer position (coordinates) in the genome.
Figure 4Tandem repeat locations in monkeypox virus genome for short-reads Illumina (A) and long-reads MinION (B) sequenced genomes using Tander Repeat Finder tool.
Figure 5Phylogeny of monkeypox viruses (MPXV) based on complete genomes. The West African clade as well as the 5 groups of the Central African clade were highlighted by different colors: West African clade (orange), Central African group I (blue), II (grey), III (yellow), IV (white) and V (green). The final phylogenetic was generated using FigTree version 1.4.4.
Assembly and Polishing results using Illumina-sequenced genome (MN702446).
| Assembly/polishing methods | Q-score | Number of mismatches | Number of InDels |
|---|---|---|---|
| Canu | 26.16 | 0 | 433 |
| Canu/Homopolish | 30.45 | 2 | 159 |
| Canu/Medaka | 26.78 | 0 | 375 |
| Canu/Medaka/Homopolish | 30.51 | 0 | 159 |
| Canu/Racon | 28.71 | 1 | 236 |
| Canu/Racon/Medaka | 28.39 | 1 | 254 |
| Canu/Racon/Medaka/Homopolish | 30.5 | 1 | 156 |
Figure 6Accuracy of correction of the different polishing tools in combination with Canu for homopolymer sizes between 4 and 9 nt.
Figure 7Scatterplot showing effect of polishing using Homopolish (all polymer occurrences).
Figure 8Scatterplot/Barplots showing Effect of polishing using Homopolish (occurrences of polymer of length < 10 bp).