| Literature DB >> 32083020 |
Kazuo Imai1,2, Rina Nemoto3, Masahiro Kodana4, Norihito Tarumoto1,2, Jun Sakai1,2, Toru Kawamura4, Kenji Ikebuchi4, Kotaro Mitsutake5, Takashi Murakami2,3, Shigefumi Maesaki1,2, Taku Fujiwara6, Satoshi Hayakawa7, Tomonori Hoshino8, Mitsuko Seki7,8, Takuya Maeda2,4.
Abstract
Differentiation between mitis group streptococci (MGS) bacteria in routine laboratory tests has become important for obtaining accurate epidemiological information on the characteristics of MGS and understanding their clinical significance. The most reliable method of MGS species identification is multilocus sequence analysis (MLSA) with seven house-keeping genes; however, because this method is time-consuming, it is deemed unsuitable for use in most clinical laboratories. In this study, we established a scheme for identifying 12 species of MGS (S. pneumoniae, S. pseudopneumoniae, S. mitis, S. oralis, S. peroris, S. infantis, S. australis, S. parasanguinis, S. sinensis, S. sanguinis, S. gordonii, and S. cristatus) using the MinION nanopore sequencer (Oxford Nanopore Technologies, Oxford, UK) with the taxonomic aligner "What's in My Pot?" (WIMP; Oxford Nanopore's cloud-based analysis platform) and Kraken2 pipeline with the custom database adjusted for MGS species identification. The identities of the species in reference genomes (n = 514), clinical isolates (n = 31), and reference strains (n = 4) were confirmed via MLSA. The nanopore simulation reads were generated from reference genomes, and the optimal cut-off values for MGS species identification were determined. For 31 clinical isolates (S. pneumoniae = 8, S. mitis = 17 and S. oralis = 6) and 4 reference strains (S. pneumoniae = 1, S. mitis = 1, S. oralis = 1, and S. pseudopneumoniae = 1), a sequence library was constructed via a Rapid Barcoding Sequencing Kit for multiplex and real-time MinION sequencing. The optimal cut-off values for the identification of MGS species for analysis by WIMP and Kraken2 pipeline were determined. The workflow using Kraken2 pipeline with a custom database identified all 12 species of MGS, and WIMP identified 8 MGS bacteria except S. infantis, S. australis, S. peroris, and S. sinensis. The results obtained by MinION with WIMP and Kraken2 pipeline were consistent with the MGS species identified by MLSA analysis. The practical advantage of whole genome analysis using the MinION nanopore sequencer is that it can aid in MGS surveillance. We concluded that MinION sequencing with the taxonomic aligner enables accurate MGS species identification and could contribute to further epidemiological surveys.Entities:
Keywords: Kraken; MinION; WIMP; mitis group streptococci; whole genome sequencing
Mesh:
Year: 2020 PMID: 32083020 PMCID: PMC7002467 DOI: 10.3389/fcimb.2020.00011
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1Phylogenetic analysis of mitis group streptococci based on multilocus sequence analysis. The phylogenetic tree was constructed by the Neighbor-Joining method, and the reliability of each tree topology was checked by 500 bootstrap replications. The strain names and reference numbers of reference genomes in the RefSeq database are listed in Table S1. Strains are colored according to species annotation in the RefSeq database. The detailed phylogenetic tree is shown in Figure S1.
Figure 2Heatmap of species-level read abundance based on WIMP and Kraken2 pipeline. Heatmaps show the species-level read abundance of simulation nanopore read sets for reference genomes in the RefSeq database. The species-level read abundance was calculated by WIMP (A) and Kraken2 pipeline (B). Blue indicates low abundance, yellow indicates intermediate, and red indicates high. The results of the Kraken2 database showed the high species-level read abundance for species identified by multilocus sequence analysis (MLSA) among mitis group streptococci bacteria, whilst the results of WIMP showed low species-level read abundance for S. peroris, S. infantis, S. australis, and S. sinensis. S. p, S. pneumoniae; S. pp, S. pseudopneumoniae; S. m, S. mitis; S. o, S. oralis; S. pero, S. peroris; S. infa, S. infantis; S. aust, S. australis; S. para, S. parasanguinis; S. sine, S. sinensis; S. sang, S. sanguinis; S. gord, S. gordonii; and S. cri, S. cristatus.
Results of taxonomic assignment via WIMP and Kraken2 pipeline.
| 74 | 95.60 | 1.68 | 75 | 1 | 100 | 100 | |
| 40 | 58.00 | 7.64 | 35 | 1 | 100 | 100 | |
| 98 | 54.38 | 9.81 | 35 | 1 | 100 | 100 | |
| 115 | 73.22 | 11.7 | 35 | 1 | 100 | 100 | |
| 1 | Undetectable | – | Undetectable | – | – | – | |
| 14 | Undetectable | – | Undetectable | – | – | – | |
| 5 | Undetectable | – | Undetectable | – | – | – | |
| 36 | 82.02 | 7.93 | 55 | 1 | 100 | 100 | |
| 1 | Undetectable | – | Undetectable | – | – | – | |
| 60 | 77.01 | 3.87 | 55 | 1 | 100 | 100 | |
| 46 | 94.68 | 2.28 | 70 | 1 | 100 | 100 | |
| 24 | 67.55 | 9.60 | 55 | 1 | 100 | 100 | |
| 74 | 81.33 | 1.94 | 65 | 1 | 100 | 100 | |
| 40 | 85.22 | 0.58 | 65 | 1 | 100 | 100 | |
| 98 | 95.99 | 3.46 | 65 | 1 | 100 | 100 | |
| 115 | 97.00 | 2.95 | 65 | 1 | 100 | 100 | |
| 1 | 98.17 | − | 75 | 1 | 100 | 100 | |
| 14 | 79.93 | 19.18 | 45 | 1 | 100 | 100 | |
| 5 | 86.87 | 20.81 | 45 | 1 | 100 | 100 | |
| 36 | 99.19 | 0.45 | 75 | 1 | 100 | 100 | |
| 1 | 98.97 | − | 75 | 1 | 100 | 100 | |
| 60 | 99.06 | 0.49 | 75 | 1 | 100 | 100 | |
| 46 | 98.44 | 0.58 | 75 | 1 | 100 | 100 | |
| 24 | 98.78 | 0.41 | 75 | 1 | 100 | 100 | |
S. p, S. pneumoniae; S. pp, S. pseudopneumoniae; S. m, S. mitis; S. o, S. oralis; S. pero, S. peroris; S. infa, S. infantis; S. aust =S. australis; S. para, S. parasanguinis; S. sine, S. sinensis; S. sang, S. sanguinis; S. gord, S. gordonii; and S. cri, S. cristatus.
Summary of species identification among clinical isolates.
| ATCC 49619 | 37,731 | 132.4 M | 3,508 | 60× | NA | NA | NA | NA | |||
| JNBP 05639 | 47,447 | 74.7 M | 1,574 | 34× | NA | NA | NA | NA | |||
| JNBP 08070 | 49,882 | 93.5 M | 1,875 | 43× | NA | NA | NA | NA | |||
| CCUG 49455 | 7,711 | 25.2 M | 3,267 | 11× | NA | NA | NA | NA | |||
| S1 | 18,395 | 31.0 M | 1,684 | 14× | S | + | |||||
| S2 | 26,806 | 43.9 M | 1,637 | 20× | S | + | |||||
| S3 | 27,316 | 127.1 M | 4,654 | 58× | S | + | |||||
| S4 | 11,765 | 34.9 M | 2,969 | 16× | S | + | |||||
| S5 | 16,869 | 50.6 M | 3,001 | 23× | S | + | |||||
| S6 | 36,377 | 167.4 M | 4,602 | 76× | S | + | |||||
| S7 | 7,579 | 33.2 M | 4,374 | 15× | S | + | |||||
| S8 | 22,514 | 70.8 M | 3,143 | 32× | S | + | |||||
| S9 | 31,807 | 63.5 M | 1,997 | 30× | R | − | |||||
| S10 | 20,513 | 79.7 M | 3,884 | 37× | R | +* | |||||
| S11 | 42,314 | 146.1 M | 3,452 | 68× | R | − | |||||
| S12 | 9,642 | 47.8 M | 4,960 | 22× | R | +* | |||||
| S13 | 4,259 | 17.2 M | 4,028 | 8× | R | − | |||||
| S14 | 10,523 | 31.3 M | 2,977 | 15× | R | − | |||||
| S15 | 77,786 | 199.7 M | 2,567 | 93× | R | − | |||||
| S16 | 22,768 | 83.8 M | 3,679 | 39× | R | − | |||||
| S17 | 4,819 | 15.5 M | 3,223 | 7× | R | − | |||||
| S18 | 80,010 | 208.0 M | 2,599 | 97× | R | − | |||||
| S19 | 52,061 | 200.3 M | 3,848 | 93× | R | − | |||||
| S20 | 14,388 | 64.1 M | 4,453 | 30× | R | − | |||||
| S21 | 13,395 | 51.4 M | 3,836 | 24× | R | − | |||||
| S22 | 6,614 | 14.2 M | 2,141 | 7× | R | − | |||||
| S23 | 14,644 | 32.7 M | 2,233 | 15× | R | − | |||||
| S24 | 11,277 | 266.5 M | 4,993 | 124× | R | − | |||||
| S25 | 23,957 | 37.8 M | 1,579 | 18× | R | − | |||||
| S26 | 45,499 | 128.7 M | 2,829 | 64× | R | − | |||||
| S27 | 10,793 | 22.7 M | 2,101 | 11× | R | − | |||||
| S28 | 15,630 | 48.0 M | 3,068 | 24× | R | − | |||||
| S29 | 1,680 | 2.5 M | 1,461 | 1× | R | − | |||||
| S30 | 6,548 | 5.2 M | 786 | 3× | R | − | |||||
| S31 | 25,997 | 56.5 M | 2,174 | 28× | R | − | |||||
S. p, S. pneumoniae; S. pp, S. pseudopneumoniae; S. m, S. mitis; S. o, S. oralis; S. sang, S. sanguinis; and G. m, G. morbillorum.
Asterisk indicates discrepancy in the results of multilocus sequence analysis as a reference identification method compared with other methods.
Figure 3Species-level read abundance based on WIMP and Kraken2 pipeline for clinical isolates and reference strains. The stacked bar graphs show the species-level read abundance in MinION sequencing reads (Kilian and Tettelin, 2019); abundance was calculated based on WIMP (A) and Kraken2 pipeline (B). S. p, S. pneumoniae; S. pp, S. pseudopneumoniae; S. m, S. mitis; S. o, S. oralis; others, S. peroris, S. infantis, S. australis, S. parasanguinis, S. sinensis, S. sanguinis, S. gordonii, S. cristatus, and Streptococcus. sp.