| Literature DB >> 34788113 |
Hiu-Yin Lao1, Timothy Ting-Leung Ng1, Ryan Yik-Lam Wong2, Celia Sze-Ting Wong2, Lam-Kwong Lee1, Denise Sze-Hang Wong1, Chloe Toi-Mei Chan1, Stephanie Hoi-Ching Jim1, Jake Siu-Lun Leung1, Hazel Wing-Hei Lo1, Ivan Tak-Fai Wong1, Miranda Chong-Yee Yau2, Jimmy Yiu-Wing Lam2, Alan Ka-Lun Wu2, Gilman Kit-Hang Siu1.
Abstract
Bacterial pathogens that cannot be identified using matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) are occasionally encountered in clinical laboratories. The 16S rRNA gene is often used for sequence-based analysis to identify these bacterial species. Nevertheless, traditional Sanger sequencing is laborious, time-consuming, and low throughput. Here, we compared two commercially available 16S rRNA gene sequencing tests that are based on Illumina and Nanopore sequencing technologies, respectively, in their ability to identify the species of 172 clinical isolates that failed to be identified by MALDI-TOF MS. Sequencing data were analyzed by the respective built-in programs (MiSeq Reporter software of Illumina and Epi2me of Nanopore) and BLAST+ (v2.11.0). Their agreement with Sanger sequencing on species-level identification was determined. Discrepancies were resolved by whole-genome sequencing. The diagnostic accuracy of each workflow was determined using the composite sequencing result as the reference standard. Despite the high base-calling accuracy of Illumina sequencing, we demonstrated that the Nanopore workflow had a higher taxonomic resolution at the species level. Using built-in analysis algorithms, the concordance of Sanger 16S with the Illumina and Nanopore workflows was 33.14% and 87.79%, respectively. The agreement was 65.70% and 83.14%, respectively, when BLAST+ was used for analysis. Compared with the reference standard, the diagnostic accuracy of Nanopore 16S was 96.36%, which was identical to that of Sanger 16S and better than that of Illumina 16S (69.07%). The turnaround time of the Illumina workflow and the Nanopore workflow was 78 h and 8.25 h, respectively. The per-sample cost of the Illumina and Nanopore workflows was US$28.5 and US$17.7, respectively.Entities:
Keywords: 16S rRNA gene; Illumina sequencing; Nanopore sequencing; Sanger sequencing; bacterial species
Mesh:
Substances:
Year: 2021 PMID: 34788113 PMCID: PMC8769742 DOI: 10.1128/JCM.01769-21
Source DB: PubMed Journal: J Clin Microbiol ISSN: 0095-1137 Impact factor: 5.948
FIG 1Boxplots showing the distribution of percentages of classified reads of all samples in Illumina (a) and Nanopore (b) sequencing.
FIG 2Concordance between bacterial taxa inferred by the two HTS workflows and Sanger sequencing.
FIG 3Venn diagram showing concordance of bacterial taxa inferred by different 16S rRNA sequencing platforms. (a) Concordance of the top classified species between Illumina sequencing, coupled with MSR and Illumina_BLAST+ analysis, and Sanger sequencing. (b) Concordance of the top classified species between Nanopore sequencing, coupled with Epi2ME and nanoBLAST+, and Sanger sequencing. (c) Concordance of the top classified species among Sanger 16S, resolved Illumina 16S, resolved Nanopore 16S, and reference standard.
WGS analysis for samples with completely discordant taxonomic assignment by Sanger, Illumina, and Nanopore 16S rRNA sequencing
| Sample ID | Species inferred by Sanger 16S | Species inferred by resolved Illumina 16S | Species inferred by resolved Nanopore 16S | WGS results | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Best-matched species by WGS (reference genome accession no.) | Shasta genome assembly | Miniasm genome assembly | ||||||||
| Query coverage (%) | Identity (%) | ANI (%) | Query coverage (%) | Identity (%) | ANI (%) | |||||
| NA | NA | NA | 99.00 | 97.00 | 98.92 | 92.13 | 99.40 | 99.14 | ||
| NA | NA | NA | 94.06 | 99.95 | 99.30 | 88.39 | 99.92 | 99.23 | ||
| R001 |
|
| 42.21 | 87.44 | 78.29 | 42.42 | 87.41 | 78.55 | ||
| R006 |
|
| 43.04 | 79.12 | 78.49 | 42.04 | 87.49 | 78.44 | ||
| R062 |
|
|
| 92.17 | 99.17 | 98.71 | 86.30 | 98.99 | 98.69 | |
| R120 |
|
|
| 62.15 | 85.18 | 82.30 | 62.30 | 85.12 | 82.39 | |
| R121 |
|
|
| 6.07 | 78.55 | 70.34 | 6.04 | 78.24 | 70.86 | |
| R131 |
|
| No reliable ID | 6.19 | 82.12 | 71.21 | 6.29 | 78.25 | 71.26 | |
| R158 |
|
|
| 65.41 | 84.52 | 82.24 | 65.21 | 84.51 | 82.15 | |
| R181 |
|
|
| 31.48 | 89.67 | 82.09 | 30.68 | 89.59 | 81.95 | |
ID, identification; NA, not applicable.
Klebsiella pneumoniae BAA3079 and Staphylococcus aureus BAA3114 served as quality control samples, which were sequenced and analyzed in parallel with the discordant samples for WGS and bioinformatics analysis.
Discordant samples between MSR and Illumina_BLAST+ were resolved by nf-core/ampliseq.
Discordant samples between Epi2me and NanoBLAST+ were resolved by NanoCLUST.
An average nucleotide identity (ANI) of >94% indicated that the samples belonged to the same species as the best-matched genomes.
Samples with mismatched taxa inferred by at least one sequencing platform
| Sample ID | Species-level ID (reference standard) | Sanger sequencing | Illumina sequencing | Nanopore sequencing | |||
|---|---|---|---|---|---|---|---|
| Classified species from Sanger 16S | 16S identity against the reference (%) | Classified species from resolved Illumina 16S | 16S identity against the reference (%) | Classified species from resolved Nanopore 16S | 16S identity against the reference (%) | ||
| R003 |
|
| 99.26 |
| Matched |
| Matched |
| R013 |
|
| Matched |
| 97.47 |
| Matched |
| R017 |
|
| Matched |
| 97.47 |
| Matched |
| R021 |
|
| Matched |
| 97.47 |
| Matched |
| R024 |
|
| Matched |
| Matched |
| 97.62 |
| R025 |
|
| Matched |
| 98.50 |
| Matched |
| R026 |
|
| Matched |
| 98.50 |
| Matched |
| R036 |
|
| Matched |
| 93.29 |
| Matched |
| R040 |
|
| Matched |
| 99.26 |
| Matched |
| R043 |
|
| Matched |
| 99.64 |
| Matched |
| R045 |
|
| Matched |
| 99.86 |
| Matched |
| R047 |
|
| Matched |
| 99.65 |
| Matched |
| R049 |
|
| Matched |
| 99.86 |
| Matched |
| R052 |
|
| Matched |
| 96.60 |
| Matched |
| R053 |
|
| Matched |
| 99.86 |
| Matched |
| R056 |
|
| Matched |
| 93.74 |
| Matched |
| R058 |
|
| Matched |
| 94.67 |
| Matched |
| R062 |
|
| 99.20 |
| 97.07 |
| 98.56 |
| R063 |
|
| Matched |
| 98.04 |
| Matched |
| R069 |
|
| Matched |
| 98.69 |
| Matched |
| R071 |
|
| 99.07 |
| Matched |
| Matched |
| R072 |
|
| Matched |
| 98.96 |
| Matched |
| R073 |
|
| Matched |
| 95.95 |
| Matched |
| R078 |
|
| Matched |
| 99.22 |
| Matched |
| R079 |
|
| 99.07 |
| Matched |
| Matched |
| R083 |
|
| Matched |
| 95.95 |
| Matched |
| R086 |
|
| Matched |
| Matched |
| 99.06 |
| R094 |
|
| 99.07 |
| Matched |
| Matched |
| R096 |
|
| Matched |
| 98.86 |
| Matched |
| R097 |
|
| Matched |
| 97.04 |
| Matched |
| R098 |
|
| Matched |
| Matched |
| 97.84 |
| R104 |
|
| Matched |
| 99.07 |
| Matched |
| R105 |
|
| Matched |
| 99.07 |
| Matched |
| R107 |
|
| Matched |
| 99.19 |
| Matched |
| R108 |
|
| Matched |
| 99.19 |
| Matched |
| R112 |
|
| Matched |
| 98.32 |
| Matched |
| R116 |
|
| Matched |
| 99.86 |
| Matched |
| R123 |
|
| 99.26 |
| Matched |
| Matched |
| R133 |
|
| Matched |
| 99.31 |
| Matched |
| R140 |
|
| Matched |
| 99.38 |
| Matched |
| R141 |
|
| Matched |
| 93.33 |
| Matched |
| R148 |
|
| Matched |
| 99.19 |
| Matched |
| R149 |
|
| Matched |
| 98.48 |
| Matched |
| R151 |
|
| Matched |
| 99.86 |
| Matched |
| R153 |
|
| Matched |
| 94.67 |
| Matched |
| R167 |
|
| Matched |
| 99.19 |
| Matched |
| R175 |
|
| Matched |
| 98.07 |
| Matched |
| R176 |
|
| Matched |
| Matched |
| 100.00 |
| R178 |
|
| Matched |
| Matched |
| 97.82 |
| R179 |
|
| Matched |
| 97.16 |
| Matched |
| R180 |
|
| Matched |
| 94.67 |
| Matched |
| R182 |
|
| Matched |
| 98.34 |
| Matched |
| R183 |
|
| Matched |
| 97.15 |
| Matched |
| R190 |
|
| Matched |
| 97.16 |
| Matched |
| R192 |
|
| Matched |
| 98.85 |
| Matched |
| R204 |
|
| Matched |
| 98.10 |
| Matched |
| R205 |
|
| Matched |
| 93.74 |
| Matched |
| R206 |
|
| Matched |
| 98.16 |
| Matched |
| R208 |
|
| Matched |
| 99.15 |
| Matched |
| R210 |
|
| Matched |
| 97.37 |
| Matched |
Mismatched taxa are underlined. Sanger, Illumina, and Nanopore 16S rRNA sequencing results are shown.
Diagnostic accuracies of the Sanger, Illumina, and Nanopore 16S rRNA sequencing methods
| Sequencing method | No. of samples analyzed | No. of samples with matched taxa | Diagnostic accuracy (%) | 95% CI | |
|---|---|---|---|---|---|
| Sanger 16S | 165 | 159 | 96.36 | 92.25–98.65 | |
| Resolved Illumina 16S | 165 | 115 | 69.70 | 62.07–76.60 | <0.0001* |
| Analyzed by MSR | 165 | 59 | 35.76 | 28.46–43.58 | |
| Analyzed by Illumina_BLAST+ | 165 | 118 | 71.52 | 63.98–78.26 | |
| Resolved Nanopore 16S | 165 | 159 | 96.36 | 92.25–98.65 | 0.0291* |
| Analyzed by Epi2ME | 165 | 147 | 89.09 | 83.31–93.41 | |
| Analyzed by NanoBLAST+ | 165 | 148 | 89.70 | 84.02–93.88 |
Discordant samples between MSR and Illumina_BLAST+ were analyzed by nf-core/ampliseq; classified species in nf-core/ampliseq were considered resolved identities in Illumina workflow.
Discordant samples between Epi2me and NanoBLAST+ were analyzed by NanoCLUST; classified species in NanoCLUST were considered resolved identities in Nanopore workflow.
CI, confidence interval.
*, P < 0.05, statistically significantly different from Sanger 16S results.
FIG 416S rRNA gene sequencing workflow of the HTS technologies.
FIG 5Analysis workflow of sequencing data from each platform. (a) Results from Illumina and Nanopore platforms are compared to the Sanger 16S result. (b) Composite reference standard. (c) Calculating diagnostic accuracy.