| Literature DB >> 32967250 |
Astrid P Heikema1, Deborah Horst-Kreft1, Stefan A Boers2, Rick Jansen3, Saskia D Hiltemann3, Willem de Koning3, Robert Kraaij4, Maria A J de Ridder5, Chantal B van Houten6, Louis J Bont6, Andrew P Stubbs3, John P Hays1.
Abstract
Illumina and nanopore sequencing technologies are powerful tools that can be used to determine the bacterial composition of complex microbial communities. In this study, we compared nasal microbiota results at genus level using both Illumina and nanopore 16S rRNA gene sequencing. We also monitored the progression of nanopore sequencing in the accurate identification of species, using pure, single species cultures, and evaluated the performance of the nanopore EPI2ME 16S data analysis pipeline. Fifty-nine nasal swabs were sequenced using Illumina MiSeq and Oxford Nanopore 16S rRNA gene sequencing technologies. In addition, five pure cultures of relevant bacterial species were sequenced with the nanopore sequencing technology. The Illumina MiSeq sequence data were processed using bioinformatics modules present in the Mothur software package. Albacore and Guppy base calling, a workflow in nanopore EPI2ME (Oxford Nanopore Technologies-ONT, Oxford, UK) and an in-house developed bioinformatics script were used to analyze the nanopore data. At genus level, similar bacterial diversity profiles were found, and five main and established genera were identified by both platforms. However, probably due to mismatching of the nanopore sequence primers, the nanopore sequencing platform identified Corynebacterium in much lower abundance compared to Illumina sequencing. Further, when using default settings in the EPI2ME workflow, almost all sequence reads that seem to belong to the bacterial genus Dolosigranulum and a considerable part to the genus Haemophilus were only identified at family level. Nanopore sequencing of single species cultures demonstrated at least 88% accurate identification of the species at genus and species level for 4/5 strains tested, including improvements in accurate sequence read identification when the basecaller Guppy and Albacore, and when flowcell versions R9.4 (Oxford Nanopore Technologies-ONT, Oxford, UK) and R9.2 (Oxford Nanopore Technologies-ONT, Oxford, UK) were compared. In conclusion, the current study shows that the nanopore sequencing platform is comparable with the Illumina platform in detection bacterial genera of the nasal microbiota, but the nanopore platform does have problems in detecting bacteria within the genus Corynebacterium. Although advances are being made, thorough validation of the nanopore platform is still recommendable.Entities:
Keywords: 16S rRNA gene; Corynebacterium; Illumina sequencing; bacterial species; nanopore sequencing; nasal microbiota
Mesh:
Substances:
Year: 2020 PMID: 32967250 PMCID: PMC7565314 DOI: 10.3390/genes11091105
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Nose swab samples of individuals and negative controls that were sequenced using and Illumina and nanopore 16S rRNA gene sequencing technologies. (a) = a maximum of 5000 raw Illumina sequence reads were analyzed for the classification of genera. (b) = samples with read numbers below the 500 read cut-off. NA = not applicable.
| Sample Information | Illumina Technology | Nanopore Technology | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample | Infection | Age | 16S Copies | Raw | Percentage Reads Classified (%) (a) | Genera Identified ≥1% | Genera | Raw | Percentage Reads Classified (%) | Genera Identified ≥1% | Genera |
| 1 | yes | 3.50 | 5 × 105 | 133,880 | 92 | 5 | 4.2 | 34,944 | 77 | 5 | 3.0 |
| 2 | yes | 0.92 | 1 × 105 | 186,250 | 95 | 5 | 1.9 | 15,254 | 79 | 3 | 2.2 |
| 3 | yes | 2.00 | 3 × 105 | 1661 | 94 | 5 | 4.1 | 39,474 | 77 | 4 | 2.9 |
| 4 | yes | 1.50 | 3 × 105 | 154,877 | 96 | 7 | 4.6 | 36,608 | 76 | 6 | 2.3 |
| 5 | yes | 9.00 | 3 × 105 | 114,702 | 97 | 5 | 3.5 | 5107 | 59 | 4 | 2.7 |
| 6 | yes | 2.00 | 3 × 105 | 22,805 | 97 | 5 | 2.7 | 31,642 | 52 | 4 | 1.7 |
| 7 | yes | 5.00 | 2 × 105 | 1940 | 88 | 8 | 3.8 | 2246 | 57 | 6 | 3.1 |
| 8 | yes | 4.00 | 3 × 105 | 24,214 | 100 | 4 | 1.2 | 10,174 | 62 | 3 | 1.2 |
| 9 | yes | 1.67 | 4 × 105 | 104,134 | 93 | 9 | 2.5 | 21,462 | 68 | 6 | 2.6 |
| 10 | yes | 8.00 | 2 × 105 | 186,945 | 96 | 3 | 2.5 | 923 | 68 | 2 | 1.6 |
| 11 | yes | 11.00 | 2 × 105 | 120,867 | 95 | 3 | 3.0 | 27,569 | 78 | 3 | 1.6 |
| 12 | yes | 0.42 | 4 × 105 | 25,743 | 98 | 3 | 3.0 | 5127 | 66 | 3 | 2.2 |
| 13 | yes | 15.00 | 4 × 105 | 261,123 | 95 | 4 | 2.7 | 12,572 | 66 | 5 | 2.0 |
| 14 | yes | 2.17 | 1 × 105 | 6246 | 97 | 4 | 3.0 | 20,441 | 89 | 3 | 2.7 |
| 15 | yes | 3.80 | 3 × 105 | 68,095 | 91 | 3 | 2.3 | 27,077 | 90 | 4 | 2.5 |
| 16 | yes | 2.40 | 1 × 105 | 119,295 | 84 | 7 | 2.9 | 2978 | 85 | 6 | 2.6 |
| 17 | yes | 0.80 | 2 × 105 | 74,902 | 96 | 3 | 1.5 | 4408 | 91 | 2 | 1.1 |
| 18 | yes | 61.00 | 3 × 103 | 77,851 | 86 | 6 | 3.4 | 2141 | 82 | 8 | 4.1 |
| 19 | yes | 0.90 | 3 × 105 | 74,730 | 85 | 4 | 2.3 | 20,584 | 82 | 6 | 1.6 |
| 20 | yes | 0.80 | 3 × 105 | 113,078 | 93 | 3 | 2.4 | 10,974 | 91 | 3 | 1.9 |
| 21 | yes | 78.00 | 2 × 105 | 131,837 | 90 | 2 | 1.7 | 21,449 | 93 | 1 | 1.0 |
| 22 | yes | 1.70 | 3 × 105 | 162,890 | 85 | 4 | 2.4 | 23,530 | 92 | 5 | 1.8 |
| 23 | yes | 2.30 | 2 × 105 | 83,596 | 92 | 8 | 4.4 | 15,748 | 88 | 7 | 3.2 |
| 24 | yes | 73.00 | 2 × 105 | 83,947 | 84 | 4 | 2.0 | 3181 | 88 | 5 | 3.3 |
| 25 | yes | 2.60 | 5 × 105 | 28,221 | 92 | 3 | 3.0 | 15,453 | 50 | 3 | 3.1 |
| 26 | yes | 65.00 | 3 × 105 | 77,012 | 82 | 7 | 4.5 | 31,461 | 85 | 6 | 2.8 |
| 27 | yes | 0.80 | 1 × 105 | 58,962 | 85 | 3 | 2.5 | 23,652 | 90 | 3 | 1.5 |
| 28 | yes | 3.00 | 5 × 105 | 57,600 | 86 | 6 | 3.7 | 22,991 | 84 | 7 | 3.4 |
| 29 | yes | 57.00 | 2 × 105 | 129,131 | 94 | 2 | 1.5 | 48,167 | 90 | 1 | 1.1 |
| 30 | yes | 0.40 | 6 × 105 | 180,796 | 88 | 3 | 2.9 | 3997 | 65 | 4 | 2.3 |
| 31 | yes | 0.90 | 4 × 105 | 547,695 | 98 | 4 | 2.7 | 15,626 | 80 | 7 | 1.7 |
| 32 | yes | 23.00 | 8 × 105 | 750,669 | 97 | 3 | 1.8 | 6653 | 67 | 2 | 1.6 |
| 33 | yes | 3.40 | 1 × 105 | 924,890 | 98 | 7 | 3.3 | 25,148 | 74 | 7 | 2.1 |
| 34 | yes | 4.10 | 1 × 106 | 31,896 | 94 | 5 | 4.0 | 15,979 | 49 | 4 | 2.7 |
| 35 | yes | 14.00 | 3 × 105 | 79,970 | 90 | 3 | 2.1 | 40,551 | 88 | 3 | 1.4 |
| 36 | yes | 0.10 | 3 × 105 | 113,047 | 88 | 3 | 1.7 | 50 | 76 | NA | NA |
| 37 (b) | yes | 0.40 | 3 × 105 | 59,397 | 88 | 4 | 2.9 | 51,254 | 63 | 11 | 3.6 |
| 38 | yes | 0.30 | 6 × 105 | 7421 | 99 | 3 | 1.4 | 41,757 | 89 | 2 | 1.4 |
| 39 | yes | 1.10 | 3 × 105 | 121,819 | 86 | 3 | 2.6 | 6340 | 86 | 6 | 1.9 |
| 40 | yes | 0.20 | 2 × 105 | 83,457 | 83 | 4 | 2.4 | 59,923 | 82 | 6 | 1.9 |
| 41 | yes | 4.20 | 4 × 105 | 92,006 | 87 | 4 | 2.9 | 17,785 | 90 | 4 | 2.3 |
| 42 | yes | 0.10 | 1 × 105 | 36,248 | 90 | 4 | 2.0 | 45,047 | 92 | 3 | 1.9 |
| 43 | yes | 0.10 | 2 × 105 | 55,585 | 92 | 5 | 2.3 | 47,084 | 92 | 3 | 1.4 |
| 44 | yes | 0.40 | 3 × 105 | 101,465 | 87 | 5 | 2.7 | 5288 | 80 | 6 | 1.6 |
| 45 | yes | 1.70 | 7 × 105 | 92,476 | 89 | 3 | 1.9 | 49,104 | 55 | 2 | 1.1 |
| 46 | yes | 0.50 | 3 × 105 | 72,068 | 88 | 4 | 2.1 | 50,486 | 80 | 6 | 1.5 |
| 47 | yes | 0.10 | 5 × 105 | 90,128 | 80 | 6 | 4.0 | 107,161 | 91 | 6 | 3.2 |
| 48 (b) | yes | 67.00 | 2 × 105 | 51,826 | 94 | 5 | 1.3 | 8 | 75 | NA | NA |
| 49 | yes | 0.30 | 9 × 105 | 1148 | 82 | 8 | 4.3 | 14,673 | 66 | 3 | 1.5 |
| 50 | yes | 3.30 | 5 × 106 | 39,030 | 83 | 3 | 2.6 | 12,239 | 66 | 3 | 2.0 |
| 51 | yes | 56.00 | 5 × 106 | 2191 | 85 | 7 | 3.3 | 17,248 | 64 | 7 | 2.4 |
| 52 | no | 28.00 | 3 × 105 | 193,859 | 96 | 2 | 1.2 | 6,789 | 91 | 1 | 1.0 |
| 53 | no | 62.00 | 2 × 105 | 262,184 | 89 | 3 | 2.3 | 18,680 | 88 | 2 | 1.8 |
| 54 | no | 8.10 | 2 × 105 | 308,123 | 83 | 5 | 2.8 | 13,741 | 89 | 4 | 2.3 |
| 55 | no | 7.20 | 3 × 105 | 203,242 | 100 | 6 | 3.4 | 15,490 | 84 | 6 | 4.3 |
| 56 | no | 14.90 | 1 × 105 | 235,820 | 92 | 3 | 1.4 | 18,318 | 88 | 8 | 5.0 |
| 57 | no | 5.40 | 9 × 105 | 90,422 | 86 | 3 | 2.8 | 11,207 | 87 | 4 | 2.2 |
| 58 | no | 7.10 | 6 × 105 | 103,176 | 87 | 5 | 2.9 | 19,604 | 73 | 7 | 2.7 |
| 59 | no | 6.40 | 1 × 105 | 111,844 | 93 | 4 | 1.6 | 17,971 | 88 | 3 | 1.1 |
| Average |
|
|
|
|
|
|
|
|
|
|
|
| Control | |||||||||||
| C-1 | NA | NA | <1 × 102 | 6 | 0 | 0 | NA | 7 | 57 | 4 | NA |
| C-2 | NA | NA | <1 × 102 | 1 | 0 | 0 | 0 | 42 | 74 | 8 | NA |
| C-3 | NA | NA | <1 × 102 | 1 | 0 | 0 | 0 | 33 | 42 | 9 | NA |
| C-4 | NA | NA | <1 × 102 | 3 | 0 | 0 | 0 | 35 | 51 | 11 | NA |
| C-5 | NA | NA | <1 × 102 | 2 | 0 | 0 | 0 | 15 | 67 | 3 | NA |
| C-6 | NA | NA | 2 × 102 | 2440 | 98 | 4 | 4 | 56 | 91 | 6 | NA |
| C-7 | NA | NA | 3 × 102 | 3408 | 94 | 18 | 18 | 0 | 0 | 0 | NA |
Figure 1Nasal microbiota profiles generated using nanopore and Illumina 16S rRNA gene sequencing. DNA was isolated from 57 nose swab samples, and 16S rRNA gene sequencing was performed using both Illumina (a) and nanopore (b) technologies. Each bar in the graph represents a nasal microbiota profile from a single individual. The dashed lines in (b) represent genera that, by default, were reported as unclassified at genus level in the EPI2ME report but were identified when next to reads with a top three blast hit with one genera (num_genus_taxid is 1); reads with a top three blast hit with two genera (num_genus_taxid is 2) were also included. A phylogenetic tree was generated by Pearson/UPGMA clustering of bacterial genera in microbiota profiles, as determined using Illumina sequencing. To compare between the two techniques, the sample order of the samples that were sequenced with the Oxford Nanopore platform was matched to the sample order of the samples that were sequenced with the Illumina platform, and the percentage of agreement was calculated for each nose swab sample (c). The horizontal black line in (c) indicates the mean percentage of agreement.
Figure 2Bland–Altman plots of six main genera present in the nasal microbiota. Bland–Altman plots were generated for the six main genera: (a) Corynebacterium, (b) Dolosigranulum, (c) Haemophilus, (d) Moraxella, (e) Staphylococcus, and (f) Streptococcus. For each genus, the mean difference between the two sequence methods (Illumina versus nanopore) and the limits of agreement (95% reference interval) were calculated and shown (g).
Figure 3Agarose gel with 16S rRNA gene amplicons. Total DNA was isolated from pure bacterial cultures in a similar manner as the isolation of DNA from the nasal swab samples; the DNA concentration was determined by picogreen and a PCR was performed as described for nanopore sequencing using equal amounts of template DNA, with the exception that 30 PCR cycli instead of 25 cycli were used.
Figure 4Genus and species level identification on pure culture species. Pure cultures of bacterial ATCC strains were sequenced using an R9.2 or R9.4 nanopore flowcell and Albacore or Guppy basecalling. Taxonomic assignment was performed at genus (a) and species (b) level using the EPI2ME 16S pipeline and the following thresholds: read length ≥1400 bp ≤ 1700 bp, num_genus_taxid is 1 or lca is 0 and accuracy ≥80%, QC ≥ 7 when albacore basecalling was used, or accuracy ≥85%, QC score ≥9 when Guppy basecalling was used. Similar criteria and the highest scoring BLAST identification (top rank) was used for species level identification. A is Albacore; G is Guppy basecalling.