| Literature DB >> 29971047 |
Jednipit Borthong1, Ryosuke Omori1,2, Chihiro Sugimoto3,4, Orasa Suthienkul5, Ryo Nakao6, Kimihito Ito1,4,5.
Abstract
Metagenomic analysis has become a powerful tool to analyze bacterial communities in environmental samples. However, the detection of a specific bacterial species using metagenomic analysis remains difficult due to false positive detections of sequences shared between different bacterial species. In this study, 16S rRNA amplicon and shotgun metagenomic analyses were conducted on samples collected along a stream and ponds in the campus of Hokkaido University. We compared different database search methods for bacterial detection by focusing on Legionella pneumophila. In this study, we used L. pneumophila-specific nested PCR as a gold standard to evaluate the results of the metagenomic analysis. Comparison with the results from L. pneumophila-specific nested PCR indicated that a blastn search of shotgun reads against the NCBI-NT database led to false positive results and had problems with specificity. We also found that a blastn search of shotgun reads against a database of the catalase-peroxidase (katB) gene detected L. pneumophila with the highest area under the receiver operating characteristic curve among the tested search methods; indicating that a blastn search against the katB gene database had better diagnostic ability than searches against other databases. Our results suggest that sequence searches targeting long genes specifically associated with the bacterial species of interest is a prerequisite to detecting the bacterial species in environmental samples using metagenomic analyses.Entities:
Keywords: Legionella pneumophila; bacteria; detection; metagenomic analysis; receiver operating characteristic curve; water-borne diseases
Year: 2018 PMID: 29971047 PMCID: PMC6018159 DOI: 10.3389/fmicb.2018.01272
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary of next generation sequencing reads.
| Methods | Samples | Number of raw reads | Number of passed-QC reads | Number of reads hit with database | Number of reads identified as bacteria by MEGAN | Average length of bacterial reads |
|---|---|---|---|---|---|---|
| 16S rRNA | HKU_A | 46,968 | 39,245 | 39,166a | 35,404 | 455.7 |
| analysis | HKU_B | 29,684 | 25,183 | 25,093a | 24,711 | 453.3 |
| HKU_C | 39,167 | 32,628 | 32,534a | 32,256 | 450.5 | |
| HKU_D | 35,360 | 28,564 | 28,504a | 28,409 | 461.5 | |
| HKU_E | 32,936 | 25,826 | 25,735a | 25,654 | 464.9 | |
| HKU_F | 29,649 | 24,215 | 24,143a | 24,063 | 465.4 | |
| HKU_G | 43,416 | 33,846 | 33,692a | 33,636 | 466.8 | |
| HKU_H | 28,235 | 21,948 | 21,852a | 21,735 | 462.8 | |
| HKU_I | 38,581 | 30,646 | 30,554a | 30,510 | 468.5 | |
| HKU_J | 29,917 | 25,313 | 25,232a | 25,065 | 473.4 | |
| Shotgun | HKU_A | 1,554,614 | N/A | 318,064b | 309,063 | 288.8 |
| analysis | HKU_B | 5,291,304 | N/A | 1,628,823b | 1,600,198 | 293.1 |
| HKU_C | 7,078,858 | N/A | 2,354,608b | 2,323,879 | 289.7 | |
| HKU_D | 5,430,216 | N/A | 1,891,874b | 1,873,938 | 284.5 | |
| HKU_E | 6,046,758 | N/A | 2,283,330b | 2,264,076 | 285.4 | |
| HKU_F | 6,350,502 | N/A | 2,024,966b | 2,006,002 | 284.7 | |
| HKU_G | 4,992,354 | N/A | 1,769,319b | 1,752,738 | 285.8 | |
| HKU_H | 6,039,572 | N/A | 1,896,294b | 1,872,777 | 286.3 | |
| HKU_I | 4,581,078 | N/A | 1,729,619b | 1,714,434 | 285.5 | |
| HKU_J | 3,796,880 | N/A | 1,719,066b | 1,710,319 | 289.7 |
Number of shotgun reads identified as Legionella pneumophila by MEGAN, Kraken, CLARK, VFDB, and mip gene.
| Samples | Nested PCR targeting the | Number of shotgun reads identified as | ||||
|---|---|---|---|---|---|---|
| MEGAN | Kraken | CLARK | VFDB | |||
| HKU_A | Negative | 130 | 90 | 99 | 0 | 0 |
| HKU_B | Negative | 220 | 136 | 125 | 19 | 0 |
| HKU_C | Negative | 200 | 134 | 117 | 22 | 0 |
| HKU_D | Negative | 135 | 63 | 63 | 27 | 0 |
| HKU_E | Negative | 129 | 45 | 45 | 59 | 0 |
| HKU_F | Negative | 159 | 71 | 83 | 28 | 0 |
| HKU_G | Positive | 100 | 27 | 42 | 28 | 0 |
| HKU_H | Positive | 178 | 75 | 79 | 24 | 0 |
| HKU_I | Positive | 81 | 30 | 40 | 18 | 0 |
| HKU_J | Negative | 104 | 86 | 34 | 3 | 0 |
Number of shotgun reads identified as Legionella pneumophila using a blastn search against custom databases of virulence factor genes associated with Legionella pneumophila.
| Samples | Number of reads identified as | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| (789 bp)∗ | (1950 bp)∗ | (3144 bp)∗ | (2349 bp)∗ | (1089 bp)∗ | (2193 bp)∗ | (1032 bp)∗ | (2202 bp)∗ | (588 bp)∗ | |
| HKU_A | 0 / 123 | 0 / 193 | 0 / 0 | 0 / 0 | 1 / 120 | 0 / 226 | 0 / 176 | 0 / 71 | 0 / 77 |
| HKU_B | 2 / 172 | 0 / 317 | 2 / 4 | 0 / 18 | 1 / 113 | 0 / 733 | 2/ 1423 | 1 / 480 | 0 / 370 |
| HKU_C | 3 / 182 | 0 / 373 | 0 / 5 | 1 / 6 | 0 / 67 | 2 / 942 | 0 / 2440 | 0 / 742 | 0 / 529 |
| HKU_D | 0 / 279 | 0 / 595 | 0 / 0 | 0 / 0 | 0 / 10 | 2 / 458 | 0 / 637 | 0 / 452 | 0 / 603 |
| HKU_E | 1 / 386 | 0 / 704 | 0 / 0 | 0 / 0 | 0 / 15 | 5 / 596 | 0 / 1019 | 0 / 507 | 0 / 603 |
| HKU_F | 0 / 356 | 0 / 672 | 0 / 1 | 0 / 0 | 0 / 19 | 2 / 520 | 0 / 793 | 0 / 494 | 1 / 665 |
| HKU_G | 0 / 300 | 0 / 456 | 0 / 0 | 0 / 0 | 0 / 7 | 4 / 390 | 0 / 512 | 0 / 446 | 0 / 523 |
| HKU_H | 0 / 346 | 0 / 559 | 0 / 0 | 0 / 0 | 0 / 7 | 2 / 405 | 0 / 615 | 0 / 460 | 0 / 652 |
| HKU_I | 0 / 299 | 0 / 583 | 0 / 0 | 0 / 0 | 0 / 7 | 1 / 281 | 0 / 500 | 0 / 423 | 0 / 533 |
| HKU_J | 0 / 374 | 0 / 795 | 0 / 0 | 0 / 0 | 0 / 3 | 0 / 197 | 0 / 196 | 0 / 577 | 0 / 380 |