| Literature DB >> 27857707 |
Ditte A Søborg1, Niels B Hendriksen2, Mogens Kilian3, Jan H Christensen4, Niels Kroer5.
Abstract
The occurrence and distribution of clinically relevant bacterial virulence genes across natural (non-human) environments is not well understood. We aimed to investigate the occurrence of homologs to bacterial human virulence genes in a variety of ecological niches to better understand the role of natural environments in the evolution of bacterial virulence. Twenty four bacterial virulence genes were analyzed in 46 diverse environmental metagenomic datasets, representing various soils, seawater, freshwater, marine sediments, hot springs, the deep-sea, hypersaline mats, microbialites, gutless worms and glacial ice. Homologs to 16 bacterial human virulence genes, involved in urinary tract infections, gastrointestinal diseases, skin diseases, and wound and systemic infections, showed global ubiquity. A principal component analysis did not demonstrate clear trends across the metagenomes with respect to occurrence and frequency of observed gene homologs. Full-length (>95%) homologs of several virulence genes were identified, and translated sequences of the environmental and clinical genes were up to 50-100% identical. Furthermore, phylogenetic analyses indicated deep branching positions of some of the environmental gene homologs, suggesting that they represent ancient lineages in the phylogeny of the clinical genes. Fifteen virulence gene homologs were detected in metatranscriptomes, providing evidence of environmental expression. The ubiquitous presence and transcription of the virulence gene homologs in non-human environments point to an important ecological role of the genes for the activity and survival of environmental bacteria. Furthermore, the high degree of sequence conservation between several of the environmental and clinical genes suggests common ancestral origins.Entities:
Keywords: bacteria; environmental metagenomes; evolution; in silico; non-host; virulence
Year: 2016 PMID: 27857707 PMCID: PMC5093120 DOI: 10.3389/fmicb.2016.01712
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Bacterial human virulence genes used for TBLASTN analysis of environmental metagenomic datasets.
| Shiga-like toxin 1 | 315 | ACU32680.1 | |
| Shiga-like toxin 2 | 319 | YP_003234845.1 | |
| Vacuolating cytotoxin | 1296 | YP_002301516.1 | |
| Haemolysin A | 1024 | NP_755445.1 | |
| Hyaluronidase | 1066 | NP_344851.1 | |
| Pneumolysin | 471 | NP_346351.1 | |
| Gamma-haemolysin component C precursor | 325 | YP_041861.1 | |
| P fimbriae | 839 | NP_755465.1 | |
| P fimbriae | 199 | NP_755467.1 | |
| P fimbriae | 195 | NP_755466.1 | |
| Type 1 fimbriae | 303 | NP_757248.1 | |
| F1C fimbriae | 167 | NP_753158.1 | |
| Type III, secretion apparatus | 685 | YP_151923.1 | |
| Type III, secretion apparatus (SPI | 497 | YP_150710.1 | |
| Type III, secretion apparatus (SPI | 403 | YP_150709.1 | |
| Type III, translocators and effectors (SPI | 593 | YP_151912.1 | |
| Type III, translocators and effectors (SPI | 409 | YP_151911.1 | |
| Regulator of virulence determinants | 224 | YP_150858.1 | |
| Regulator of virulence determinants | 487 | YP_150859.1 | |
| Part of O-antigen 111 | 149 | AAC44881.1 | |
| Part of O-antigen 157 | 364 | NP_310868.1 | |
| L-Ara4N transferase | 548 | YP_149876.1 | |
| L-Ara4N transferase | 550 | NP_754685.1 | |
| Penicillin-binding protein | 668 | YP_039515.1 |
Salmonella Pathogenicity Island (SPI).
Frequency of putative bacterial human virulence genes in environmental metagenomic datasets.
| Microbialites (Marine microbial mat) | 20 | 1 | |||||||||||||||
| FLAS (Saltern) | 420 | 840 | 2 | ||||||||||||||
| LineIsland (Seawater) | 4 | 4 | 2 | ||||||||||||||
| PeruMarginSediment (Seawater) | 16 | 16 | 2 | ||||||||||||||
| Mountain lake (Freshwater) | 37 | 74 | 258 | 3 | |||||||||||||
| GeneExpression (Seawater) | 81 | 162 | 162 | 3 | |||||||||||||
| Contaminated soil | 23 | 69 | 46 | 3 | |||||||||||||
| AmazonRiverPlume (Estuarine) | 58 | 43 | 1 | 41 | 4 | ||||||||||||
| Sapelo2008 (Seawater) | 186 | 127 | 17 | 214 | 4 | ||||||||||||
| SargassoSea (Seawater) | 498 | 134 | 197 | 8 | 4 | ||||||||||||
| DeepMed (Seawater) | 555 | 416 | 1527 | 139 | 4 | ||||||||||||
| Yellowstone (Hot spring) | 3554 | 135 | 779 | 34 | 4 | ||||||||||||
| AcidMine (Biofilm) | 212 | 734 | 28 | 295 | 34 | 5 | |||||||||||
| PacificOcean (Seawater) | 4 | 106 | 85 | 1 | 51 | 5 | |||||||||||
| PBSM (Marine beach sand) | 167 | 167 | 834 | 500 | 167 | 5 | |||||||||||
| MontereyBay transect (Seawater) | 33 | 101 | 2 | 512 | 2 | 6 | 6 | ||||||||||
| HOT:454 (Seawater) | 37 | 177 | 4 | 956 | 5 | 10 | 6 | ||||||||||
| N. Pac. Line67 (Seawater) | 1 | 15 | 40 | 29 | 2 | 1 | 6 | ||||||||||
| BotanyBay:Sanger (Seawater) | 13 | 3 | 764 | 81 | 527 | 61 | 6 | ||||||||||
| AlvinellaPompejana (Hydrothermal vent) | 10 | 4 | 286 | 1078 | 28 | 65 | 6 | ||||||||||
| HydrothermalVent | 101 | 182 | 706 | 40 | 948 | 20 | 6 | ||||||||||
| TermiteGut | 16 | 246 | 410 | 66 | 689 | 98 | 6 | ||||||||||
| SalternMetagenome | 5 | 7 | 35 | 14 | 2 | 5 | 14 | 7 | |||||||||
| MontereyBay (Seawater) | 123 | 337 | 7 | 88 | 4 | 4 | 21 | 7 | |||||||||
| GuaymasBasin (Hydrothermal Vent) | 1 | 49 | 122 | 27 | 729 | 8 | 34 | 7 | |||||||||
| Bacterioplankton (Seawater) | 3 | 110 | 3 | 250 | 3 | 503 | 3 | 7 | 8 | ||||||||
| BATS (Seawater) | 1 | 1 | 51 | 151 | 3 | 445 | 2 | 6 | 8 | ||||||||
| BermudaOceanic (Seawater) | 4 | 92 | 454 | 33 | 1141 | 2 | 4 | 122 | 8 | ||||||||
| IceMetagenome (Freshwater ice) | 4 | 4 | 138 | 960 | 13 | 605 | 13 | 21 | 8 | ||||||||
| HypersalineMat (Seawater) | 83 | 142 | 1460 | 107 | 973 | 12 | 12 | 71 | 8 | ||||||||
| CellCapture (Marine sediment) | 63 | 333 | 10 | 1553 | 135 | 1053 | 156 | 177 | 42 | 9 | |||||||
| Drifting-ESP (Seawater) | 0.45 | 0.45 | 0.23 | 21 | 58 | 1 | 152 | 2 | 2 | 9 | |||||||
| GutlessWorm | 16 | 3 | 92 | 658 | 48 | 705 | 10 | 32 | 86 | 9 | |||||||
| HOT:Sanger (Seawater) | 8 | 61 | 1 | 653 | 26 | 1000 | 11 | 1 | 11 | 9 | |||||||
| WesternChannelOMM (Seawater) | 5 | 71 | 1 | 332 | 27 | 787 | 14 | 4 | 4 | 56 | 10 | ||||||
| MILOCO:454Shotgun (Seawater) | 0.25 | 0.06 | 85 | 1 | 275 | 16 | 842 | 10 | 0.19 | 1 | 71 | 11 | |||||
| Washington Lake (Freshwater) | 88 | 5 | 126 | 14 | 1643 | 144 | 824 | 19 | 23 | 42 | 84 | 11 | |||||
| BotanyBay:454 (Seawater) | 2 | 1 | 33 | 1 | 414 | 19 | 352 | 1 | 1 | 1 | 54 | 11 | |||||
| AntarcticaAquatic:Shotgun (Seawater) | 1 | 1 | 41 | 2 | 36 | 7 | 661 | 3 | 1 | 3 | 24 | 11 | |||||
| YLAKE (Hot spring) | 0.25 | 6 | 1 | 69 | 0.12 | 427 | 7 | 543 | 1 | 4 | 8 | 47 | 13 | ||||
| AntarcticaAquatic:454 (Seawater) | 0.44 | 0.04 | 0.13 | 60 | 1 | 250 | 15 | 783 | 4 | 4 | 4 | 52 | 13 | ||||
| BisonMetagenome (Hot spring) | 11 | 2 | 95 | 8 | 144 | 11 | 1242 | 47 | 679 | 6 | 28 | 83 | 44 | 13 | |||
| MILOCO:454 (Seawater) | 0.08 | 1 | 18 | 2 | 4 | 19 | 2 | 80 | 7 | 136 | 1 | 8 | 10 | 5 | 14 | ||
| GOS (Seawater) | 4 | 0.08 | 4 | 19 | 0.16 | 0.16 | 92 | 5 | 685 | 53 | 902 | 4 | 3 | 6 | 5 | 16 | |
| DayNight (Seawater) | 0 | ||||||||||||||||
| SapeloIsland (Seawater) | 0 | ||||||||||||||||
| No. of metagenomes | 24 | 5 | 4 | 1 | 13 | 3 | 4 | 40 | 16 | 42 | 34 | 43 | 22 | 17 | 22 | 29 | |
The prevalence of the individual genes in the different metagenomes were normalized to the metagenomic library sizes, i.e., reported values are frequencies per billion bases sequenced. Virulence genes not detected in any of the metagenomic datasets were ply, vacA, stx1, stx2, hlgB, sipB, sipC, and ssaD. Please refer to Supplementary Table .
Detection (BLASTN; E-value < 10.
Figure 1Rarefaction plot showing numbers of detected virulence genus as function of size (numbers of bases sequenced) of metagenomic datasets.
Figure 2Global richness map of putative bacterial human virulence genes in environmental metagenomic datasets. Circles represent findings of genes, and areas of circles the number of virulence genes detected.
Figure 3Bi-plot of principal component 1 (PC1; 26.4% explained variance) and 2 (PC2; 14.2% explained variance) of the PCA model. Metagenomes are marked in blue, while virulence gene homologs are marked in red.
Figure 4Minimum evolution distance tree (A) and maximum parsimony tree (B) of environmental sequences homologous to the protein sequence of Only environmental sequences with a coverage >95% (>141 aa) are included. The clinical sequence of wbdI is shown in red boldface type. Bootstrap values above 50% are displayed.
Figure 5Minimum evolution distance tree (A) and maximum parsimony tree (B) of environmental sequences homologous to the protein sequence of . Only environmental sequences with a coverage >40% (>220 aa) are included. The clinical sequence of yfbI is shown in red boldface type. Bootstrap values above 50% are displayed.