| Literature DB >> 27371583 |
Shuai Zhi1, Graham Banting1, Qiaozhi Li1, Thomas A Edge2, Edward Topp3, Mykola Sokurenko1, Candis Scott1, Shannon Braithwaite1, Norma J Ruecker4, Yutaka Yasui1, Tim McAllister5, Linda Chui6, Norman F Neumann7.
Abstract
UNLABELLED: Escherichia coli has been proposed to have two habitats-the intestines of mammals/birds and the nonhost environment. Our goal was to assess whether certain strains of E. coli have evolved toward adaptation and survival in wastewater. Raw sewage samples from different treatment plants were subjected to chlorine stress, and ∼59% of the surviving E. coli strains were found to contain a genetic insertion element (IS30) located within the uspC-flhDC intergenic region. The positional location of the IS30 element was not observed across a library of 845 E. coli isolates collected from various animal hosts or within GenBank or whole-genome reference databases for human and animal E. coli isolates (n = 1,177). Phylogenetics clustered the IS30 element-containing wastewater E. coli isolates into a distinct clade, and biomarker analysis revealed that these wastewater isolates contained a single nucleotide polymorphism (SNP) biomarker pattern that was specific for wastewater. These isolates belonged to phylogroup A, possessed generalized stress response (RpoS) activity, and carried the locus of heat resistance, features likely relevant to nonhost environmental survival. Isolates were screened for 28 virulence genes but carried only the fimH marker. Our data suggest that wastewater contains a naturalized resident population of E. coli We developed an endpoint PCR targeting the IS30 element within the uspC-flhDC intergenic region, and all raw sewage samples (n = 21) were positive for this marker. Conversely, the prevalence of this marker in E. coli-positive surface and groundwater samples was low (≤5%). This simple PCR assay may represent a convenient microbial source-tracking tool for identification of water samples affected by municipal wastewater. IMPORTANCE: The results of this study demonstrate that some strains of E. coli appear to have evolved to become naturalized populations in the wastewater environment and possess a number of stress-related genetic elements likely important for survival in this nonhost environment. The presence of non-host-adapted strains in wastewater challenges our understanding of using E. coli as a microbial indicator of wastewater treatment performance, suggesting that the E. coli strains present in human and animal feces may be very different from those found in treated wastewater.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27371583 PMCID: PMC5007776 DOI: 10.1128/AEM.00143-16
Source DB: PubMed Journal: Appl Environ Microbiol ISSN: 0099-2240 Impact factor: 4.792
Presence of the uspC-IS30-flhDC marker in E. coli isolates or populations from various animal and environmental sources
| Animal, human, or environmental source of | No. of isolates tested | No. (%) of isolates with IS |
|---|---|---|
| Bovine | 120 | 0 (0) |
| Bovine ( | 41 | 0 (0) |
| Cat | 21 | 0 (0) |
| Dog | 61 | 0 (0) |
| Deer | 48 | 0 (0) |
| Goose | 54 | 0 (0) |
| Human | 105 | 0 (0) |
| Human ( | 1,107 | 0 (0) |
| Chicken | 59 | 0 (0) |
| Moose | 14 | 0 (0) |
| Muskrat | 56 | 0 (0) |
| Horse | 44 | 0 (0) |
| Pig | 49 | 0 (0) |
| Coyote | 44 | 0 (0) |
| Gull | 18 | 0 (0) |
| Beaver | 40 | 0 (0) |
| Sheep | 47 | 0 (0) |
| Other animals ( | 29 | 0 (0) |
| Wastewater (chlorine treated) | 70 | 41 (59) |
| Wastewater (raw) | 319 | 16 (5) |
| Surface water | 187 | 1 (0.5) |
“In silico” refers to BLAST sequence analysis of E. coli genomes in GenBank.
Includes chickens and pigs.
Chlorine-treated E. coli isolates originated from four different wastewater treatment plants in Alberta, Canada.
Raw wastewater isolates were collected from various wastewater plants in Alberta (post-grit removal).
Isolates from surface water were collected from various rivers and lakes in Alberta and were represented by one isolate per source.
PCR primers for amplification of virulence genes
| Primer set | Gene target | Primer name | Primer sequence (5′-3′) | Product size (bp) | Reference |
|---|---|---|---|---|---|
| I | aer-F | TACCGGATTGTCATATGCAGACCGT | 601 | ||
| aer-R | AATATCTTCCTCCAGTCCGGAGAAG | ||||
| papC-F | GTGGCAGTATGAGTAATGACCGTTA | 202 | |||
| papC-R | ATATCCTTTCTGCAGGGATGCAATA | ||||
| traT-F | GGTGTGGTGCGATGAGCACAG | 287 | |||
| traT-R | CACGGTTCAGCCATCCCTGAG | ||||
| II | PAI | PAI-F | GGACATCCTGTTACAGCGCGCA | 921 | |
| PAI-R | TCGCCACCAATCACAGCCGAAC | ||||
| fimH-F | TGCAGAACGGATAAGCCGTGG | 505 | |||
| fimH-R | GCAGTCACCTGCCCTCCGGTA | ||||
| III | iroN-F | AAGTCAAAGCAGGGGTTGCCCG | 667 | ||
| iroN-R | GACGCCGACATTAAGACGCAG | ||||
| iutA-F | GGCTGGACATCATGGGAACTGG | 301 | |||
| iutA-R | CGTCGGGAACGGGTAGAATCG | ||||
| ibeA-F | AGGCAGGTGTGCGCCGCGTAC | 169 | |||
| ibeA-R | TGGTGCTCCGGCAAACCATGC | ||||
| IV | cnfl-F | AAGATGGAGTTTCCTATGCAGGAG | 497 | ||
| cnfl-R | CATTCAGAGTCCTGCCCTCATTATT | ||||
| papGII-F | GGGATGAGCGGGCCTTTGAT | 189 | |||
| papGII-R | CGGGCCCCCAAGTAACTCG | ||||
| V | fuyA-F | TGATTAACCCCGCGACGGGAA | 784 | ||
| fuyA-R | CGCAGTAGGCACGATGTTGTA | ||||
| papGIII-F | GGCCTGCAATGGATTTACCTGG | 257 | |||
| papGIII-R | CCACCAAATGACCATGCCAGAC | ||||
| VI | sfa/foc-F | CTCCGGAGAACTGGGTGCATCTTAC | 407 | ||
| sfa/foc-R | CGGAGGAGTAATTACAAACCTGGCA | ||||
| hlyA-F | AACAAGGATAAGCACTGTTCTGGCT | 1,176 | |||
| hlyA-R | ACCATATAAGCGGTCATTCCCGTCA | ||||
| VII | iha-F | CTGGCGGAGGCTCTGAGATCA | 826 | ||
| iha-R | TCCTTAAGCTCCCGCGGCTGA | ||||
| VIII | AIDA-I-F | TGCAAACATTAAGGGCTCG | 370 | ||
| AIDA-I-R | CCGGAAACATTGACCATACC | ||||
| aidA-F | CAGTTTATCAATCAGCTCGGG | 450 | |||
| aidA-R | CCACCGTTCCGTTATCCTC | ||||
| aah-F | CTGGGTGACATTATTGCTTGG | 543 | |||
| aah-R | TTTGCTTGTGCGGTAGACTG | ||||
| IX | LT | LTA-F | GGCGACAGATTATACCGTGC | 696 | |
| LTA-R | CCGAATTCTGTTATATATGTC | ||||
| STb-F | ATCGCATTTCTTCTTGCATC | 172 | |||
| STb-R | GGGCGCCAAAGCATGCTCC | ||||
| X | hra-F | CAGAAAACAACCGGTATCAG | 257 | ||
| hra-R | ACCAAGCATGATGTCATGAC | ||||
| XI | eaeA-F | GACCCGGCACAAGCATAAGC | 384 | ||
| eaeA-R | CCACCTGCAGCAACAAGAGG | ||||
| XII | Sta-F | TCTTTCCCCTCTTTTAGTCAG | 166 | ||
| Sta-R | ACAGGCAGGATTACAACAAAG | ||||
| XIII | ipaH-III | GTTCCTTGACCGCCTTTCCGATACCGTC | 600 | ||
| ipaH-IV | GCCGGTCAGCCACCCTCTGAGATAC | ||||
| XIV | usp-F | CGGCTCTTACATCGGTGCGTTG | 614 | ||
| usp-R | GACATATCCAGCCAGCGAGTTC | ||||
| XV | irp2-F | AAGGATTCGCTGTTACCGGAC6 | 285 | ||
| irp2-R | TCGTCGGGCAGCGTTTCTTCT | ||||
| XVI | stx1-F | CATCGCGAGTTGCCAGAAT | 78 | ||
| stx1-R | GCGTAATCCCACGGACTCTTC | ||||
| stx1-P | FAM-CTGCCGGACACATAGAAGGAAACTCATCA-TAMRA | ||||
| XVII | stx2-F | CCGGAATGCAAATCAGTC | 113 | ||
| stx2-R | CAGTGACAAAACGCAGAACT | ||||
| stx2-P | FAM-ACTGAACTCCATTAACGCCAGATATGA-TAMRA |
Probe sequences (stx1-P and stx2-P) are also listed. FAM, 6-carboxyfluorescein; TAMRA, 6-carboxytetramethylrhodamine.
PCR conditions for amplification of virulence genes
| Primer set(s) | Initial denaturation | No. of cycles | Cycling conditions | Final extension | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Denaturation | Annealing | Extension | |||||||||
| Temp (°C) | Time (min) | Temp (°C) | Time (s) | Temp (°C) | Time (s) | Temp (°C) | Time (s) | Temp (°C) | Time (min) | ||
| I to VII, XIV, XV | 94 | 5 | 33 | 94 | 30 | 63 | 30 | 72 | 60 | 72 | 7 |
| VIII | 94 | 5 | 33 | 94 | 60 | 63 | 60 | 72 | 30 | 72 | 10 |
| IX | 94 | 5 | 30 | 94 | 60 | 58 | 60 | 72 | 60 | 72 | 7 |
| X | 94 | 5 | 33 | 94 | 30 | 55 | 30 | 72 | 30 | 72 | 7 |
| XI | 94 | 5 | 30 | 94 | 60 | 58 | 60 | 72 | 40 | 72 | 7 |
| XII | 94 | 5 | 30 | 94 | 60 | 53 | 60 | 72 | 30 | 72 | 7 |
| XIII | 94 | 5 | 33 | 94 | 30 | 50 | 30 | 68 | 60 | 72 | 7 |
The primer sets in this table correspond to the primer sets listed in Table 2.
PCR primers targeting the intergenic regions and the LHR
| Target | Primer | Primer sequence (5′-3′) | Reference |
|---|---|---|---|
| ompF-F | TACGTGATGTGATTCCGTTC | ||
| ompF-R | TGTTATAGATTTCTGCAGCG | ||
| csgD-1 | GGACTTCATTAAACATGATG | ||
| csgD-2 | TGTTTTTCATGCTGTCAC | ||
| flhDC-F | GAGGTATGCATTATTCCCACCC | ||
| flhDC-R | TGGAGAAACGACGCAATC | ||
| flh-IS-F | CGGGGAACAAATGAGAACAC | This study | |
| flh-IS-R | TGGAGAAACGACGCAATC | ||
| LHR fragment A | HR-F1 | TTAGGTACCGCTGTCCATTGCCTGA | |
| HS-R1 | AGACCAATCAGGAAATGCTCTGGACC | ||
| LHR fragment B | HR-F2.2 | GAGGTACCTGTCTTGCCTGACAACGTTG | |
| HR-R2 | TATCTAGAATGTCATTTCTATGGAGGCATGAATCG | ||
| LHR fragment C | HS-F1 | GCAATCCTTTGCCGCAGCTATT | |
| HR-R3 | GTCAAGCTTCTAGGGCTCGTAGTTCG |
FIG 1Unrooted maximum likelihood phylogenetic tree encompassing 848 E. coli strains obtained from chlorine-treated wastewater (68 isolates) and 15 animal host groups (780 isolates) (Table 2) and based on an analysis of the concatenated DNA sequences of two intergenic regions (csgBAC-csgDEFG and asnS-ompF). Of the 70 chlorine-tolerant wastewater E. coli isolates, two failed to produce PCR products for the csgBAC-csgDEFG intergenic region. The colored ring that overlays the unrooted maximum likelihood tree indicates different E. coli groups, as defined by (i) human/animal host sources (yellow), (ii) wastewater isolates possessing the uspC-IS30-flhDC marker (blue), and (iii) wastewater isolates not possessing the uspC-IS30-flhDC marker (red). Note that all chlorine-tolerant wastewater isolates that possessed the uspC-IS30-flhDC marker (and which were isolated from 4 different WWTPs) grouped together within a single clade.
Logic regression-based SNP analysis of E. coli samples classified according to isolation source
| Source | No. of samples | Logic regression | Fivefold cross-validation | ||
|---|---|---|---|---|---|
| Sensitivity | Specificity | Sensitivity | Specificity | ||
| Wastewater | 68 | 0.82 | 1 | 0.76 | 0.99 |
| Bovine | 120 | 0.38 | 0.98 | 0.21 | 0.96 |
| Cat | 21 | 0.29 | 1 | 0 | 1 |
| Dog | 61 | 0.57 | 0.96 | 0.23 | 0.97 |
| Deer | 48 | 0.88 | 0.95 | 0.37 | 1 |
| Goose | 54 | 0.24 | 0.99 | 0.09 | 0.99 |
| Human | 105 | 0.46 | 1 | 0.41 | 1 |
| Chicken | 59 | 0.68 | 1 | 0.56 | 1 |
| Moose | 14 | 0.29 | 0.99 | 0 | 1 |
| Muskrat | 56 | 0.77 | 0.99 | 0.77 | 0.99 |
| Horse | 44 | 0.11 | 0.99 | 0 | 1 |
| Pig | 49 | 0.39 | 1 | 0.22 | 1 |
| Coyote | 44 | 0.64 | 0.99 | 0.59 | 0.99 |
| Gull | 18 | 0.44 | 0.99 | 0.05 | 0.99 |
| Beaver | 40 | 0.45 | 0.99 | 0.35 | 1 |
| Sheep | 47 | 0.49 | 1 | 0.37 | 1 |
Analysis was performed on the csgBAC-csgDEFG (listed as “csg” in the model) and asnS-ompF (listed as “omp” in the model) intergenic regions as described in Materials and Methods. The logic regression-based biomarker model for E. coli isolates from wastewater was as follows: logic E[Y] = 3.33 − 19.9 × {(omp47_T or csg1122_G) or [(not omp564_C) or (not omp335_A)]} − 7.01 × {[(not csg1180_G) and omp440_T and (not omp81_T)] and [csg993_G and omp481_G and (not omp423_A)]}. In this wastewater logic regression model, E[Y] is the probability of an E. coli isolate originating from a wastewater source. In reading the model output, a value designated “omp47_T” refers to the thymine base at nucleotide position 47 in the multisequence alignment of the asnS-ompF intergenic region. For further interpretation of model variables, see the work of Zhi et al. (20).
Prevalence of the uspC-IS30-flhDC marker in E. coli-positive surface water, drinking water, and wastewater samples
| Source of | No. of samples | No. (%) of marker-positive samples |
|---|---|---|
| Wastewater (total) | 50 | 43 (92) |
| Untreated wastewater (post-grit removal) | 21 | 21 (100) |
| Secondarily treated wastewater | 17 | 16 (94) |
| UV-treated wastewater | 12 | 9 (75) |
| Surface water | 71 | 2 (3) |
| Drinking water (groundwater) | 57 | 3 (5) |
FIG 2PCR amplification of the uspC-IS30-flhDC marker from Colilert-enriched E. coli-positive wastewater, drinking water, and surface water samples. Lanes 2 to 16 represent the PCR results for individual water samples. Lanes 1 and 19, molecular size marker (GeneRuler 100 bp; Thermo Scientific); lanes 2 to 6, wastewater; lanes 7 to 11, drinking water; lanes 12 to 16, surface water; lane 17, positive control (genomic DNA from an E. coli isolate possessing the uspC-IS30-flhDC marker); lane 18, negative control (genomic DNA from an E. coli isolate that did not possess the uspC-IS30-flhDC marker). The PCR amplicon obtained by targeting the uspC-IS30-flhDC marker is 386 bp long.